@Akshaya_1397 wrote:
Hi all,
I have two completely different datasets.
Dataset 1, clm, has the variables counterparty, id, name.
Dataset 2, gsl, has the variables splr_name, relid, phn.
Now I need to compare the counterparty name in clm with splr_name in gsl, put the result in a
new dataset, and report the ones that do not match.
Can anyone please help me with this?
Can I simply use proc compare, or is there another way?
It is not clear what you mean by "compare" and "not matching". By default, Proc Compare does an observation-by-observation comparison, so it is likely not the right tool unless the data sets are expected to have the same number of observations (or at least pretty close) with similar values.
Do you want to compare just the properties of the variables, such as length, format and label?
Or do you want the values that appear in one data set but not the other? This code will create two data sets, each with one observation for every value that appears in one data set but not the other. I think the names of the sets explain the contents. Obviously untested, as we do not have your data sets.
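proc sql;
   /* values of counterparty in clm that never appear as splr_name in gsl */
   create table in_clm_not_gsl as
   select distinct counterparty as value from clm
   except
   select distinct splr_name as value from gsl
   ;
   /* values of splr_name in gsl that never appear as counterparty in clm */
   create table in_gsl_not_clm as
   select distinct splr_name as value from gsl
   except
   select distinct counterparty as value from clm
   ;
quit;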
Warning: These are exact equality comparisons. If your variables are character, differences in case will be reported as mismatches, because "BOB" is not the same as "Bob".
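If differences in case or in leading blanks should not count as mismatches, one option is to normalize both variables before the set operation. The following is a minimal, untested sketch. It assumes both variables are character, and the output table name in_clm_not_gsl_nocase is made up for illustration.

```sas
proc sql;
   /* Assumption: counterparty and splr_name are both character. */
   /* upcase() removes case differences; strip() removes leading */
   /* and trailing blanks before the values are compared.        */
   create table in_clm_not_gsl_nocase as
   select distinct upcase(strip(counterparty)) as value from clm
   except
   select distinct upcase(strip(splr_name)) as value from gsl
   ;
quit;
```

The reported values are the upper-cased versions, not the original spellings. Swapping the two SELECT statements gives the opposite direction (in gsl but not in clm).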