BookmarkSubscribeRSS Feed
Akshaya_1397
Obsidian | Level 7

Hi all,

I have two completely different datasets.

Dataset1 -clm has variables counterparty,id,name

Dataset2-gsl has variable splr_name,relid,phn

 

Now I need to compare the counterparty name in clm and splr_name in gsl in 

New dataset and report the not matching ones.

 

Can anyone please help me on this.

Can I simply use proc compare or is there any other way .

2 REPLIES 2
HenryKobus
Obsidian | Level 7

proc compare check on observation level meaning check osb 1 from clm is equal to obs 1 from gsl. I unterstand your task is to catch observations which have no match in the opposite table. So my solution is:

proc sort data = clm;
	by counterparty;
run;

proc sort data = gsl;
	by splr_name;
run;

data 	clm_only (rename =(joinvar = counterparty ) drop = relid phn)
	gsl_only (rename =(joinvar = splr_name ) drop = id name);
merge   clm (in =inclm rename =(counterparty = joinvar))
	gsl (in =ingsl rename =(splr_name = joinvar));
by joinvar;
if ingsl eq 0 then output clm_only;
if inclm eq 0 then output gsl_only;
run;
ballardw
Super User

@Akshaya_1397 wrote:

Hi all,

I have two completely different datasets.

Dataset1 -clm has variables counterparty,id,name

Dataset2-gsl has variable splr_name,relid,phn

 

Now I need to compare the counterparty name in clm and splr_name in gsl in 

New dataset and report the not matching ones.

 

Can anyone please help me on this.

Can I simply use proc compare or is there any other way .


It is not clear what you mean by "compare" and "not matching". Proc Compare is a observation by observation comparison by default. So is likely not the tool unless the data sets are expected to have the same number (or at least pretty close) of observations with similar values.

 

Do want just properties of the variables such as length, format and label?

Or values that appear in one data set but not the other? This code will create two data sets with one observation of each value that appears in one data set but not the other. I think the names of the sets explain the contents. Obviously untested as we do not have your data sets.

proc sql;
   create table in_clm_not_gsl as
   select distinct counterparty as value from clm
   except
   select distinct splr_name as value from gsl
   ;
   create table in_gsl_not_clm as
   select distinct splr_name as value from gsl
   except
   select distinct counterparty as value from clm
   ;
quit;

Warning: These are equal comparisons. So if your variable is character case differences will be reported as "BOB" is not the same as "Bob".

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 2 replies
  • 1140 views
  • 2 likes
  • 3 in conversation