Hi,
I am attempting to merge two databases. To protect confidentiality, one of the databases had all of its unique identifiers removed, so there is no single variable that is unique for each subject AND present in both databases that I can use for the merge. HOWEVER, I have found that the combination of two date variables (let's call them VAR1 and VAR2) IS unique for each subject.
Is there any easy way to merge based on these two variables (instead of just using one unique variable which I don't have)? My code is below, but I don't think you can list the two variables this way. I sorted on the two variables first.
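/* Sort both data sets by the two-variable key */
proc sort data=clean.INPROGRESS;
    by VAR1 VAR2;
run;
proc sort data=clean.ORIGINAL;
    by VAR1 VAR2;
run;
/* Merge on both BY variables, keeping only rows present in INPROGRESS */
data clean.COMBINED;
    merge clean.INPROGRESS (in=a) clean.ORIGINAL;
    by VAR1 VAR2;
    if a;
run;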
Please help! Thank you.
Run it. It should work assuming your analysis of the unique nature of the paired variables is correct.
A BY statement can list many variables. Observations match only when every BY variable is equal, so the merge behaves the same with two variables as it does with one.
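If you want to confirm the uniqueness assumption before merging, a quick check along these lines may help. This is only a sketch using the data set and variable names from the question; the column name n_rows is made up for illustration.

```sas
/* List any VAR1/VAR2 combination that occurs more than once. */
/* n_rows is a hypothetical column name, not from the original post. */
proc sql;
    select VAR1, VAR2, count(*) as n_rows
    from clean.ORIGINAL
    group by VAR1, VAR2
    having count(*) > 1;
quit;
```

If the query returns no rows, the pair is unique in that data set. Running the same query against clean.INPROGRESS covers the other side of the merge.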
Hi,
It does work. I just needed to exclude missing VAR1 values from the original databases prior to the merge. Thanks!
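For anyone who hits the same issue: BY-group matching treats missing values as equal to each other, so rows with a missing date can pair up across the two data sets. One possible way to drop them is a WHERE= data set option on each input, as sketched below. This assumes both data sets are already sorted by VAR1 VAR2 and is not necessarily how the filtering was done in the original case.

```sas
/* Sketch: skip rows with a missing VAR1 as each data set is read. */
/* Assumes both inputs are already sorted by VAR1 VAR2. */
data clean.COMBINED;
    merge clean.INPROGRESS (in=a where=(VAR1 is not missing))
          clean.ORIGINAL   (where=(VAR1 is not missing));
    by VAR1 VAR2;
    if a;
run;
```

If VAR2 can also be missing, the same condition could be extended to it in each WHERE= option.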