Hi,
I am attempting to merge two databases. To protect confidentiality, one of the databases had all of it's unique identifiers removed, so there is no single variable that is unique for each subject AND present in both databases that I can use for the merge. HOWEVER, I have found that the combination of two date variables (let's call them VAR1 and VAR2) IS unique for each subject.
Is there any easy way to merge based on these two variables (instead of just using one unique variable which I don't have)? My code is below, but I don't think you can list the two variables this way. I sorted on the two variables first.
Proc sort data=clean.INPROGRESS;
by VAR1 VAR2;
run;
proc sort data=clean.ORIGINAL;
by VAR1 VAR2;
run;
data clean.COMBINED;
merge clean.INPROGRESS (in=a) clean.ORIGINAL ;
by VAR1 VAR2;
if a;
run;
Please help! Thank you.
Run it. It should work assuming your analysis of the unique nature of the paired variables is correct.
BY processing can use a considerable number of variables. The number doesn't change the basic behavior.
Run it. It should work assuming your analysis of the unique nature of the paired variables is correct.
BY processing can use a considerable number of variables. The number doesn't change the basic behavior.
Hi,
It does work. I just needed to exclude missing VAR1 values from the original databases prior to the merge. Thanks!
Registration is open! SAS is returning to Vegas for an AI and analytics experience like no other! Whether you're an executive, manager, end user or SAS partner, SAS Innovate is designed for everyone on your team. Register for just $495 by 12/31/2023.
If you are interested in speaking, there is still time to submit a session idea. More details are posted on the website.
Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.
Find more tutorials on the SAS Users YouTube channel.