I have a dataset "excluded_id" that has unique people identified by varA varB varC
And then a master dataset "all_claims", with multiple observations per unique person (as identified by a unique combo of varA varB varC)
How might I drop all observations in "all_claims" tied to a given person, if that person shows up in excluded_id?
Basically the pseudocode is something like:
data final;
set all_claims;
where varA varB varC not in (excluded_id);
The trick is that each unique person is identified by a unique combo of varA varB varC...
run;
** UNTESTED CODE **
Assumes both data sets are sorted by varA varB varC.
data final;
merge all_claims(in=in1) excluded_id(in=in2);
by varA varB varC;
where in1 and not in2;
run;
** UNTESTED CODE **
Assumes both data sets are sorted by varA varB varC.
data final;
merge all_claims(in=in1) excluded_id(in=in2);
by varA varB varC;
where in1 and not in2;
run;
Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.