Solved: Remove observations from one dataset based upon a list in another

yeaforme · Posted 10-27-2013 10:44 AM

I have two datasets. Dataset A holds all the raw data, with multiple (sometimes thousands of) observations per subject ID code. Dataset B is a list of subject ID codes that need to be removed from Dataset A (i.e., if an ID code appears in Dataset B, then all observations linked to that ID code need to be removed from Dataset A).

This seems like a job for proc sql, but I don't know how to go about it. Thanks!

Jagadishkatam · Posted 10-27-2013 10:53 AM

proc sort data=dataset_A;
  by id;
run;

proc sort data=dataset_B;
  by id;
run;

data want;
  merge dataset_A(in=a) dataset_B(in=b);
  by id;
  if a and not b;
run;

Or by proc sql:

proc sql;
  create table want as
  select a.*
  from dataset_A as a, dataset_b as b
  where a.id^=b.id;
quit;

Thanks,

Jagadish

yeaforme · Posted 10-28-2013 12:02 AM

The SQL code creates a Cartesian product: it takes forever to run and then times out once it has consumed all available memory. The data step merge above it works well.

Thanks!
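
For anyone who still wants an SQL version: the query above joins every row of dataset_A to every row of dataset_B and keeps each pair whose IDs differ, so once dataset_B holds more than one ID nothing is actually excluded and the output balloons. A minimal sketch of one way to avoid the join, using a NOT IN subquery, is below. It assumes the key variable is named id in both datasets, as in the code above; the sample data and the value variable are made up purely for illustration.

```sas
/* Hypothetical sample data, for illustration only */
data dataset_A;
  input id value;
  datalines;
1 10
1 11
2 20
3 30
3 31
;
run;

data dataset_B;
  input id;
  datalines;
1
3
;
run;

/* Keep only the rows of dataset_A whose id does not appear in dataset_B.
   The subquery is a filter, not a join, so no Cartesian product is built. */
proc sql;
  create table want as
  select *
  from dataset_A
  where id not in (select id from dataset_B);
quit;
```

With this sample data, want should contain only the single observation for id 2. Unlike the merge, this approach does not need either dataset to be sorted first.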
