Re: base sas

knveraraju91 · Posted 05-19-2016 05:18 PM

Dear ,

I have two large SDTM datasets. I need to find the number of SUBJID values that are present in one dataset but not in the other, and vice versa. Is there any code that I can use?

Thanks.

ChrisNZ · Posted 05-19-2016 06:30 PM

If you can index your tables by subjid, you can then do this:



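data WANT;
  /* The IN= flags record which table contributes to the current SUBJID */
  merge TAB1(keep=SUBJID in=A)
        TAB2(keep=SUBJID in=B);
  by SUBJID;
  /* Keep one row per SUBJID that occurs in only one of the two tables */
  if first.SUBJID and not (A and B);
  if A then SOURCE='TAB1'; else SOURCE='TAB2';
run;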
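
If the tables are neither sorted nor indexed by SUBJID, the MERGE above would fail with a BY-order error. A minimal sketch of one way around that, assuming it is acceptable to create two small work tables (the names IDS1 and IDS2 are hypothetical): sort de-duplicated key lists first, then merge those.

```sas
/* Assumption: TAB1 and TAB2 are not sorted or indexed by SUBJID. */
/* IDS1 and IDS2 are hypothetical names for the sorted key lists. */
proc sort data=TAB1(keep=SUBJID) out=IDS1 nodupkey;
  by SUBJID;
run;

proc sort data=TAB2(keep=SUBJID) out=IDS2 nodupkey;
  by SUBJID;
run;

data WANT;
  length SOURCE $4;
  merge IDS1(in=A) IDS2(in=B);
  by SUBJID;
  /* Each SUBJID appears at most once per list, so FIRST. is not needed */
  if not (A and B);
  if A then SOURCE='TAB1'; else SOURCE='TAB2';
run;
```

Because NODUPKEY leaves one row per SUBJID, the sorted lists are usually much smaller than the full SDTM tables.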

High-Performance SAS Coding - Third Edition

PGStats · Posted 05-19-2016 10:42 PM

Or use SQL:

proc sql;
  create table want as
  select "TAB1" as source, subjid
    from TAB1
    where subjid not in (select subjid from TAB2)
  union all
  select "TAB2" as source, subjid
    from TAB2
    where subjid not in (select subjid from TAB1);
quit;
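
Since the original question asks for the number of subjects, the counts could be derived from WANT along these lines. This is only a sketch: it assumes WANT has the columns SOURCE and SUBJID as created above, and the column name n_subjects is made up for illustration. COUNT(DISTINCT ...) is used because a table with several records per subject would contribute the same SUBJID more than once.

```sas
/* Assumes WANT contains SOURCE and SUBJID, as created above. */
/* n_subjects is a hypothetical column name.                  */
proc sql;
  select source,
         count(distinct subjid) as n_subjects
    from want
    group by source;
quit;
```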

PG

FreelanceReinh · Posted 05-20-2016 04:28 AM

If you're only interested in how many distinct, non-missing SUBJIDs there are in the first dataset and not in the second and vice versa:

proc sql;
select count(subjid) as only_in_tab1
from ((select subjid from tab1)
      except
      (select subjid from tab2));

select count(subjid) as only_in_tab2
from ((select subjid from tab2)
      except
      (select subjid from tab1));
quit;

knveraraju91 · Posted 05-25-2016 12:26 PM

Thank you all
