Solved: Re: Keep non unique IDs with non-repeated codes

Satori · Posted 03-02-2023 06:50 AM

I have a dataset like this (below). The variable code takes 4 possible values

The ID variable is not unique. There can be an ID with 2, 3 or 4 codes, although it's rare with 3 and 4. I want to keep the non-unique IDs that have different codes (or conversely, remove the non-unique IDs with repeated codes).

What I have:

data have;
input Obs ID $12. code $;
cards;
1 AC0000037163 C1
2 AC0000037163 U1
3 BE0000037282 U1
4 BE0000037282 U2
5 CZE0000037693 C2
6 CZE0000037693 C2
7 FR0000037738 U2
8 FR0000037738 C2
;

What I want:

data want;
input Obs ID $12. code $;
cards;
1 AC0000037163 C1
2 AC0000037163 U1
3 BE0000037282 U1
4 BE0000037282 U2
7 FR0000037738 U2
8 FR0000037738 C2
;

PeterClemmensen · Posted 03-02-2023 06:53 AM

data have;
input Obs ID $12. code $;
cards;
1 AC0000037163 C1
2 AC0000037163 U1
3 BE0000037282 U1
4 BE0000037282 U2
5 CZE0000037693 C2
6 CZE0000037693 C2
7 FR0000037738 U2
8 FR0000037738 C2
;

proc sql;
   create table want as
   select *
   from have
   group by ID
   having count(distinct code) > 1
   ;
quit;

The DATA to DATA Step Macro
Blog: SASnrd

View solution in original post

PeterClemmensen · Posted 03-02-2023 06:53 AM

data have;
input Obs ID $12. code $;
cards;
1 AC0000037163 C1
2 AC0000037163 U1
3 BE0000037282 U1
4 BE0000037282 U2
5 CZE0000037693 C2
6 CZE0000037693 C2
7 FR0000037738 U2
8 FR0000037738 C2
;

proc sql;
   create table want as
   select *
   from have
   group by ID
   having count(distinct code) > 1
   ;
quit;

The DATA to DATA Step Macro
Blog: SASnrd

Keep non unique IDs with non-repeated codes

Re: Keep non unique IDs with non-repeated codes

Re: Keep non unique IDs with non-repeated codes

Registration is open

SAS Training: Just a Click Away