Hi,
Some subject IDs in my dataset appear two, three, or even more times. I do not know which of the repeated records is correct, so I would like to remove every ID that occurs more than once.
For example, there are two records for subject ID #1002, and I would like to remove both of them. I would also like to export all the removed subjects to a separate dataset so I can keep track of who has duplicates.
Thank you all in advance!
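Will this help? After sorting by id, an ID that occurs only once is both the first and the last observation of its BY group, so those rows go to unique and every row of a repeated ID goes to NOUNIQUEKEY.
/* Sort so that rows with the same id are adjacent */
proc sort data=have;
  by id;
run;
/* unique: ids that occur exactly once; NOUNIQUEKEY: all rows of repeated ids */
data unique NOUNIQUEKEY;
  set have;
  by id;
  /* first.id and last.id are both 1 only when the BY group has a single row */
  if first.id and last.id then output unique;
  else output NOUNIQUEKEY;
run;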
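An alternative sketch, assuming SAS 9.3 or later, where PROC SORT supports the NOUNIQUEKEY option: the same split can be done in a single step. With NOUNIQUEKEY, OUT= receives the observations whose id occurs more than once, and UNIQUEOUT= receives the observations whose id occurs exactly once. The names have and id are taken from the code above; dups is a hypothetical name for the tracking dataset.

```sas
/* Assumes SAS 9.3+; "dups" is a hypothetical output name */
proc sort data=have nouniquekey
          out=dups            /* all rows of ids that occur more than once */
          uniqueout=unique;   /* rows whose id occurs exactly once */
  by id;
run;
```

Specifying OUT= matters here: without it, PROC SORT would replace have with the sorted non-unique rows.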