Hi,
My dataset contains subject IDs that appear two, three, or even more times. I do not know which record is the correct one, so I would like to remove every ID that occurs more than once.
For example, there are two records for subject ID #1002, and I would like to remove both of them. I would also like to export all of the removed subjects to a separate dataset so I can keep track of which IDs are duplicated.
Thank you all in advance!
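Will this help? After sorting by id, a record that is both first.id and last.id belongs to an ID that occurs only once, so it goes to unique; every other record goes to NOUNIQUEKEY.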
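/* Sort so that BY-group processing on id works */
proc sort data=have;
  by id;
run;

/* unique: IDs that occur exactly once; NOUNIQUEKEY: all records of repeated IDs */
data unique NOUNIQUEKEY;
  set have;
  by id;
  if first.id and last.id then output unique;
  else output NOUNIQUEKEY;
run;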
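If you also want a compact list of which IDs were duplicated and how many times, something like the following PROC SQL step might help. This is only a sketch: it assumes the same input dataset have and key variable id used above, and the output names dup_ids and n_records are made up for illustration.

```sas
/* One row per repeated ID, with the number of records found for it. */
/* dup_ids and n_records are hypothetical names; rename as needed.   */
proc sql;
  create table dup_ids as
  select id, count(*) as n_records
  from have
  group by id
  having count(*) > 1;
quit;
```

The NOUNIQUEKEY dataset keeps every removed record in full, while dup_ids would hold just one summary row per duplicated ID, which may be easier to review.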