I know if I code like:
proc sort data=zzz nodupkey;
by aa cc; run;
it will remove duplicate entries which have identical values in aa & cc. How can I see, instead of removing, the duplicates which got removed?
Much thanks.
Regards.
proc sort data=zzz nodupkey dupout= xyz;
by aa cc;
run;
dupout=
proc sort data=zzz nodupkey dupout= xyz;
by aa cc;
run;
Here is a link to the documentation for DUPOUT: https://documentation.sas.com/?docsetId=proc&docsetTarget=p02bhn81rn4u64n1b6l00ftdnxge.htm&docsetVer...
Hi @d6k5d3 While dupout and such options offer pretty straight forward solutions, I'd recommend to get intuitive knowledge on by group processing in SAS, SQL, Proc freq, and in general counting techniques. A grasp of that will make you comfortable. The idea is to differentiate unique records, unique keys etc. The more you dig, you will venture into other concepts such as indexes and beyond or in other words one understanding leads to the next. Have fun!
SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!
SAS' Charu Shankar shares her PROC SQL expertise by showing you how to master the WHERE clause using real winter weather data.
Find more tutorials on the SAS Users YouTube channel.