How can i deduplicate a dataset by ID1 and ID2, if I have the next structure:
ID1 ID2 DATE VALUE
1000 10 31JAN2016 5
1000 10 26FEB2016 6
1000 10 31MAR2016 7
The result that I expectedis:
ID1 ID2 DATE VALUE
1000 10 31MAR2016 7
(Which is the most current date according to the date)
Use proc sort.
proc sort data=have;
by id id2 date; run;
data want;
set have;
by id id2;
if last.id2;
run;
Use proc sort.
proc sort data=have;
by id id2 date; run;
data want;
set have;
by id id2;
if last.id2;
run;
No, it has to be a variable listed in the BY statement.
And it picks the last of that group, based on your specifications, this is what was required, but you can modify it to fit your needs, if you have more variables or something.
You can review the documentation on how BY groups are processed and the FIRST/LAST variables and how they're calculated.
I find the examples and illustrations here helpful:
Don't miss out on SAS Innovate - Register now for the FREE Livestream!
Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.