Need SAS procedure!
I have duplicate records that differ in only one variable, and I want to keep a specific record based on that variable. For instance:
Customer Name   Customer ID   Address   Customer Type
Joe Doe         123           123 Way   Online     (retain)
Joe Doe         123           123 Way   In-Store   (delete)
Ken Moore       456           456 Way   Online     (retain)
Ken Moore       456           456 Way   In-Store   (delete)
Lisa Mae        789           789 Way   In-Store   (retain)
When a customer has duplicates, I want to keep the "Online" record and delete the "In-Store" record. However, when there are no duplicates, I want to retain the "In-Store" record.
Thanks for your help!
proc sort data=have out=want;
by customer_name customer_id address descending customer_type;
run;
proc sort data=want nodupkey;
by customer_name customer_id address;
run;
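The first sort, with descending customer_type, is the key step: because "Online" sorts after "In-Store" alphabetically, descending order puts the Online record first for each customer whenever one exists. The second sort, with nodupkey, then keeps only the first observation for each combination of customer_name, customer_id and address, which is the Online record if present and the In-Store record otherwise.
Note: this is not tested. Post test data in the form of a data step to get tested code.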
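For anyone who wants to try the approach, here is a minimal sketch that builds test data from the example in the question and then runs the two sorts. The variable lengths, the pipe delimiter, and the swapped row order for Ken Moore (to show that input order does not matter) are my assumptions and are not from the original post.

```sas
/* Hypothetical test data based on the example in the question. */
/* Lengths and the '|' delimiter are assumptions.               */
data have;
    infile datalines dsd dlm='|' truncover;
    length customer_name $20 customer_id 8 address $20 customer_type $10;
    input customer_name customer_id address customer_type;
    datalines;
Joe Doe|123|123 Way|Online
Joe Doe|123|123 Way|In-Store
Ken Moore|456|456 Way|In-Store
Ken Moore|456|456 Way|Online
Lisa Mae|789|789 Way|In-Store
;
run;

/* Step 1: within each customer, put Online ahead of In-Store. */
proc sort data=have out=want;
    by customer_name customer_id address descending customer_type;
run;

/* Step 2: keep only the first observation per customer key. */
proc sort data=want nodupkey;
    by customer_name customer_id address;
run;

/* Inspect the result. */
proc print data=want noobs;
run;
```

With this data, want should end up with three rows: the Online records for Joe Doe and Ken Moore, and the In-Store record for Lisa Mae. Note that the comparison is on the exact text of customer_type, so values with different casing or spelling would need to be standardized first.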
This worked! Thank you very much...