Kelli
Calcite | Level 5

I have a dataset where I want to delete duplicate entries for the same person.  I have an employee ID field and then two values A and B for a second variable.  In any case where there is a duplicate employee ID, I want to delete the entry that has a value of B for the second variable.  Is there a simple way to code for this?

Peter_C
Rhodochrosite | Level 12

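Assuming your data are sorted by empID, how about:

data REDUCED ;
   /* interleave the two subsets: within each empID, non-B rows come before B rows */
   set yourdata( where=( secondvar NE 'B' ))
       yourdata( where=( secondvar EQ 'B' )) ;
   by empID ;
   if first.empID ;   /* keep only the first row for each empID */
run ;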

sas121987
Calcite | Level 5

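Hello, you can do this with a sort followed by a DATA step using FIRST. processing (similar in effect to NODUPKEY).

Please find the code logic:

proc sort data = test ;
   by employeeID Column2 ;
run ;

data test2 ;
   set test ;
   by employeeID ;
   /* FIRST. variables exist only for variables named in the BY statement */
   if first.employeeID then output ;
run ;

This keeps the first row for each employeeID. Because the data are sorted by Column2 within employeeID and 'A' sorts before 'B', the B row is dropped whenever the same employee also has an A row.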

Thanks
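For comparison, here is a minimal sketch of the same idea using the NODUPKEY option of PROC SORT instead of FIRST. processing. The sample rows and the intermediate dataset name sorted are made up for illustration; test, test2, employeeID and Column2 follow the names used in this thread. It assumes Column2 only ever holds 'A' or 'B'.

```sas
/* Hypothetical sample data, for illustration only */
data test;
    input employeeID Column2 $;
    datalines;
101 B
101 A
102 B
103 A
;
run;

/* Step 1: order the rows so that 'A' comes before 'B' within each employeeID */
proc sort data=test out=sorted;
    by employeeID Column2;
run;

/* Step 2: keep one row per employeeID. EQUALS preserves the input order
   within each BY group, so the row retained is the first one from step 1. */
proc sort data=sorted out=test2 nodupkey equals;
    by employeeID;
run;
```

With these sample rows, test2 should hold 101 A, 102 B and 103 A: the B row is dropped only where the same employee also has an A row. The two-step sort is used because NODUPKEY compares only the BY variables, so sorting by employeeID Column2 with NODUPKEY in a single step would not remove anything here.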
