Solved: Removing dups in dataset

SASPreK · Posted 05-29-2024 06:07 PM

I have the following input dataset

ID Var1 Var2

11 A 23

11 B 121

11 A 23

12 A 32

12 B 158

12 A 32

12 B 158

13 A 87

13 B 567

I want to remove the repetitive records by ID variable and get the following output.

ID Var1 Var2

11 A 23

11 B 121

12 A 32

12 B 158

13 A 87

13 B 567

Please suggest the best way to do it.

ballardw · Posted 05-29-2024 06:49 PM

Not possible. SAS will not allow two variables with the same name.

If the third column is VAR2:

data have;
   input ID   Var1 $   Var2 ;
datalines;
11       A          23
11       B          121
11       A          23
12      A          32
12      B           158
12      A           32
12      B           158
13      A            87
13      B            567
;

Then easiest code is:

proc sort data=have out=want nodupkey;
   by id var1 var2;
run;

The NODUPKEY option tells SAS to sort the data but only allow one observation with each combination of the BY variables in the OUT= data set.

There are options to send the deleted observations to a different output set.

View solution in original post

ballardw · Posted 05-29-2024 06:49 PM

Not possible. SAS will not allow two variables with the same name.

If the third column is VAR2:

data have;
   input ID   Var1 $   Var2 ;
datalines;
11       A          23
11       B          121
11       A          23
12      A          32
12      B           158
12      A           32
12      B           158
13      A            87
13      B            567
;

Then easiest code is:

proc sort data=have out=want nodupkey;
   by id var1 var2;
run;

The NODUPKEY option tells SAS to sort the data but only allow one observation with each combination of the BY variables in the OUT= data set.

There are options to send the deleted observations to a different output set.

SASPreK · Posted 05-31-2024 11:22 AM

Thank you so much for the solution, this works! I have edited my question to name the variables as Var1 and Var2, thank you for catching that 🙂

Removing dups in dataset

Re: Removing dups in dataset

Re: Removing dups in dataset

Re: Removing dups in dataset

Catch up on SAS Innovate 2026

Removing dups in dataset

Re: Removing dups in dataset

Re: Removing dups in dataset

Re: Removing dups in dataset

Catch up on SAS Innovate 2026

SAS Training: Just a Click Away