Solved: How to delete duplicate rows using proc sql?

nikunjgattani · Posted 02-12-2015 11:22 PM

Hi,

I want to delete duplicate rows using proc sql (I know it is possible with proc sort and nodupkey).

For example, in the data below I want to remove duplicates on the basis of name and age.

Input:

id  name  age  company
1   aik   26   tcs
2   aik   29   infosys
3   bik   23   wns
4   bik   23   tcs
5   cik   30   infosys
6   cik   28   wns

Output:

id  name  age  company
1   aik   26   tcs
2   aik   29   infosys
3   bik   23   wns
5   cik   30   infosys
6   cik   28   wns
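
For comparison, the proc sort route mentioned in the question can be sketched as follows. This is a minimal sketch, assuming the input data set is named have (as in the replies) and that the output name want is free to use. NODUPKEY keeps the first observation it meets in each BY group, so which id survives depends on the incoming row order.

```sas
/* Sketch only: have and want are assumed data set names. */
/* NODUPKEY keeps one observation per name/age combination. */
proc sort data=have out=want nodupkey;
  by name age;
run;
```

Note that the result comes back sorted by name and age rather than by id.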

Ksharp · Posted 02-13-2015 02:58 AM

data have;
input id name $  age company $;
cards;
1 aik 26 tcs
2 aik 29 infosys
3 bik 23 wns
4 bik 23 tcs
5 cik 30 infosys
6 cik 28 wns
;
run;
proc sql;
 select *
  from have
   group by name,age
     having id=min(id);
quit;

Xia Keshan
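
The query above only prints the result. If the deduplicated rows need to be stored, one possible variation is to wrap the same query in create table, sketched below with an assumed output name want. Because select * brings in columns that are not in the group by, SAS remerges the summary statistic min(id) back onto the detail rows and writes a note about remerging to the log; that is expected here.

```sas
proc sql;
  /* Sketch only: want is an assumed output table name. */
  /* For each name/age group, keep the row whose id is the group minimum. */
  create table want as
  select *
    from have
    group by name, age
    having id = min(id)
    order by id;
quit;
```

Since id is unique in the sample data, exactly one row per name/age group survives, which for the posted input leaves ids 1, 2, 3, 5 and 6.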


Reeza · Posted 02-13-2015 12:54 AM

Use Select Distinct to get unique records.

proc sql;
  create table want as
  select distinct id, name, age, company
  from have;
quit;

nikunjgattani · Posted 02-13-2015 12:58 AM

Hi Reeza,

Your query returns distinct rows across all columns. I am looking for distinct name and age combinations only.

Thanks

Nikunj
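
To illustrate the difference on the sample data, here is a small sketch (assuming the data set have from the thread). The first query lists every column, and because id is unique every row counts as distinct. The second restricts the select list to name and age, which gives the unique combinations but drops id and company.

```sas
proc sql;
  /* All four columns: id is unique, so all six rows are kept. */
  select distinct id, name, age, company
    from have;

  /* Only name and age: five unique combinations, without id or company. */
  select distinct name, age
    from have;
quit;
```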

Reeza · Posted 02-13-2015 01:00 AM

For duplicates, do you care which company gets attached?

You can do a group by and summarize on those columns; a data step with first./last. gives you more control.

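proc sql;
  /* min(id) and min(company) are computed independently, so they can come */
  /* from different rows: for bik/23 this returns id 3 with company tcs.   */
  create table want as
  select min(id) as ID, name, age, min(company) as company
  from have
  group by name, age
  order by id, name, age;
quit;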
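
The data step alternative with first./last. mentioned above could look roughly like this. It is a sketch under assumptions: have_sorted and want are illustrative data set names, and the lowest id within each name/age group is taken as the row to keep.

```sas
/* Sketch only: have_sorted and want are assumed data set names. */
/* Sort so rows sharing name and age are adjacent, lowest id first. */
proc sort data=have out=have_sorted;
  by name age id;
run;

data want;
  set have_sorted;
  by name age;
  /* first.age is 1 on the first row of each name/age group. */
  if first.age;
run;
```

Because the whole row is kept, id and company always come from the same observation. Changing the sort order, or testing last.age instead, picks a different row from each group.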

