Removing duplicate rows by max date

New Contributor
Posts: 2

I WOULD LIKE TO REMOVE DUPLICATE ROWS FOR A KEY VARIABLE, KEEPING ONLY THE ROW WITH THE MAXIMUM DATE IN ANOTHER VARIABLE, AND I DO NOT KNOW HOW. CAN YOU HELP ME?

Example:

COLUMN A (KEY)  COLUMN B (DATE)
1               20170422
1               20170423
1               20170425

I NEED EACH KEY TO APPEAR ONLY ONCE, WITH ITS MAXIMUM DATE: REMOVE THE DUPLICATED KEYS AND KEEP, FOR EACH KEY, ONLY THE ROW WITH THE LAST DATE AVAILABLE. CAN YOU UNDERSTAND ME? THANKS

Frequent Contributor
Posts: 107

Re: Removing duplicate rows by max date

proc sql;
  /* date is column B; keeps each row whose date equals the maximum date for its key */
  select *
  from have
  group by key
  having date = max(date);
quit;
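
To try the query above on the example from the question, here is a minimal sketch. The dataset name HAVE and the variable names KEY and DATE follow the reply; the output name WANT and the choice to read the dates as SAS date values (rather than plain numbers such as 20170422) are assumptions. The comparison works the same way in either case.

```sas
/* Sample rows from the question; reading DATE as a SAS date is an assumption */
data have;
  input key date :yymmdd8.;
  format date yymmddn8.;
  datalines;
1 20170422
1 20170423
1 20170425
;
run;

/* Store the result in WANT instead of only printing it */
proc sql;
  create table want as
  select *
  from have
  group by key
  having date = max(date);
quit;
```

With these three rows, WANT should contain the single row for key 1 with date 20170425. If several rows share the same maximum date for a key, this query keeps all of them.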

New Contributor
Posts: 2

Re: Removing duplicate rows by max date

Sorry for my answer, and thanks a lot for your help.

 

 

Respected Advisor
Posts: 4,973

Re: Removing duplicate rows by max date

SAS contains tools to handle this:

proc sort data=have;
  by key date;
run;

data want;
  set have;
  by key date;
  if last.key;  /* keep only the last row of each key, i.e. the latest date */
run;
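
A possible variant of the same idea, sketched with two PROC SORT steps and no DATA step. It assumes the same HAVE dataset with KEY and DATE; the intermediate name SORTED is hypothetical. It relies on PROC SORT's default EQUALS behavior, under which NODUPKEY keeps the first observation of each BY group in input order.

```sas
/* Put the latest date first within each key */
proc sort data=have out=sorted;
  by key descending date;
run;

/* Keep only the first row per key, which is now the row with the latest date */
proc sort data=sorted out=want nodupkey;
  by key;
run;
```

Unlike the in-place sort above, this leaves HAVE unchanged because both steps write to OUT= datasets.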

Esteemed Advisor
Posts: 7,203

Re: Removing duplicate rows by max date

Nope, can't understand you. Maybe your Caps Lock is stuck? Maybe sort the data by key and date, then take the last (or first) value for each key?
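
As a rough sketch of the "take the first value" option: sort with the date descending inside each key, then keep the first row per key. The names HAVE, KEY, and DATE are taken from the other replies; SORTED_DESC and WANT are hypothetical names.

```sas
/* Latest date comes first within each key */
proc sort data=have out=sorted_desc;
  by key descending date;
run;

/* FIRST.KEY is 1 only on the first row of each key group */
data want;
  set sorted_desc;
  by key;
  if first.key;
run;
```

Sorting ascending and using LAST.KEY instead gives the same row per key; which one to use is mostly a matter of taste.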
