The subject might be misleading, but what I'm trying to do is take a dataset of matched names and create a new dataset that lists every match under its "best" name (the name with the highest value in the RECORDS1 column). I have provided both a visual of what I'm trying to do and the HAVE/WANT code.
If anyone needs additional information to help solve this, please let me know. Thanks in advance!
HAVE:
NAME1      RECORDS1   NAME2      RECORDS2
TOM        5243       TOMMY      4
BRAD       873        BRADLEY    219
BRADLEY    219        BRAD       873
JOHN       61017      JOHNNY     905
JOHNNY     905        JOHN       61017
JONATHAN   500        JOHNNY     905
WANT:
NAME1      NAME2
TOM        TOMMY
BRAD       BRADLEY
JOHN       JOHNNY
JOHN       JONATHAN
DATA have;
  INFILE DATALINES DSD;
  INPUT NAME1 $ RECORDS1 NAME2 $ RECORDS2;
  DATALINES;
TOM,5243,TOMMY,4
BRAD,873,BRADLEY,219
BRADLEY,219,BRAD,873
JOHN,61017,JOHNNY,905
JOHNNY,905,JOHN,61017
JONATHAN,500,JOHNNY,905
;
run;
DATA want;
  INFILE DATALINES DSD;
  INPUT NAME1 $ NAME2 $;
  DATALINES;
TOM,TOMMY
BRAD,BRADLEY
JOHN,JOHNNY
JOHN,JONATHAN
;
run;
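To make the goal more concrete, here is a rough sketch of the kind of logic I think is needed. It is not a verified solution. It assumes that names linked through a chain of matches belong to one group (JONATHAN matches JOHNNY, and JOHNNY matches JOHN, so JONATHAN ends up under JOHN), and that ties in the record count can be broken alphabetically. The dataset name want_sketch and the helper variables (name, best, bestrec, b1, b2, r1, r2, changed) are made up for illustration.

```sas
/* Sketch only: give every name a "best" label, spread the strongest  */
/* label along each match until nothing changes, then list the pairs. */
data want_sketch (keep=NAME1 NAME2);
  length name best $8 bestrec 8 b1 b2 $8 r1 r2 8;
  declare hash h(ordered: 'a');
  rc = h.defineKey('name');
  rc = h.defineData('name', 'best', 'bestrec');
  rc = h.defineDone();
  declare hiter it('h');

  /* Seed: each name starts out as its own best name. */
  do i = 1 to n;
    set have point=i nobs=n;
    name = NAME1; best = NAME1; bestrec = RECORDS1; rc = h.add();
    name = NAME2; best = NAME2; bestrec = RECORDS2; rc = h.add();
  end;

  /* Propagate: for each matched pair, both names adopt the label with */
  /* the higher record count (assumed tie-break: alphabetical order).  */
  do until (changed = 0);
    changed = 0;
    do i = 1 to n;
      set have point=i nobs=n;
      rc = h.find(key: NAME1); b1 = best; r1 = bestrec;
      rc = h.find(key: NAME2); b2 = best; r2 = bestrec;
      if b1 ne b2 then do;
        if r1 > r2 or (r1 = r2 and b1 < b2) then do;
          best = b1; bestrec = r1;
        end;
        else do;
          best = b2; bestrec = r2;
        end;
        name = NAME1; rc = h.replace();
        name = NAME2; rc = h.replace();
        changed = 1;
      end;
    end;
  end;

  /* Output one row per name that is not itself the best name. */
  rc = it.first();
  do while (rc = 0);
    if name ne best then do;
      NAME1 = best;
      NAME2 = name;
      output;
    end;
    rc = it.next();
  end;
  stop;
run;
```

With the HAVE data above, this should produce the same four pairs as WANT, although the row order may differ because the hash is read back in name order.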