BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
BenBrady
Obsidian | Level 7

I want to keep only the duplicates where there is an identical match betwwen sport1 and sport2. So in the following table I only want to keep Person A where sport1 and Sport2 are both soccer and Person D where Sport1 and Sport2 are both Tennis

 

 

 

PersonSport1Sport2
ASoccerSoccer
ASoccerTable Tennis
ASoccerRugby
BFootyFooty
CBaseballBaseball
DTennisBadmington
DTennisTennis
1 ACCEPTED SOLUTION

Accepted Solutions
Kurt_Bremser
Super User
data want;
set have;
if sport1 = sport2;
run;

This is called a subsetting if.

You can do it in a where condition, if you want only one entry per person:

proc sort
  data=have (
    where=(sport1 = sport2)
  )
  out=want (keep=person)
  nodupkey
;
by person;
run;

View solution in original post

2 REPLIES 2
Kurt_Bremser
Super User
data want;
set have;
if sport1 = sport2;
run;

This is called a subsetting if.

You can do it in a where condition, if you want only one entry per person:

proc sort
  data=have (
    where=(sport1 = sport2)
  )
  out=want (keep=person)
  nodupkey
;
by person;
run;
Ksharp
Super User
To be honest. I don't understand your question.

data have;
input (Person Sport1 Sport2) ($);
cards;
A Soccer Soccer 
A Soccer Table Tennis 
A Soccer Rugby 
B Footy Footy 
C Baseball Baseball 
D Tennis Badmington 
D Tennis Tennis 
;
run;
proc sql;
select *
 from have
  group by person
   having sum(sport1=sport2) ne count(*) and  sum(sport1=sport2) ne 0;
quit;

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 2 replies
  • 1110 views
  • 1 like
  • 3 in conversation