BookmarkSubscribeRSS Feed
Lien728
Calcite | Level 5

Say I have a data set with 4 character variables: x1-x4. 

The data looks something like: ab cd ab ca.

I want to compare each variable to the other. Therefore, I have 6 combinations. I am interested in the ones that match (i.e., x1 matches x3, 'ab').

So my output should contain 3 columns:

First and second columns list the variables being compared to each other (total of 6 rows due to 6 combinations).

The third column stating whether or not there is a match.

I know to calculate the number of combinations, I use: c42=comb(4,2).

I am stuck thereafter. Any suggestions?

2 REPLIES 2
Haikuo
Onyx | Level 15

OK, I have added one more variable 'num' to identify the original obs no., take it out as you wish.

data have;

input (x1-x4) (:$);

cards;

aa ab aa bd

dd cc dd cc

;

data want (keep=first second match num) ;

set have;

array xx $ x1-x4;

do _i=1 to dim(xx);

   do _j=_i+1 to dim(xx);

first=vname(xx(_i));

second=vname(xx(_j));

       if  xx(_i)=xx(_j) then   match='Y';

else match='N';

Num=_n_;

output;

end;

end;

run;

proc print;run;

Regards,

Haikuo

Lien728
Calcite | Level 5

Thank you very much! That helped a lot and it was very straightforward! I was able to tweak it and fit it to a more complicated data set.

Thanks again!

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

What is Bayesian Analysis?

Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 2 replies
  • 1244 views
  • 4 likes
  • 2 in conversation