Say I have a data set with 4 character variables: x1-x4.
The data looks something like: ab cd ab ca.
I want to compare each variable to the other. Therefore, I have 6 combinations. I am interested in the ones that match (i.e., x1 matches x3, 'ab').
So my output should contain 3 columns:
First and second columns list the variables being compared to each other (total of 6 rows due to 6 combinations).
The third column stating whether or not there is a match.
I know to calculate the number of combinations, I use: c42=comb(4,2).
I am stuck thereafter. Any suggestions?
OK, I have added one more variable 'num' to identify the original obs no., take it out as you wish.
data have;
input (x1-x4) (:$);
cards;
aa ab aa bd
dd cc dd cc
;
data want (keep=first second match num) ;
set have;
array xx $ x1-x4;
do _i=1 to dim(xx);
do _j=_i+1 to dim(xx);
first=vname(xx(_i));
second=vname(xx(_j));
if xx(_i)=xx(_j) then match='Y';
else match='N';
Num=_n_;
output;
end;
end;
run;
proc print;run;
Regards,
Haikuo
Thank you very much! That helped a lot and it was very straightforward! I was able to tweak it and fit it to a more complicated data set.
Thanks again!
Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.
Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.
Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.
Find more tutorials on the SAS Users YouTube channel.