Help using Base SAS procedures

String Difference in Percentage

Reply
Contributor JAR
Contributor
Posts: 45

String Difference in Percentage

Dear All,

I would like to learn how to calculate % change, when two strings are compared.

Here is the original file:

1RATCAT
2BELLBALL
3TIMETOM

As there is only one character that is different in the first observation, it should return 1/3 %(number of deviation / total number of char in the original).

Desired output is something like this:

1RATCAT33
2BELLBALL25
3TIMETOM50

Thanks in advance,

Jijil Ramakrishnan

Trusted Advisor
Posts: 1,228

Re: String Difference in Percentage

Try this for the desired output.

data have;
input obs var1 $ var2 $;
datalines;
1 RAT CAT
2 BELL BALL
3 TIME TOM
;

data want;
set have;

do i=1 by 1 while(substr(var1,i,1) ne ' ');
if index(var2,substr(var1,i,1))>0 then cnt+1;
output;
end;
cnt=0;
run;

data final (drop=i cnt);
set want;
by obs;
if last.obs;
diff=(i-cnt)/i*100;
format diff 8.0;
run;

Contributor JAR
Contributor
Posts: 45

Re: String Difference in Percentage

Thank you so much.

Respected Advisor
Posts: 4,920

Re: String Difference in Percentage

Approximate matches between strings is an already well researched topic. SAS provides you with many tools in that area. Please look at SAS functions COMPGED, COMPLEV, SOUNDEX, and SPEDIS.

PG

PG
Contributor JAR
Contributor
Posts: 45

Re: String Difference in Percentage

Thanks a lot!

Ask a Question
Discussion stats
  • 4 replies
  • 362 views
  • 7 likes
  • 3 in conversation