How Compare two strings by percentage

Reply
Occasional Contributor
Posts: 5

How Compare two strings by percentage

Hello All,

 

I have two variable var1 var2.

In var1 character is like "marutivx" and In var2 character is like "marutivx(0)". I need to calculate the similarities/differences by percentage.

Please help on little urgent basis.

 

Thanks,

Shubho

Regular Contributor
Posts: 217

Re: How Compare two strings by percentage

Posted in reply to shubha1babu

This will help you:  http://support.sas.com/kb/48/582.html

 

Super User
Posts: 10,784

Re: How Compare two strings by percentage

Posted in reply to shubha1babu

That is a tough task. How you define similarities/differences ?

Anyway, COMPARE() might give you a little help.

 

data _null_;
var1="marutivx";var2="marutivx(0)";
diff=compare(var1,var2,'i');
put var1= var2= diff=;
run;
Occasional Contributor
Posts: 5

Re: How Compare two strings by percentage

Let me frame it once again.

I have 1000 obs. and two variables var1 and var2.I want to create 3rd variable which will show the similarities/differences between two strings by percentage wise.

ex.

var1          var2

maruti      marutix

vxez        vexz12

LX CNG  

               JHXD GX

and so on......

Regular Contributor
Posts: 217

Re: How Compare two strings by percentage

Posted in reply to shubha1babu

That's what my previous post covers...the complev function.

Occasional Contributor
Posts: 5

Re: How Compare two strings by percentage

Posted in reply to tomrvincent

Great!! Thanks a lot Tom.It worked.

 

But I need little more help.Need to compare two strings by first 5 alphabets.

e.g     var1               var 2                 match_parentage

         marutix            marutisyz         100

 

Thanks.

 

Super User
Posts: 13,563

Re: How Compare two strings by percentage

Posted in reply to shubha1babu

@shubha1babu wrote:

Great!! Thanks a lot Tom.It worked.

 

But I need little more help.Need to compare two strings by first 5 alphabets.

e.g     var1               var 2                 match_parentage

         marutix            marutisyz         100

 

Thanks.

 


Use the SUBSTR function to select the first 5 characters from each variable.

 

result = complev(substr(var1,1,5), substr(var2,1,5));

Ask a Question
Discussion stats
  • 6 replies
  • 124 views
  • 0 likes
  • 4 in conversation