DATA Step, Macro, Functions and more

Identifying similar strings

Reply
N/A
Posts: 0

Identifying similar strings

Hi

I want to compare one basic string with a number of strings, and identify the string which is most similar with the basic string. Is there a nice Macro facility or a SAS procedure that may manage this problem?
SAS Employee
Posts: 174

Re: Identifying similar strings

Posted in reply to deleted_user
Have a look at these functions
- SOUNDEX Function -> http://support.sas.com/onlinedoc/913/getDoc/da/lrdict.hlp/a000245948.htm
- SPEDIS Function -> http://support.sas.com/onlinedoc/913/getDoc/da/lrdict.hlp/a000245949.htm

And these samples
- 24582 - Combine data sets based upon similar values -> http://support.sas.com/kb/24/582.html
Encode character strings using SOUNDEX to aid in combining the data based upon similar but not exact values.

- 33340 - Using DATA Step functions to check character values or variables for near equality -> http://support.sas.com/kb/33/340.html
Several DATA step functions can be used to quantify or measure the difference between two character values

or these papers
- SOUNDEX -> http://support.sas.com/dsearch?qt=SOUNDEX+&ct=5240&Find=Search&col=suppprd&nh=10&qp=&qc=suppsas&ws=1...
- SPEDIS -> http://support.sas.com/dsearch?qt=SPEDIS+&ct=5240&Find=Search&col=suppprd&nh=10&qp=&qc=suppsas&ws=1&...

Please share your solution
Ask a Question
Discussion stats
  • 1 reply
  • 144 views
  • 0 likes
  • 2 in conversation