BookmarkSubscribeRSS Feed
DDawaba
Calcite | Level 5

Hello,

 

I am trying to use the soundex function on SAS Enterprise however the column is in Arabic Language and not in English.

 

Is there a way to handle this please?

4 REPLIES 4
Rick_SAS
SAS Super FREQ

Not easily, no.

The documentation for the SOUNDEX function describes how the function works. The doc says, that the algorithm (from the 1910'1 and '20s!) "is English-biased and is less useful for languages other than English." The doc also describes exactly how it works. It basically encodes every word and then declares that words that have the same encoding "sound alike."

 

To work with a non-English language, you would have to determine a similar encoding that encodes the sounds (based on characters) in the desired language. You would then write an FCMP function that implements the algorithm and can be used in the DATA step. Theoretically, this is possible for languages that use single-byte characters (such as Spanish and French), but it becomes much more difficult for multiple-byte character sets such as Arabic, Korean, or Chinese.  In practice, I suspect this would be extremely difficult.

SASKiwi
PROC Star

What is the actual business problem you are trying to solve by using SOUNDEX? If you explain that then maybe there is another way that might work.

DDawaba
Calcite | Level 5
I have 420 Business Natures for over 400K Clients yet its in Arabic with . There are several similar Business Natures within these 420 options that could be grouped together and accordingly lessen the options for further statistical analysis.
SASKiwi
PROC Star

If there are only 420 unique business descriptions, then manually building a lookup table to group them shouldn't take too long. I would put these in a spreadsheet and then type in the groups. Once complete, just import the the spreadsheet back into a SAS dataset and join it back to your original data. 

sas-innovate-2024.png

Available on demand!

Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.

 

Register now!

SAS Enterprise Guide vs. SAS Studio

What’s the difference between SAS Enterprise Guide and SAS Studio? How are they similar? Just ask SAS’ Danny Modlin.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 4 replies
  • 327 views
  • 0 likes
  • 3 in conversation