<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Soundex for Arabic Language in SAS Enterprise Guide</title>
    <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/Soundex-for-Arabic-Language/m-p/881846#M43196</link>
    <description>I have 420 Business Natures for over 400K Clients yet its in Arabic with . There are several similar Business Natures within these 420 options that could be grouped together and accordingly lessen the options for further statistical analysis.</description>
    <pubDate>Thu, 22 Jun 2023 07:06:48 GMT</pubDate>
    <dc:creator>DDawaba</dc:creator>
    <dc:date>2023-06-22T07:06:48Z</dc:date>
    <item>
      <title>Soundex for Arabic Language</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/Soundex-for-Arabic-Language/m-p/881683#M43192</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am trying to use the soundex function on SAS Enterprise however the column is in Arabic Language and not in English.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Is there a way to handle this please?&lt;/P&gt;</description>
      <pubDate>Wed, 21 Jun 2023 11:32:16 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/Soundex-for-Arabic-Language/m-p/881683#M43192</guid>
      <dc:creator>DDawaba</dc:creator>
      <dc:date>2023-06-21T11:32:16Z</dc:date>
    </item>
    <item>
      <title>Re: Soundex for Arabic Language</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/Soundex-for-Arabic-Language/m-p/881757#M43193</link>
      <description>&lt;P&gt;Not easily, no.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://go.documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/lefunctionsref/n1i9a3o4kciemhn1kpgutl20e4i0.htm" target="_self"&gt;The documentation for the SOUNDEX function&lt;/A&gt; describes how the function works. The doc says, that the algorithm (from the 1910'1 and '20s!) "is English-biased and is less useful for languages other than English." The doc also describes exactly how it works. It basically encodes every word and then declares that words that have the same encoding "sound alike."&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;To work with a non-English language, you would have to determine a similar encoding that encodes the sounds (based on characters) in the desired language. You would then write an FCMP function that implements the algorithm and can be used in the DATA step. Theoretically, this is possible for languages that use single-byte characters (such as Spanish and French), but it becomes much more difficult for multiple-byte character sets such as Arabic, Korean, or Chinese.&amp;nbsp; In practice, I suspect this would be extremely difficult.&lt;/P&gt;</description>
      <pubDate>Wed, 21 Jun 2023 18:08:57 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/Soundex-for-Arabic-Language/m-p/881757#M43193</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2023-06-21T18:08:57Z</dc:date>
    </item>
    <item>
      <title>Re: Soundex for Arabic Language</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/Soundex-for-Arabic-Language/m-p/881800#M43194</link>
      <description>&lt;P&gt;What is the actual business problem you are trying to solve by using SOUNDEX? If you explain that then maybe there is another way that might work.&lt;/P&gt;</description>
      <pubDate>Wed, 21 Jun 2023 20:07:08 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/Soundex-for-Arabic-Language/m-p/881800#M43194</guid>
      <dc:creator>SASKiwi</dc:creator>
      <dc:date>2023-06-21T20:07:08Z</dc:date>
    </item>
    <item>
      <title>Re: Soundex for Arabic Language</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/Soundex-for-Arabic-Language/m-p/881846#M43196</link>
      <description>I have 420 Business Natures for over 400K Clients yet its in Arabic with . There are several similar Business Natures within these 420 options that could be grouped together and accordingly lessen the options for further statistical analysis.</description>
      <pubDate>Thu, 22 Jun 2023 07:06:48 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/Soundex-for-Arabic-Language/m-p/881846#M43196</guid>
      <dc:creator>DDawaba</dc:creator>
      <dc:date>2023-06-22T07:06:48Z</dc:date>
    </item>
    <item>
      <title>Re: Soundex for Arabic Language</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/Soundex-for-Arabic-Language/m-p/882060#M43202</link>
      <description>&lt;P&gt;If there are only 420 unique business descriptions, then manually building a lookup table to group them shouldn't take too long. I would put these in a spreadsheet and then type in the groups. Once complete, just import the the spreadsheet back into a SAS dataset and join it back to your original data.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 23 Jun 2023 02:42:52 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/Soundex-for-Arabic-Language/m-p/882060#M43202</guid>
      <dc:creator>SASKiwi</dc:creator>
      <dc:date>2023-06-23T02:42:52Z</dc:date>
    </item>
  </channel>
</rss>

