<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Need to use COMPGED or COMPLEV function on multi-byte text data in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Need-to-use-COMPGED-or-COMPLEV-function-on-multi-byte-text-data/m-p/484086#M125614</link>
    <description>&lt;P&gt;I guess you'd have to compute the distance yourself using the k* functions.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;1. That's a good ballot entry.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;2. Choices must be made, as this is not a straightforward computation. What is the distance between&amp;nbsp;&lt;CODE&gt;'hä'&lt;/CODE&gt; and &lt;CODE&gt;'hà'&lt;/CODE&gt;?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;3. Syllabic scripts or ideograms would provide nice head-scratchers (though there may already be algorithms for these).&lt;/P&gt;
&lt;P&gt;Even comparing alphabetic scripts such as Arabic would not be easy, because the shape of a letter changes depending on its position in the word.&lt;/P&gt;
&lt;P&gt;&lt;FONT size="5"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ـب &amp;nbsp;&amp;nbsp; &amp;nbsp;ـبـ&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; بـ &amp;nbsp;&amp;nbsp; &amp;nbsp;ب&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;/FONT&gt; are all the letter B.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Sun, 05 Aug 2018 02:07:16 GMT</pubDate>
    <dc:creator>ChrisNZ</dc:creator>
    <dc:date>2018-08-05T02:07:16Z</dc:date>
    <item>
      <title>Need to use COMPGED or COMPLEV function on multi-byte text data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Need-to-use-COMPGED-or-COMPLEV-function-on-multi-byte-text-data/m-p/484073#M125609</link>
      <description>&lt;P&gt;Hi, all&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I need to use something like the "compged" or "complev" function to compare two text strings, but I need to process UTF-8 data containing weird and wild characters. The SAS NLS guides say that these two functions aren't certified for multi-byte character data.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Does anybody have any suggestions for how I can do this?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Much thanks,&lt;BR /&gt; Tom&lt;/P&gt;</description>
      <pubDate>Sat, 04 Aug 2018 23:45:41 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Need-to-use-COMPGED-or-COMPLEV-function-on-multi-byte-text-data/m-p/484073#M125609</guid>
      <dc:creator>TomKari</dc:creator>
      <dc:date>2018-08-04T23:45:41Z</dc:date>
    </item>
    <item>
      <title>Re: Need to use COMPGED or COMPLEV function on multi-byte text data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Need-to-use-COMPGED-or-COMPLEV-function-on-multi-byte-text-data/m-p/484086#M125614</link>
      <description>&lt;P&gt;I guess you'd have to compute the distance yourself using the k* functions.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;1. That's a good ballot entry.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;2. Choices must be made, as this is not a straightforward computation. What is the distance between&amp;nbsp;&lt;CODE&gt;'hä'&lt;/CODE&gt; and &lt;CODE&gt;'hà'&lt;/CODE&gt;?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;3. Syllabic scripts or ideograms would provide nice head-scratchers (though there may already be algorithms for these).&lt;/P&gt;
&lt;P&gt;Even comparing alphabetic scripts such as Arabic would not be easy, because the shape of a letter changes depending on its position in the word.&lt;/P&gt;
&lt;P&gt;&lt;FONT size="5"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ـب &amp;nbsp;&amp;nbsp; &amp;nbsp;ـبـ&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; بـ &amp;nbsp;&amp;nbsp; &amp;nbsp;ب&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;/FONT&gt; are all the letter B.&lt;/P&gt;
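&lt;P&gt;To make point 2 concrete, here is a minimal sketch in Python rather than SAS, so treat it as an illustration of the choices and not as a COMPLEV replacement. It counts edits in characters instead of bytes, and the hypothetical fold flag decides whether accents matter. The names levenshtein and fold_accents are my own.&lt;/P&gt;

```python
import unicodedata


def fold_accents(text):
    """Decompose to NFD and drop combining marks, so ä and à both become a."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def levenshtein(a, b, fold=False):
    """Plain Levenshtein distance counted in Unicode code points, not bytes."""
    if fold:
        a, b = fold_accents(a), fold_accents(b)
    else:
        # NFC so that precomposed and decomposed spellings compare equal.
        a = unicodedata.normalize("NFC", a)
        b = unicodedata.normalize("NFC", b)
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


print(levenshtein("hä", "hà"))             # 1: one substituted character
print(levenshtein("hä", "hà", fold=True))  # 0: both fold to "ha"
```

&lt;P&gt;Whether the answer should be 1 or 0 is exactly the kind of choice I mean. I would expect the same loop to be writable in a DATA step with KLENGTH and KSUBSTR, but I have not tried it.&lt;/P&gt;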
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
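&lt;P&gt;For the Arabic case, one possible pre-processing step before computing any distance is sketched below in Python. It assumes the positional shapes arrive either as the base letter plus tatweel (the joining stroke, U+0640), as typed above, or as Unicode presentation forms (U+FE8F to U+FE92 for this letter). The helper name canonical_arabic is hypothetical.&lt;/P&gt;

```python
import unicodedata

TATWEEL = "\u0640"  # joining stroke used to draw connected letter shapes


def canonical_arabic(text):
    """Map presentation forms to base letters (NFKC), then drop tatweel."""
    return unicodedata.normalize("NFKC", text).replace(TATWEEL, "")


forms = [
    "\u0640\u0628",        # final shape drawn with tatweel
    "\u0640\u0628\u0640",  # medial shape drawn with tatweel
    "\u0628\u0640",        # initial shape drawn with tatweel
    "\u0628",              # isolated base letter
    "\uFE8F", "\uFE90", "\uFE91", "\uFE92",  # presentation forms
]

# Every variant collapses to the single base letter U+0628.
print({canonical_arabic(f) for f in forms} == {"\u0628"})  # True
```

&lt;P&gt;This only covers the shaping question. It says nothing about vowel marks or ligatures, which would need their own decisions.&lt;/P&gt;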
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 05 Aug 2018 02:07:16 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Need-to-use-COMPGED-or-COMPLEV-function-on-multi-byte-text-data/m-p/484086#M125614</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2018-08-05T02:07:16Z</dc:date>
    </item>
    <item>
      <title>Re: Need to use COMPGED or COMPLEV function on multi-byte text data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Need-to-use-COMPGED-or-COMPLEV-function-on-multi-byte-text-data/m-p/484099#M125618</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/15142"&gt;@TomKari&lt;/a&gt;: Do a Google search for:&amp;nbsp;generalized edit distance utf-8 r&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;There are a number of R packages available.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Art, CEO, AnalystFinder.com&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 05 Aug 2018 03:25:26 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Need-to-use-COMPGED-or-COMPLEV-function-on-multi-byte-text-data/m-p/484099#M125618</guid>
      <dc:creator>art297</dc:creator>
      <dc:date>2018-08-05T03:25:26Z</dc:date>
    </item>
    <item>
      <title>Re: Need to use COMPGED or COMPLEV function on multi-byte text data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Need-to-use-COMPGED-or-COMPLEV-function-on-multi-byte-text-data/m-p/484101#M125620</link>
      <description>&lt;P&gt;Look at the BASECHAR function in NLS. Without the second argument, it returns an ASCII version of your string. At least, that's what the documentation example suggests.&lt;/P&gt;</description>
      <pubDate>Sun, 05 Aug 2018 04:54:08 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Need-to-use-COMPGED-or-COMPLEV-function-on-multi-byte-text-data/m-p/484101#M125620</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2018-08-05T04:54:08Z</dc:date>
    </item>
  </channel>
</rss>

