<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to create a data step version of your data AKA generate sample data for forums in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/RegEx-and-Arabic-letters/m-p/664820#M198677</link>
    <description>&lt;P&gt;Also, if the Arabic letters are encoded using UTF-8, the "common" string functions (such as &lt;FONT size="2"&gt;compress()&lt;/FONT&gt; or &lt;FONT size="2"&gt;prxchange()&lt;/FONT&gt; ) will not work.&amp;nbsp;You must use MBCS-aware functions, such as &lt;FONT size="2"&gt;kcompress()&lt;/FONT&gt;&amp;nbsp;. Not all single-byte functions have a multi-byte equivalent.&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Thu, 25 Jun 2020 00:59:30 GMT</pubDate>
    <dc:creator>ChrisNZ</dc:creator>
    <dc:date>2020-06-25T00:59:30Z</dc:date>
    <item>
      <title>RegEx and Arabic letters</title>
      <link>https://communities.sas.com/t5/SAS-Programming/RegEx-and-Arabic-letters/m-p/664689#M198663</link>
      <description>&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data _null_;
	v1 = "aYT;-/*A - تwsilيتيتddيت يlămâieD  @يlămâieD D^)M";
	v2=upcase(v1);
	v3 = prxchange("s/[^-A-Z]/ /",-1,v2); 
    v4 = compress(v3, , 'kw');
put (_all_) (=/);
run; 

&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;I want to keep only English characters ,hyphen "-" , and Arabic characters .this code worked fine for the&amp;nbsp; English characters and hyphen but it escape the Arabic characters&amp;nbsp; , how can I keep the Arabic&amp;nbsp;characters&amp;nbsp; too !!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 24 Jun 2020 23:49:07 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/RegEx-and-Arabic-letters/m-p/664689#M198663</guid>
      <dc:creator>nour_anwar</dc:creator>
      <dc:date>2020-06-24T23:49:07Z</dc:date>
    </item>
    <item>
      <title>Re: How to create a data step version of your data AKA generate sample data for forums</title>
      <link>https://communities.sas.com/t5/SAS-Programming/RegEx-and-Arabic-letters/m-p/664797#M198664</link>
      <description>&lt;P&gt;Moved and re-titled.&lt;/P&gt;
&lt;P&gt;Did you try letter intervals like A-Z ?&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Or even adding all letters to the character class [ ]&amp;nbsp; ?&lt;/P&gt;</description>
      <pubDate>Thu, 25 Jun 2020 00:52:20 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/RegEx-and-Arabic-letters/m-p/664797#M198664</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2020-06-25T00:52:20Z</dc:date>
    </item>
    <item>
      <title>Re: How to create a data step version of your data AKA generate sample data for forums</title>
      <link>https://communities.sas.com/t5/SAS-Programming/RegEx-and-Arabic-letters/m-p/664820#M198677</link>
      <description>&lt;P&gt;Also, if the Arabic letters are encoded using UTF-8, the "common" string functions (such as &lt;FONT size="2"&gt;compress()&lt;/FONT&gt; or &lt;FONT size="2"&gt;prxchange()&lt;/FONT&gt; ) will not work.&amp;nbsp;You must use MBCS-aware functions, such as &lt;FONT size="2"&gt;kcompress()&lt;/FONT&gt;&amp;nbsp;. Not all single-byte functions have a multi-byte equivalent.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 25 Jun 2020 00:59:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/RegEx-and-Arabic-letters/m-p/664820#M198677</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2020-06-25T00:59:30Z</dc:date>
    </item>
    <item>
      <title>Re: RegEx and Arabic letters</title>
      <link>https://communities.sas.com/t5/SAS-Programming/RegEx-and-Arabic-letters/m-p/664936#M198720</link>
      <description>&lt;P&gt;I found this note in the docs of prxmatch:&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;This function is assigned an I18N Level 0 status, and is designed for SBCS data. Do not use this function to process DBCS or MBCS data.&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Same problem with compress, fortunately kcompress exists, so maybe this is what you need:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data _null_;
	v1 = "aYT;-/*A - تwsilيتيتddيت يlămâieD  @يlămâieD D^)M";
	v2 = upcase(v1);
   v3 = kcompress(v2, , 'kw');
   put (_all_) (=/);
run; &lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Thu, 25 Jun 2020 09:56:32 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/RegEx-and-Arabic-letters/m-p/664936#M198720</guid>
      <dc:creator>andreas_lds</dc:creator>
      <dc:date>2020-06-25T09:56:32Z</dc:date>
    </item>
    <item>
      <title>Re: RegEx and Arabic letters</title>
      <link>https://communities.sas.com/t5/SAS-Programming/RegEx-and-Arabic-letters/m-p/664977#M198741</link>
      <description>Not sure.&lt;BR /&gt;Could Try &lt;BR /&gt;&lt;BR /&gt;Kcompress()&lt;BR /&gt;Ksubstr()&lt;BR /&gt;Klength()</description>
      <pubDate>Thu, 25 Jun 2020 12:04:14 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/RegEx-and-Arabic-letters/m-p/664977#M198741</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2020-06-25T12:04:14Z</dc:date>
    </item>
  </channel>
</rss>

