<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic how to remove emoji from an xml file using a unix command in SAS Enterprise Guide</title>
    <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/how-to-remove-emoji-from-an-xml-file-using-a-unix-command/m-p/934544#M44927</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am download a file using encoding = "UFT-8" and I have found that I have emoji into the xlm file.&lt;/P&gt;
&lt;P&gt;I need to remove only the emoji from the xml file without changing the file structure.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;How do we do that?&lt;/P&gt;
&lt;P&gt;Please note that&amp;nbsp;&lt;SPAN&gt;accented characters need to be kept into the xml file due to French language.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;ex;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;What I have:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;lt;QID2_TEXT&amp;gt;La dame a été très patience et gentille et que ça fait longtemps que je suis assurée avec vous. &lt;BR /&gt;Si c’était possible, je me demandais s’il n’y aurais pas possibilité de diminuer le coût de l’assurance? &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&amp;lt;/QID2_TEXT&amp;gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;What I want:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;lt;QID2_TEXT&amp;gt;La dame a été très patience et gentille et que ça fait longtemps que je suis assurée avec vous. &lt;BR /&gt;Si c’était possible, je me demandais s’il n’y aurais pas possibilité de diminuer le coût de l’assurance? &amp;lt;/QID2_TEXT&amp;gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 03 Jul 2024 14:12:03 GMT</pubDate>
    <dc:creator>alepage</dc:creator>
    <dc:date>2024-07-03T14:12:03Z</dc:date>
    <item>
      <title>how to remove emoji from an xml file using a unix command</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/how-to-remove-emoji-from-an-xml-file-using-a-unix-command/m-p/934544#M44927</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am download a file using encoding = "UFT-8" and I have found that I have emoji into the xlm file.&lt;/P&gt;
&lt;P&gt;I need to remove only the emoji from the xml file without changing the file structure.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;How do we do that?&lt;/P&gt;
&lt;P&gt;Please note that&amp;nbsp;&lt;SPAN&gt;accented characters need to be kept into the xml file due to French language.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;ex;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;What I have:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;lt;QID2_TEXT&amp;gt;La dame a été très patience et gentille et que ça fait longtemps que je suis assurée avec vous. &lt;BR /&gt;Si c’était possible, je me demandais s’il n’y aurais pas possibilité de diminuer le coût de l’assurance? &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&amp;lt;/QID2_TEXT&amp;gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;What I want:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;lt;QID2_TEXT&amp;gt;La dame a été très patience et gentille et que ça fait longtemps que je suis assurée avec vous. &lt;BR /&gt;Si c’était possible, je me demandais s’il n’y aurais pas possibilité de diminuer le coût de l’assurance? &amp;lt;/QID2_TEXT&amp;gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 03 Jul 2024 14:12:03 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/how-to-remove-emoji-from-an-xml-file-using-a-unix-command/m-p/934544#M44927</guid>
      <dc:creator>alepage</dc:creator>
      <dc:date>2024-07-03T14:12:03Z</dc:date>
    </item>
    <item>
      <title>Re: how to remove emoji from an xml file using a unix command</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/how-to-remove-emoji-from-an-xml-file-using-a-unix-command/m-p/934650#M44934</link>
      <description>&lt;P&gt;You could remove all unwanted characters using Perl.&lt;/P&gt;
&lt;P&gt;Example of the Perl syntax to process a string:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;echo ' Cœur coût de l’assurance? 🙂&amp;lt;/QID2_TEXT&amp;gt;' | perl -C -pe 's/[^[:alnum:][:space:][:punct:]]+//g'&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;where you only keep alphanumeric characters, spaces and p&lt;SPAN&gt;unctuation symbols&lt;/SPAN&gt; results in:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="courier new,courier" color="#808080"&gt;&amp;nbsp;Cœur coût de l’assurance? &amp;lt;/QID2_TEXT&amp;gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="courier new,courier" color="#808080"&gt;1&lt;/FONT&gt;. Can't you have your SAS session using UTF-8? Your organisation should move to UTF8 to avoid this kind of headaches.&lt;/P&gt;
&lt;P&gt;2. Note that the &lt;FONT size="4"&gt;&lt;CODE class=" language-sas"&gt;alnum&lt;/CODE&gt;&lt;/FONT&gt; posix expression is locale-specific&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 04 Jul 2024 05:55:23 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/how-to-remove-emoji-from-an-xml-file-using-a-unix-command/m-p/934650#M44934</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2024-07-04T05:55:23Z</dc:date>
    </item>
    <item>
      <title>Re: how to remove emoji from an xml file using a unix command</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/how-to-remove-emoji-from-an-xml-file-using-a-unix-command/m-p/934671#M44937</link>
      <description>&lt;P&gt;I am loosing the apostrophe. How to keep / allow apostrophe&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Cur coût de lassurance? &amp;lt;/QID2_TEXT&amp;gt;&lt;/P&gt;</description>
      <pubDate>Thu, 04 Jul 2024 10:01:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/how-to-remove-emoji-from-an-xml-file-using-a-unix-command/m-p/934671#M44937</guid>
      <dc:creator>alepage</dc:creator>
      <dc:date>2024-07-04T10:01:31Z</dc:date>
    </item>
    <item>
      <title>Re: how to remove emoji from an xml file using a unix command</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/how-to-remove-emoji-from-an-xml-file-using-a-unix-command/m-p/934674#M44938</link>
      <description>That's strange. That's not the case for me.&lt;BR /&gt;The apostrophe is part of [:punct:] so should be conserved.&lt;BR /&gt;&lt;A href="https://www.regular-expressions.info/posixbrackets.html" target="_blank"&gt;https://www.regular-expressions.info/posixbrackets.html&lt;/A&gt;&lt;BR /&gt;[:punct:]&lt;BR /&gt;Punctuation (and symbols).&lt;BR /&gt;!"#$%&amp;amp;amp;'()*+,-./\:;&amp;amp;lt;=&amp;amp;gt;?@[]^_`{|}~</description>
      <pubDate>Thu, 04 Jul 2024 10:47:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/how-to-remove-emoji-from-an-xml-file-using-a-unix-command/m-p/934674#M44938</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2024-07-04T10:47:10Z</dc:date>
    </item>
    <item>
      <title>Re: how to remove emoji from an xml file using a unix command</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/how-to-remove-emoji-from-an-xml-file-using-a-unix-command/m-p/934684#M44939</link>
      <description>&lt;P&gt;How to apply your perl script to the xml file&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;ex:&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;perl -C -pe 's/[^èàûéîôÇÉÇÈ"@-_&amp;lt;&amp;gt;[:ascii:][:alnum:][:space:][:punct:]]+//g' /finsys.../VirageSurvey_2.xml&lt;/P&gt;</description>
      <pubDate>Thu, 04 Jul 2024 11:34:19 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/how-to-remove-emoji-from-an-xml-file-using-a-unix-command/m-p/934684#M44939</guid>
      <dc:creator>alepage</dc:creator>
      <dc:date>2024-07-04T11:34:19Z</dc:date>
    </item>
    <item>
      <title>Re: how to remove emoji from an xml file using a unix command</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/how-to-remove-emoji-from-an-xml-file-using-a-unix-command/m-p/934685#M44940</link>
      <description>&lt;P&gt;The answer is easy to find if you'd just search. &lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;perl -pe 's/[..]//g' &amp;lt; file.xml &amp;gt; file2.xml&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Thu, 04 Jul 2024 12:01:59 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/how-to-remove-emoji-from-an-xml-file-using-a-unix-command/m-p/934685#M44940</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2024-07-04T12:01:59Z</dc:date>
    </item>
    <item>
      <title>Re: how to remove emoji from an xml file using a unix command</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/how-to-remove-emoji-from-an-xml-file-using-a-unix-command/m-p/934692#M44941</link>
      <description>&lt;P&gt;It works well thank you. But I still have the issue with the apostrophe that are missing.&amp;nbsp; Do you know a work around to keep apostrophe since punct does not keep it.&lt;/P&gt;</description>
      <pubDate>Thu, 04 Jul 2024 14:15:25 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/how-to-remove-emoji-from-an-xml-file-using-a-unix-command/m-p/934692#M44941</guid>
      <dc:creator>alepage</dc:creator>
      <dc:date>2024-07-04T14:15:25Z</dc:date>
    </item>
    <item>
      <title>Re: how to remove emoji from an xml file using a unix command</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/how-to-remove-emoji-from-an-xml-file-using-a-unix-command/m-p/934719#M44945</link>
      <description>&lt;P&gt;If you have more characters to conserve, just add them to the list.&lt;/P&gt;
&lt;P&gt;Different characters can be used for apostrophes, beyond the single quote that &lt;EM&gt;punct&lt;/EM&gt; preserves, like&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt; ' &lt;FONT face="courier new,courier" color="#808080"&gt;’&lt;/FONT&gt;&amp;nbsp;&lt;SPAN class="box"&gt;’&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 04 Jul 2024 23:37:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/how-to-remove-emoji-from-an-xml-file-using-a-unix-command/m-p/934719#M44945</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2024-07-04T23:37:21Z</dc:date>
    </item>
  </channel>
</rss>

