Hello,
I am download a file using encoding = "UFT-8" and I have found that I have emoji into the xlm file.
I need to remove only the emoji from the xml file without changing the file structure.
How do we do that?
Please note that accented characters need to be kept into the xml file due to French language.
ex;
What I have:
<QID2_TEXT>La dame a été très patience et gentille et que ça fait longtemps que je suis assurée avec vous.
Si c’était possible, je me demandais s’il n’y aurais pas possibilité de diminuer le coût de l’assurance? 🙂</QID2_TEXT>
What I want:
<QID2_TEXT>La dame a été très patience et gentille et que ça fait longtemps que je suis assurée avec vous.
Si c’était possible, je me demandais s’il n’y aurais pas possibilité de diminuer le coût de l’assurance? </QID2_TEXT>
You could remove all unwanted characters using Perl.
Example of the Perl syntax to process a string:
echo ' Cœur coût de l’assurance? 🙂</QID2_TEXT>' | perl -C -pe 's/[^[:alnum:][:space:][:punct:]]+//g'
where you only keep alphanumeric characters, spaces and punctuation symbols results in:
Cœur coût de l’assurance? </QID2_TEXT>
1. Can't you have your SAS session using UTF-8? Your organisation should move to UTF8 to avoid this kind of headaches.
2. Note that the alnum
posix expression is locale-specific
I am loosing the apostrophe. How to keep / allow apostrophe
Cur coût de lassurance? </QID2_TEXT>
How to apply your perl script to the xml file
ex:
perl -C -pe 's/[^èàûéîôÇÉÇÈ"@-_<>[:ascii:][:alnum:][:space:][:punct:]]+//g' /finsys.../VirageSurvey_2.xml
The answer is easy to find if you'd just search.
perl -pe 's/[..]//g' < file.xml > file2.xml
It works well thank you. But I still have the issue with the apostrophe that are missing. Do you know a work around to keep apostrophe since punct does not keep it.
If you have more characters to conserve, just add them to the list.
Different characters can be used for apostrophes, beyond the single quote that punct preserves, like
' ’ ’
Save $250 on SAS Innovate and get a free advance copy of the new SAS For Dummies book! Use the code "SASforDummies" to register. Don't miss out, May 6-9, in Orlando, Florida.
Check out this tutorial series to learn how to build your own steps in SAS Studio.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.