I have SAS Data set with one of the column defined in HTML format. I have to convert this HTML Column into Text format. I have used Perl based HTML parsers, but it is not working.
We have French Alphabet characters in above comments (first line; â, ê, î, ô, û); and it should be available as actual French text characters (â, ê, î, ô, û) post conversion.
Is there any way to do it in SAS?
If not in SAS programming, is there any way to show it as a report as we are displaying above data into a Stored Process report?
Thanks in advance for your help.
So you want to remove html tags and just keep the text?
Or do you want to reuse the html as is in a SAS generated report?
How about removing tags with this simple (but not fail proof) syntax?
data T;
HTML='<a>sfdsfdsf</a>sdsd';
TEXT=prxchange('s/<[^>]*?>//',-1,HTML);
run;
Join us for SAS Innovate 2025, our biggest and most exciting global event of the year, in Orlando, FL, from May 6-9. Sign up by March 14 for just $795.
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.