I have SAS Data set with one of the column defined in HTML format. I have to convert this HTML Column into Text format. I have used Perl based HTML parsers, but it is not working.
We have French Alphabet characters in above comments (first line; â, ê, î, ô, û); and it should be available as actual French text characters (â, ê, î, ô, û) post conversion.
Is there any way to do it in SAS?
If not in SAS programming, is there any way to show it as a report as we are displaying above data into a Stored Process report?
Thanks in advance for your help.
So you want to remove html tags and just keep the text?
Or do you want to reuse the html as is in a SAS generated report?
How about removing tags with this simple (but not fail proof) syntax?
data T;
HTML='<a>sfdsfdsf</a>sdsd';
TEXT=prxchange('s/<[^>]*?>//',-1,HTML);
run;
April 27 – 30 | Gaylord Texan | Grapevine, Texas
Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and lock in 2025 pricing—just $495!
Still thinking about your presentation idea? The submission deadline has been extended to Friday, Nov. 14, at 11:59 p.m. ET.
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.