I have SAS Data set with one of the column defined in HTML format. I have to convert this HTML Column into Text format. I have used Perl based HTML parsers, but it is not working.
We have French Alphabet characters in above comments (first line; â, ê, î, ô, û); and it should be available as actual French text characters (â, ê, î, ô, û) post conversion.
Is there any way to do it in SAS?
If not in SAS programming, is there any way to show it as a report as we are displaying above data into a Stored Process report?
Thanks in advance for your help.
So you want to remove html tags and just keep the text?
Or do you want to reuse the html as is in a SAS generated report?
How about removing tags with this simple (but not fail proof) syntax?
data T;
HTML='<a>sfdsfdsf</a>sdsd';
TEXT=prxchange('s/<[^>]*?>//',-1,HTML);
run;
SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.