Lea
Calcite | Level 5


Hi all:

I am trying to clean up a free-text field where our sales reps enter notes on how their sales calls went, what they discussed, etc.  I'm able to remove most of the garbage, but some of our reps first write their comments in Word and then copy them into the text field.  This brings over all kinds of RTF code that I want to omit from the data, and I can't think of a clean way of removing it all because sometimes it precedes their comments, sometimes it envelops them, etc.

Does anyone know if there are functions that handle RTF code?  Here is what I have so far:

[Screenshot: SNAGIT_SAS.png]

1 REPLY
CraigDeVault
SAS Employee

Within SAS Text Miner, the Text Import node (or the %tmfilter macro) can strip out any RTF-specific code and then output a copy of the document as a simple text file.  I would try this method first, as it is the standard way to create a text document from an RTF document.
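
If Text Miner is not an option, a regex-based pass is one possible fallback. The following is only a rough sketch in Python, not what the Text Import node or %tmfilter does internally. It assumes the pasted markup is ordinary RTF (lowercase control words such as \par or \fs22, brace-delimited groups, \'hh hex escapes in the Windows-1252 code page). The function name strip_rtf and the list of skipped destinations are hypothetical choices made for illustration, and the list is deliberately partial.

```python
import re

# Groups whose whole content is metadata rather than user text.
# Assumption: this partial list covers what Word typically pastes.
SKIP_DESTINATIONS = {"fonttbl", "colortbl", "stylesheet", "info", "pict"}

# Control words that stand for whitespace in the visible text.
WHITESPACE_WORDS = {"par", "line", "tab"}

TOKEN = re.compile(
    r"\\([a-z]+)(-?\d+)? ?"    # control word, optional number, optional delimiter space
    r"|\\'([0-9a-fA-F]{2})"    # hex-escaped character such as \'e9
    r"|\\(.)"                  # control symbol such as \{ \} \\ \*
    r"|([{}])"                 # group open / close
    r"|([^\\{}]+)",            # plain text
    re.DOTALL,
)


def strip_rtf(text: str) -> str:
    """Return text with RTF markup removed and whitespace collapsed."""
    out = []
    stack = []        # saved 'skipping' flag for each open group
    skipping = False  # True while inside a metadata group
    for match in TOKEN.finditer(text):
        word, _param, hexchar, symbol, brace, plain = match.groups()
        if brace == "{":
            stack.append(skipping)
        elif brace == "}":
            # Restore the state of the enclosing group; tolerate stray braces.
            skipping = stack.pop() if stack else False
        elif word:
            if word in SKIP_DESTINATIONS:
                skipping = True
            elif not skipping and word in WHITESPACE_WORDS:
                out.append(" ")
        elif symbol:
            if symbol == "*":
                skipping = True  # {\*\...} marks an ignorable destination
            elif not skipping and symbol in "{}\\":
                out.append(symbol)  # escaped literal character
        elif hexchar:
            if not skipping:
                out.append(bytes.fromhex(hexchar).decode("cp1252", errors="replace"))
        elif plain and not skipping:
            out.append(plain)
    # Collapse runs of whitespace left behind by removed markup.
    return " ".join("".join(out).split())


note = (r"Met with Dr. Lee. {\rtf1\ansi\deff0{\fonttbl{\f0 Calibri;}}"
        r"\f0\fs22 Discussed pricing\par Follow up in May.}")
print(strip_rtf(note))
```

Because it tracks brace depth instead of deleting one fixed pattern, the sketch handles markup that precedes the comment as well as markup that wraps it. It will also eat legitimate backslash sequences in the notes (a Windows path, for example), so check it against a sample of real records first. The token patterns are Perl-style, so they should carry over to SAS's PRXCHANGE function, though that route is untested here.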

