BookmarkSubscribeRSS Feed
Lea
Calcite | Level 5 Lea
Calcite | Level 5


Hi all:

I am trying to clean up a field of free text where our sales reps can go in and enter data on how their sales calls went, what they discussed, etc.  I'm able to remove most of the garbage, but some of our reps first put their comments into Word and then copy into the text field.  This brings over all kinds of rtf code that I want to omit from the data, and I can't think of a clean way of removing it all because sometimes it precedes their comments, sometimes it envelopes it, etc.

Does anyone know if there are functions that cover the rtf code??  Here is what I have so far:

SNAGIT_SAS.png

1 REPLY 1
CraigDeVault
SAS Employee

WIthin SAS Text Miner, the Text Import node (or the %tmfilter macro) has the ability to strip out any RTF specific code and then output a copy of the document as a simple text file.  I would try this method as the standard way to create a text document from a RTF document.

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 908 views
  • 0 likes
  • 2 in conversation