saibhavana
Calcite | Level 5

Hi,

 

I have a 100-page Word document in which each heading is highlighted in bold and underlined. I want to extract paragraphs from the document based on certain keywords, along with the heading each paragraph falls under.

1 REPLY 1
Damien_Mather
Lapis Lazuli | Level 10
Break the single document up into several hundred document files in a folder, each containing just one paragraph. Then import the corpus with the SAS Enterprise Miner Text Miner Text Import node, and process the corpus with a Text Topic node, creating one or more user-defined topics based on your keywords. The exported SAS data set will have one row per paragraph, with an interval-measure topic score and a binary presence/absence flag added for each topic.
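The splitting step happens outside SAS, so here is a minimal Python sketch of one way it could be done. It assumes the document has already been read into a list of `(is_heading, text)` pairs, for example by checking the bold and underline flags of each paragraph with a Word-reading library; that extraction step is not shown. The function names, the `para_0001.txt` file naming, and the simple case-insensitive substring match are all illustrative assumptions, not part of Enterprise Miner.

```python
from pathlib import Path


def pair_with_headings(blocks):
    """Attach to each body paragraph the most recent heading seen before it.

    `blocks` is an iterable of (is_heading, text) pairs in document order
    (hypothetical output of an upstream Word-extraction step).
    """
    heading = ""
    pairs = []
    for is_heading, text in blocks:
        text = text.strip()
        if not text:
            continue  # skip empty paragraphs
        if is_heading:
            heading = text
        else:
            pairs.append((heading, text))
    return pairs


def filter_by_keywords(pairs, keywords):
    """Keep paragraphs containing any keyword (case-insensitive substring match)."""
    lowered = [k.lower() for k in keywords]
    return [(h, p) for h, p in pairs if any(k in p.lower() for k in lowered)]


def write_corpus(pairs, folder):
    """Write one text file per paragraph, with its heading on the first line."""
    out = Path(folder)
    out.mkdir(parents=True, exist_ok=True)
    paths = []
    for i, (heading, paragraph) in enumerate(pairs, start=1):
        path = out / f"para_{i:04d}.txt"
        path.write_text(f"{heading}\n\n{paragraph}\n", encoding="utf-8")
        paths.append(path)
    return paths
```

Calling `write_corpus(pair_with_headings(blocks), "corpus")` would produce the one-paragraph-per-file folder for the Text Import node. `filter_by_keywords` is optional and only useful if a plain keyword match is enough without building topics.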

