BookmarkSubscribeRSS Feed
sateh
Fluorite | Level 6
I have a table where I store a document id and a text variable that contains all the text of the document. I would like to know if there is any function in sas to be able to generate a TFIDF vectorization and a BOW vectorization of the complete collection.

 

1 REPLY 1
ballardw
Super User

FEBA,  LDLC and other acronyms like your TFIDF and BOW should really be spelled out or provide an example. Not every one uses the the same acronym for things. I have worked with people that had their jargon to obfuscate that a process was basically a second-derivative of an equation evaluated at specific values. But I had to work through a lot of stuff to determine that was what was actually needed because they kept insisting on using subject-matter specific jargon.

 

So, what is a TFIDF or BOW and where do I find an example.

 

 

 

 

(For those curious enough to read this far FEBA = Forward Edge of the Battle Area, LDLC= Line of Departure Line of contact, both graphic concepts on 1980's military maps (the jargon changes periodically), are closely related and could be "vectorized" but I wouldn't bother )

sas-innovate-2024.png

Available on demand!

Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.

 

Register now!

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 1 reply
  • 174 views
  • 0 likes
  • 2 in conversation