BookmarkSubscribeRSS Feed
charles1
Calcite | Level 5

There is an option in the Text Topics Node in SAS Enterprise Miner 13.1 that allows the user to have "Correlated Topics" (i.e. the options are either "Yes" or "No"). Now, the SAS Text Topics Node uses the resulting text topics and the singular value decomposition create numeric vectors, and my questions are this:

 

1) What method is used to create the topics (LSA, LDA etc.)

 

2) How does SAS represent the topics? (I assume that they are represented as vectors)

 

3) Most important question: If the correlated option is set to "Yes" then what exactly happens? Are the correlated text topic vectors combined into a single vector ?

 

 

Any insights would be appreciated. 

1 REPLY 1
Damien_Mather
Lapis Lazuli | Level 10

to the best of my knowledge:

 

1. SVD

2. therefore either rotated or unrotated eigenvectors, depending on 'allow correlated topics' node property setting

3. no, the topics are somewhat correlated whilst within-vector variance is minimised, which in turn increases the likelihood of a human interpreting the topic theme with a more relevant, grammatical phrase, based on the most influential topic terms.

 

If I'm wrong, I'd appreciate a correction.

SAS Innovate 2025: Call for Content

Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!

Submit your idea!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 1379 views
  • 0 likes
  • 2 in conversation