mrs_mee
Calcite | Level 5

Hi all,

I feel like this is an easy question that I just can't find an answer to. My Text Topic node results return 15 topics, each containing 5 words. The results set also has a column labeled "Number of Terms" with values from 1 to 4: most of the 15 topics show a value of 1 in this column, with a few 2s, 3s and 4s. What is the significance of this "Number of Terms" column? What does this value mean? It does not correlate with the number of terms shown in the actual topic column. Appreciate any help!

1 ACCEPTED SOLUTION

Accepted Solutions
RussAlbright
SAS Employee

I think that value is the number of terms that had a weight above the term cutoff threshold. Is it possible that you have 15 topics but your data set is so small that not many terms made the cutoff? I just looked at a topics table and the values there are much larger than 5.



5 REPLIES
rayIII
SAS Employee

Hi, Mrs. Mee.

The TT node displays 5 terms per topic, but these are only the top 5. There could be many more terms associated with a given topic.

I think the TT node excludes terms that do not meet the term cutoff (which you can view and edit in the Topic Viewer).

To confirm, try opening the Topic Viewer (available in Node Properties for the TT node) and look at the Terms table. How many terms do you see there for each topic? How many of them meet the Term Cutoff in the Topics table?

Hope this helps.

Ray

mrs_mee
Calcite | Level 5

Thanks for the reply. I am OK with the number of terms per topic; where I am confused is the counter value under the label "Number of Terms". You would think it would correlate with the top 5 and read 5, but it does not. It is always less than 5. I am trying to understand the definition of this particular column.

Thanks!

rayIII
SAS Employee

Hi. Have you tried exploring your topics in the Topic Viewer? It is a different window from the results for the Text Topic node. You'll find it in node properties, and Help should have an example.

Ray

RussAlbright
SAS Employee

I think that value is the number of terms that had a weight above the term cutoff threshold. Is it possible that you have 15 topics but your data set is so small that not many terms made the cutoff? I just looked at a topics table and the values there are much larger than 5.
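
To make the distinction concrete, here is a minimal Python sketch of how a display limited to the top 5 terms and a count of terms above a cutoff can disagree. It is only an illustration of the idea described in this thread, not the Text Topic node's actual computation: the function name `summarize_topic`, the weights, the cutoff value, and the strict "greater than" comparison are all assumptions made up for the example.

```python
# Illustrative sketch only: NOT the SAS Text Topic node's implementation.
# summarize_topic, the weights, and the cutoff below are hypothetical.

def summarize_topic(term_weights, term_cutoff, n_display=5):
    """Return (terms shown in the topic label, count of terms above the cutoff)."""
    # Rank terms from highest to lowest weight.
    ranked = sorted(term_weights.items(), key=lambda kv: kv[1], reverse=True)

    # The topic label always shows the top n_display terms, whatever their weight.
    displayed = [term for term, _ in ranked[:n_display]]

    # "Number of Terms" is assumed here to count only weights strictly above the cutoff.
    n_terms = sum(1 for _, weight in ranked if weight > term_cutoff)

    return displayed, n_terms


# Invented weights for a single topic from a small data set.
weights = {
    "battery": 0.42,
    "charge": 0.11,
    "screen": 0.08,
    "phone": 0.05,
    "cable": 0.03,
    "box": 0.01,
}

displayed, n_terms = summarize_topic(weights, term_cutoff=0.1)
print(displayed)  # five terms are shown in the topic label
print(n_terms)    # only two of them exceed the cutoff
```

With these made-up numbers the topic label lists five terms while the count is 2, which matches the pattern described in the question: a small data set leaves few terms above the cutoff even though five are always displayed.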



mrs_mee
Calcite | Level 5

Thanks Russ!!  Yes, that is entirely possible and the number would make sense if that were the case.

Appreciate your feedback!

