BookmarkSubscribeRSS Feed
Deirdre
Calcite | Level 5

I've been carrying out text mining in E Miner and have added text topics to my process flow. This works fine for the most part, but suddenly after a number of iterations, some of the important topics that had been generated in previous iterations no longer appear, even though they are some of the most frequently occurring topics, and as far as I can see I did not do anything that should exclude them. Has anyone else had this issue or know anything about how to solve it or what causes it?

I have been thinking that maybe once the term reaches a threshold number of mentions it is no longer pulled out as a topic but I'm not sure this makes sense as a consistent reason for this issue.

Thanks!

1 REPLY 1
JamesCoxPhD
SAS Employee

I guess my question back would be whether you have changed any of them to "user topics".  If you make any change to a topic, such as its name, any of the topic weights, the term or document cutoff, then the topic is changed into a user topic.  Then when you generate new multi-term topics, it intentionally eliminates the one that is closest to the topic you have modified: that way you don't have to look at essentially the same topic twice.  Does this correspond to what you are seeing?

sas-innovate-2024.png

Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.

Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.

 

Register now!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 940 views
  • 0 likes
  • 2 in conversation