Text mining and content categorization

Text Topics

Reply
New Contributor
Posts: 4

Text Topics

Hello,

i am using Text Topics node and there is no property for the algorithm that will be used to make the clustering. 

Does anybody know if there is any default algorithm?

In Text Miner Help, it only says that the input is term weight.

SAS Employee
Posts: 34

Re: Text Topics

Posted in reply to charismast0

The algorithm for text topic discovery is based on the SVD. One of the factors of the SVD  gives a numTerm by K(number of topics) matrix of weights or loadings. That matrix is rotated to increase the separation between large and small term weights in each of those vectors. Terms with a larger weight in each of the K vectors "define" the topic and these vectors can be multiplied with any given document vector to produce document scores for its membership in each topic.

Ask a Question
Discussion stats
  • 1 reply
  • 103 views
  • 0 likes
  • 2 in conversation