The text topic discovery algorithm is based on the singular value decomposition (SVD). One of the SVD factors is a numTerm-by-K matrix of term weights (loadings), where K is the number of topics. That matrix is rotated to increase the separation between large and small term weights within each of its K column vectors. The terms with the largest weights in a given vector "define" that topic, and each vector can be multiplied with any document vector to produce that document's score for membership in the corresponding topic.
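To make the steps concrete, here is a minimal sketch in Python with NumPy. It is not the SAS implementation. It assumes a varimax rotation (the description above only says the matrix is "rotated"), uses the unscaled left singular vectors as the term loadings, and runs on a made-up term-by-document count matrix. The names `varimax`, `counts`, `terms`, and `new_doc` are hypothetical.

```python
import numpy as np

def varimax(loadings, max_iter=100, tol=1e-8):
    """Orthogonally rotate a (numTerm x K) loading matrix with the varimax criterion."""
    n_terms, k = loadings.shape
    rotation = np.eye(k)
    prev = 0.0
    for _ in range(max_iter):
        rotated = loadings @ rotation
        # Gradient of the varimax criterion, projected back onto rotations via SVD.
        grad = loadings.T @ (
            rotated ** 3 - rotated @ np.diag((rotated ** 2).sum(axis=0)) / n_terms
        )
        u, s, vt = np.linalg.svd(grad)
        rotation = u @ vt
        crit = s.sum()
        if crit < prev * (1 + tol):  # stop once the criterion no longer improves
            break
        prev = crit
    return loadings @ rotation, rotation

# Hypothetical toy data: rows are terms, columns are documents.
terms = ["svd", "matrix", "rotation", "goal", "match", "team"]
counts = np.array([
    [2, 1, 0, 0, 1],
    [1, 2, 0, 0, 0],
    [1, 1, 0, 0, 1],
    [0, 0, 2, 1, 0],
    [0, 0, 1, 2, 1],
    [0, 0, 1, 1, 0],
], dtype=float)

K = 2  # number of topics
U, s, Vt = np.linalg.svd(counts, full_matrices=False)
loadings = U[:, :K]                      # numTerm x K matrix of term weights
rotated, rotation = varimax(loadings)    # rotated term weights, one column per topic

# Terms with the largest absolute weight "define" each topic.
for topic in range(K):
    top = np.argsort(-np.abs(rotated[:, topic]))[:3]
    print(f"topic {topic}:", [terms[i] for i in top])

# Score a document: multiply its term vector with the rotated topic vectors.
new_doc = np.array([1, 1, 0, 0, 0, 0], dtype=float)  # contains "svd" and "matrix"
scores = new_doc @ rotated
print("topic scores:", scores)
```

The sign of each topic vector is arbitrary in an SVD, so scores are usually compared by magnitude, or the vectors are flipped so that the dominant terms are positive.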