BookmarkSubscribeRSS Feed
JuliaM
Calcite | Level 5

Hi all,

Could someone tell me what kinds of algorithms are used with the SAS Content Categorization? I do not want to see the exact formula, just the names of the types of algorithms used such as K-nearest neighbor, decision trees, naive Bayes, greedy search, etc. Thanks in advance.

1 REPLY 1
mdwallis
SAS Employee

Hi JuliaM,

There are a variety of algorithms that have been used in SAS Content Categorization, many of which are proprietary implementations.  In general, SAS Content Categorization uses algorithms from finite state automata (FSA) while others are based on graph theory leveraging various searching approaches (e.g., depth first search).  Other more widely known algorithms include using frequent phrase extraction and maximum entropy classification.

Do you have any specific parts of SAS Content Categorization that you are interested in knowing more about with respect to algorithms in use?  I can help in this sense by narrowing in to a smaller scope.

Thank you!

Michael Wallis

hackathon24-white-horiz.png

2025 SAS Hackathon: There is still time!

Good news: We've extended SAS Hackathon registration until Sept. 12, so you still have time to be part of our biggest event yet – our five-year anniversary!

Register Now

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 1541 views
  • 0 likes
  • 2 in conversation