
SAS Text Analytics Tools: An Overview

Started ‎02-06-2013 by
Modified ‎10-05-2015 by

In response to a participant's question, I have written a brief overview of the tools mentioned in our online poll on SAS tools. The header links go to the relevant pages on the SAS website for anyone who wants to read more. I am most familiar with the Enterprise Content Categorization tool, so if you have experience with the other tools, please share how you use them.

 

SAS Enterprise Content Categorization uses a combination of linguistic rules and statistical analysis to classify documents using either hierarchical or flat taxonomies and to extract key information. It has two main components.

 

  • Categorization

The Categorization component classifies documents according to criteria that the user chooses. Users can let algorithms determine patterns, manually create Boolean rules based on pre-existing taxonomies, or combine both approaches.

  • Concepts extraction

The Concepts extraction component lets users extract information that might not be known beforehand. For example, it can be set to extract the names of companies from a set of documents based on pre-set criteria.
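
To make the two components more concrete, here is a minimal Python sketch of the general ideas: Boolean rule categorization and pattern-based concept extraction. It is not SAS code and does not use any SAS API. The rule format, the `RULES` table, the `COMPANY_PATTERN` regular expression, and the function names are all assumptions made up for illustration.

```python
import re

# Hypothetical Boolean rules (not SAS syntax): a document is assigned a
# category when every term in "all" is present and no term in "none" is.
RULES = {
    "finance": {"all": {"bank", "loan"}, "none": {"river"}},
    "health": {"all": {"patient"}, "none": set()},
}

# Hypothetical pre-set criterion for company names: one or more capitalized
# words followed by a legal suffix such as Inc or Ltd.
COMPANY_PATTERN = re.compile(r"\b((?:[A-Z][\w&]*\s)+(?:Inc|Corp|Ltd|LLC))\b")


def tokenize(text):
    """Return the set of lowercase alphabetic tokens in the text."""
    return set(re.findall(r"[a-z]+", text.lower()))


def categorize(text):
    """Return the sorted names of all categories whose rule matches."""
    tokens = tokenize(text)
    return sorted(
        name
        for name, rule in RULES.items()
        if rule["all"] <= tokens and not (rule["none"] & tokens)
    )


def extract_companies(text):
    """Return every substring that matches the company-name pattern."""
    return COMPANY_PATTERN.findall(text)


if __name__ == "__main__":
    doc = "The bank approved the loan for Acme Widgets Inc."
    print(categorize(doc))
    print(extract_companies(doc))
```

A real categorization tool would combine rules like these with statistical models and a much richer rule language; the sketch only shows why the company names do not need to be known in advance, since the pattern describes their shape rather than listing them.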

 

 

 

SAS Ontology Management is used to create the ontologies or taxonomies of terms that SAS Categorization applies to classify documents, as well as the definitions of the concepts that can be extracted. One useful function of SAS Ontology Management is that different users can create different taxonomies to suit their needs while storing the terms in a central repository. This lets users in different parts of an organization see which terms other sections are using.

 

 

SAS Text Miner discovers patterns in text collections that may not be readily apparent and can generate a quantitative representation of text. This representation can be used with data mining algorithms, adding text-based variables to traditional structured data analysis. A case study of the Louisville Hospital shows how this tool can be used in the medical field.
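
As a rough illustration of what a "quantitative representation of text" can look like, the following Python sketch builds a simple term-document count matrix and appends it to structured variables. This is a generic technique, not a description of how SAS Text Miner works internally; the function names and the sample data are hypothetical.

```python
import re
from collections import Counter


def term_document_matrix(docs):
    """Return (vocabulary, rows), where each row holds per-term counts."""
    tokenized = [re.findall(r"[a-z]+", doc.lower()) for doc in docs]
    vocab = sorted({token for tokens in tokenized for token in tokens})
    rows = []
    for tokens in tokenized:
        counts = Counter(tokens)  # missing terms count as 0
        rows.append([counts[term] for term in vocab])
    return vocab, rows


def add_text_variables(structured_rows, text_rows):
    """Append each document's term counts to its structured variables."""
    return [s + t for s, t in zip(structured_rows, text_rows)]


if __name__ == "__main__":
    notes = ["chest pain and fever", "fever fever cough"]  # made-up notes
    ages = [[54], [31]]  # made-up structured variable (age)
    vocab, rows = term_document_matrix(notes)
    print(vocab)
    print(add_text_variables(ages, rows))
```

Each document becomes a numeric row, so the text can sit alongside ordinary structured columns as input to a data mining algorithm.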

 

 

SAS Sentiment Analysis searches for and evaluates internal and external content about a user's organization or its competitors. It identifies positive, negative, neutral and "no sentiment" details within texts and then quantifies them to gauge perceptions of the organization. Using a combination of statistical modeling and rule-based natural language processing, the Sentiment Analysis software can extract sentiments in real time or over a period of time to show patterns and detailed reactions.
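
The four labels mentioned above can be illustrated with a very small rule-based scorer in Python. This is only a sketch of the rule-based half of the idea, under the assumption of a tiny hand-made word list; it is not the SAS Sentiment Analysis method, it includes no statistical model, and the `POSITIVE` and `NEGATIVE` lexicons and the `score_sentiment` function are hypothetical.

```python
import re

# Hypothetical sentiment lexicons, far smaller than any real one.
POSITIVE = {"good", "great", "helpful", "reliable"}
NEGATIVE = {"bad", "slow", "broken", "rude"}


def score_sentiment(text):
    """Label text as positive, negative, neutral, or no sentiment."""
    tokens = re.findall(r"[a-z]+", text.lower())
    pos = sum(token in POSITIVE for token in tokens)
    neg = sum(token in NEGATIVE for token in tokens)
    if pos == 0 and neg == 0:
        return "no sentiment"  # no sentiment-bearing words at all
    if pos > neg:
        return "positive"
    if neg > pos:
        return "negative"
    return "neutral"  # positive and negative words balance out


if __name__ == "__main__":
    for text in [
        "Great product, reliable support",
        "Great design but slow delivery",
        "The store opens at nine",
    ]:
        print(text, "->", score_sentiment(text))
```

The sketch separates "neutral" (mixed signals that cancel) from "no sentiment" (no signals at all), which is one plausible reading of the distinction the product description draws.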

