SAS Visual Analytics Stop List--What does it stop?

by SAS Employee BobbieWagoner on 07-12-2016 12:33 PM - edited on 07-12-2016 12:37 PM by Community Manager

SAS Visual Analytics provides a pre-built list of words that you would probably want your text analytics to ignore. Actually, there are two lists: one for English and one for German. A stop list lets you filter noise out of your analysis by ignoring irrelevant or commonly used words, such as "a", "and", and "the".
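
To make the idea concrete, here is a minimal Python sketch of stop-word filtering in general. It is not how SAS Visual Analytics implements it; the three-word `STOP_WORDS` set and the `remove_stop_words` function are hypothetical names made up for illustration.

```python
import re

# Hypothetical miniature stop list, for illustration only.
STOP_WORDS = {"a", "and", "the"}

def remove_stop_words(text, stop_words=STOP_WORDS):
    """Return the lowercase words in `text` that are not in `stop_words`."""
    # Lowercase first so that "The" and "the" are treated the same.
    words = re.findall(r"[a-z']+", text.lower())
    return [w for w in words if w not in stop_words]

print(remove_stop_words("The cat and the dog chased a ball"))
# prints ['cat', 'dog', 'chased', 'ball']
```

Only the words that carry meaning survive, which is what keeps a word cloud from being dominated by "the" and "and".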

 

Before you can use a stop list, you must load it into memory. The data builder provides an option designed especially for that purpose. To load one of the stop lists in the data builder, select Tool-->Load Text Analytics Stop List.

 

1_loadlist (1).jpg

 

You can specify which list to load (English or German), the metadata registration location, and the LASR library.

2_chooselist.jpg

A table named ENGSTOPL or GRMSTOPS is registered in the location and library that you specify. SAS Visual Analytics supports one stop list for each SAS LASR Analytic Server. You load the stop list (which is a table) to memory by performing the previous steps. If more than one library is registered for SAS LASR Analytic Server, you can use any of them. If you load a stop list more than once or use more than one library, the server uses the last stop list that was loaded to memory.

Once you load a stop list, you will see the list in the LASR Tables tab in the administrator. Here are the first 20 words of the English stop list displayed in a list table in the designer. The entire table contains 509 unique words.

 

3_list.jpg
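
The "one stop list per server, last load wins" rule described earlier can be pictured with a toy Python model. This is only a sketch of the rule as stated in this article, not SAS code: the `ToyServer` class and the library names `LIB_A` and `LIB_B` are hypothetical.

```python
class ToyServer:
    """Toy stand-in for a server that keeps a single active stop list."""

    def __init__(self):
        # (library, table) of the most recently loaded stop list, if any.
        self.active_stop_list = None

    def load_stop_list(self, library, table):
        # A later load replaces whatever was active before.
        self.active_stop_list = (library, table)

server = ToyServer()
server.load_stop_list("LIB_A", "ENGSTOPL")  # hypothetical library name
server.load_stop_list("LIB_B", "GRMSTOPS")  # hypothetical library name
print(server.active_stop_list)
# prints ('LIB_B', 'GRMSTOPS')
```

In this model, loading the German list after the English one leaves only the German list in effect, regardless of which library each was loaded into.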

 

 

A site may be tempted to modify the stop list by adding its own values, or even to replace it with a custom list of values. That may work, but make sure your site knows that custom stop lists are not formally supported as of the 7.3 release of SAS Visual Analytics. Once one of the provided stop lists is loaded, you are ready to use it in your word clouds and text analytics.
