BookmarkSubscribeRSS Feed
LipizinPinto
Calcite | Level 5

Hello everyone, I've been having an issue trying to understand the correct way to set some multi-word terms in my analysis. Please take a look at the following printscreen:

 

sasp.PNG

This is the SAS Table for Multi-word terms + some terms I have inserted myself. I have a few questions about it:

 

1) Where do these # Documents come from? I don't have any of these terms in my analysis;

2) Why the last ones (starting from Rio de Janeiro) are all 0 (which happens to be the ones I inserted myself)?;

3) And last question: Which is best for setting multi-word terms: setting a multi-word term or adding to the synonyms list? I ask this because for instance I have many documents containing "Rio de Janeiro" but on each document it's written starting with a variation in the case of the first letter, for example: "Rio de janeiro", "rio de Janeiro", "rio de janeiro", "Rio de Janeiro"...and the counting for the multi-word is not aggregating all documents with all variations. I don't know how to overcome this scenario.

 

 

Thank you very much in advance!!

1 REPLY 1
VDD
Ammonite | Level 13 VDD
Ammonite | Level 13

use upcase(term);

if upcase(term) = upcase(myterms) then term_are_equal = 1;

else term_are_equal = 0;

 

SAS Innovate 2025: Call for Content

Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!

Submit your idea!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 703 views
  • 0 likes
  • 2 in conversation