BookmarkSubscribeRSS Feed
LipizinPinto
Calcite | Level 5

Hello everyone, I've been having an issue trying to understand the correct way to set some multi-word terms in my analysis. Please take a look at the following printscreen:

 

sasp.PNG

This is the SAS Table for Multi-word terms + some terms I have inserted myself. I have a few questions about it:

 

1) Where do these # Documents come from? I don't have any of these terms in my analysis;

2) Why the last ones (starting from Rio de Janeiro) are all 0 (which happens to be the ones I inserted myself)?;

3) And last question: Which is best for setting multi-word terms: setting a multi-word term or adding to the synonyms list? I ask this because for instance I have many documents containing "Rio de Janeiro" but on each document it's written starting with a variation in the case of the first letter, for example: "Rio de janeiro", "rio de Janeiro", "rio de janeiro", "Rio de Janeiro"...and the counting for the multi-word is not aggregating all documents with all variations. I don't know how to overcome this scenario.

 

 

Thank you very much in advance!!

1 REPLY 1
VDD
Ammonite | Level 13 VDD
Ammonite | Level 13

use upcase(term);

if upcase(term) = upcase(myterms) then term_are_equal = 1;

else term_are_equal = 0;

 

sas-innovate-2026-white.png



April 27 – 30 | Gaylord Texan | Grapevine, Texas

Registration is open

Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and lock in 2025 pricing—just $495!

Register now

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 1076 views
  • 0 likes
  • 2 in conversation