<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Multi-word list and Text Filter in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Multi-word-list-and-Text-Filter/m-p/566754#M10025</link>
    <description>&lt;P&gt;Hello everyone, I've been having an issue trying to understand the correct way to set some multi-word terms in my analysis. Please take a look at the following printscreen:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="sasp.PNG" style="width: 508px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/30340i342070849D259718/image-size/large?v=v2&amp;amp;px=999" role="button" title="sasp.PNG" alt="sasp.PNG" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;This is the SAS Table for Multi-word terms + some terms I have inserted myself. I have a few questions about it:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1) Where do these # Documents come from? I don't have any of these terms in my analysis;&lt;/P&gt;&lt;P&gt;2) Why the last ones (starting from Rio de Janeiro) are all 0 (which happens to be the ones I inserted myself)?;&lt;/P&gt;&lt;P&gt;3) And last question: Which is best for setting multi-word terms: setting a multi-word term or adding to the synonyms list? I ask this because for instance I have many documents containing "Rio de Janeiro" but on each document it's written starting with a variation in the case of the first letter, for example: "Rio de janeiro", "rio de Janeiro", "rio de janeiro", "Rio de Janeiro"...and the counting for the multi-word is not aggregating all documents with all variations. I don't know how to overcome this scenario.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you very much in advance!!&lt;/P&gt;</description>
    <pubDate>Tue, 18 Jun 2019 00:14:57 GMT</pubDate>
    <dc:creator>LipizinPinto</dc:creator>
    <dc:date>2019-06-18T00:14:57Z</dc:date>
    <item>
      <title>Multi-word list and Text Filter</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Multi-word-list-and-Text-Filter/m-p/566754#M10025</link>
      <description>&lt;P&gt;Hello everyone, I've been having an issue trying to understand the correct way to set some multi-word terms in my analysis. Please take a look at the following printscreen:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="sasp.PNG" style="width: 508px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/30340i342070849D259718/image-size/large?v=v2&amp;amp;px=999" role="button" title="sasp.PNG" alt="sasp.PNG" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;This is the SAS Table for Multi-word terms + some terms I have inserted myself. I have a few questions about it:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1) Where do these # Documents come from? I don't have any of these terms in my analysis;&lt;/P&gt;&lt;P&gt;2) Why the last ones (starting from Rio de Janeiro) are all 0 (which happens to be the ones I inserted myself)?;&lt;/P&gt;&lt;P&gt;3) And last question: Which is best for setting multi-word terms: setting a multi-word term or adding to the synonyms list? I ask this because for instance I have many documents containing "Rio de Janeiro" but on each document it's written starting with a variation in the case of the first letter, for example: "Rio de janeiro", "rio de Janeiro", "rio de janeiro", "Rio de Janeiro"...and the counting for the multi-word is not aggregating all documents with all variations. I don't know how to overcome this scenario.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you very much in advance!!&lt;/P&gt;</description>
      <pubDate>Tue, 18 Jun 2019 00:14:57 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Multi-word-list-and-Text-Filter/m-p/566754#M10025</guid>
      <dc:creator>LipizinPinto</dc:creator>
      <dc:date>2019-06-18T00:14:57Z</dc:date>
    </item>
    <item>
      <title>Re: Multi-word list and Text Filter</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Multi-word-list-and-Text-Filter/m-p/566757#M10026</link>
      <description>&lt;P&gt;use upcase(term);&lt;/P&gt;
&lt;P&gt;if upcase(term) = upcase(myterms) then term_are_equal = 1;&lt;/P&gt;
&lt;P&gt;else term_are_equal = 0;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 18 Jun 2019 00:52:03 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Multi-word-list-and-Text-Filter/m-p/566757#M10026</guid>
      <dc:creator>VDD</dc:creator>
      <dc:date>2019-06-18T00:52:03Z</dc:date>
    </item>
  </channel>
</rss>

