<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic HPTMINE multi-term list in SAS Procedures</title>
    <link>https://communities.sas.com/t5/SAS-Procedures/HPTMINE-multi-term-list/m-p/759641#M80878</link>
    <description>&lt;P&gt;Hi all&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am trying to use PROC HPTMINE to parse some text.&lt;/P&gt;
&lt;P&gt;I have some multi-term lists that I want to be treated as units, and this page "seems" to explain how it should be done:&lt;/P&gt;
&lt;P&gt;&lt;A href="https://documentation.sas.com/doc/en/tmhpprcref/14.2/tmhpprcref_hptmine_sect008.htm#tmhpprcref.hptmine.par_mutt" target="_blank"&gt;https://documentation.sas.com/doc/en/tmhpprcref/14.2/tmhpprcref_hptmine_sect008.htm#tmhpprcref.hptmine.par_mutt&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;There is also this page, which defines a slightly different format:&lt;/P&gt;
&lt;P&gt;&lt;A href="https://documentation.sas.com/doc/en/tmref/15.1/n0u9wgweoizcpqn172txiyx8m5h9.htm" target="_blank"&gt;https://documentation.sas.com/doc/en/tmref/15.1/n0u9wgweoizcpqn172txiyx8m5h9.htm&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;This points to the SASHELP.eng_multi dataset, which does seem to follow the latter format.&lt;/P&gt;
&lt;P&gt;However, when I run my code, the procedure won't accept a SAS dataset, and when I point it to a file as the first link suggests, I get an error telling me that the procedure does not recognize my multiword terms, that I should check the multiword format, and that the multiword list will thus be ignored.&lt;/P&gt;
&lt;P&gt;I have tried to create a file like this (the headers are commented out because I tried, and failed, with and without the headers):&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;DATA _NULL_;
    FILE "c:\Users\..\multi_word phrases.txt";
    /*PUT "Term: Token_type: Role";*/
    PUT "Not Recorded: 3: Noun";
    PUT "not recorded: 3: Noun";
    PUT "Potassium Permanganate: 3: Noun";
    PUT "potassium permanganate: 3: Noun";
    PUT "Oxalic Acid: 3: Noun";
    PUT "oxalic acid: 3: Noun";
    PUT "sodium valproate: 3: Noun";
    PUT "small piece: 3: Noun";
    PUT "outer covering: 3: Noun";
    PUT "unknown tablet: 3: Noun";
RUN;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;So how exactly should this file be created? I have tried to follow the help instructions and just can't figure it out.&lt;/P&gt;
&lt;P&gt;I have tried with column-spaced entries, with and without colons, and any other configuration I thought might work.&lt;/P&gt;
&lt;P&gt;Thanks&lt;/P&gt;</description>
    <pubDate>Thu, 05 Aug 2021 07:35:20 GMT</pubDate>
    <dc:creator>JacquesR</dc:creator>
    <dc:date>2021-08-05T07:35:20Z</dc:date>
    <item>
      <title>HPTMINE multi-term list</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/HPTMINE-multi-term-list/m-p/759641#M80878</link>
      <description>&lt;P&gt;Hi all&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am trying to use PROC HPTMINE to parse some text.&lt;/P&gt;
&lt;P&gt;I have some multi-term lists that I want to be treated as units, and this page "seems" to explain how it should be done:&lt;/P&gt;
&lt;P&gt;&lt;A href="https://documentation.sas.com/doc/en/tmhpprcref/14.2/tmhpprcref_hptmine_sect008.htm#tmhpprcref.hptmine.par_mutt" target="_blank"&gt;https://documentation.sas.com/doc/en/tmhpprcref/14.2/tmhpprcref_hptmine_sect008.htm#tmhpprcref.hptmine.par_mutt&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;There is also this page, which defines a slightly different format:&lt;/P&gt;
&lt;P&gt;&lt;A href="https://documentation.sas.com/doc/en/tmref/15.1/n0u9wgweoizcpqn172txiyx8m5h9.htm" target="_blank"&gt;https://documentation.sas.com/doc/en/tmref/15.1/n0u9wgweoizcpqn172txiyx8m5h9.htm&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;This points to the SASHELP.eng_multi dataset, which does seem to follow the latter format.&lt;/P&gt;
&lt;P&gt;However, when I run my code, the procedure won't accept a SAS dataset, and when I point it to a file as the first link suggests, I get an error telling me that the procedure does not recognize my multiword terms, that I should check the multiword format, and that the multiword list will thus be ignored.&lt;/P&gt;
&lt;P&gt;I have tried to create a file like this (the headers are commented out because I tried, and failed, with and without the headers):&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;DATA _NULL_;
    FILE "c:\Users\..\multi_word phrases.txt";
    /*PUT "Term: Token_type: Role";*/
    PUT "Not Recorded: 3: Noun";
    PUT "not recorded: 3: Noun";
    PUT "Potassium Permanganate: 3: Noun";
    PUT "potassium permanganate: 3: Noun";
    PUT "Oxalic Acid: 3: Noun";
    PUT "oxalic acid: 3: Noun";
    PUT "sodium valproate: 3: Noun";
    PUT "small piece: 3: Noun";
    PUT "outer covering: 3: Noun";
    PUT "unknown tablet: 3: Noun";
RUN;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;So how exactly should this file be created? I have tried to follow the help instructions and just can't figure it out.&lt;/P&gt;
&lt;P&gt;I have tried with column-spaced entries, with and without colons, and any other configuration I thought might work.&lt;/P&gt;
&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Thu, 05 Aug 2021 07:35:20 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/HPTMINE-multi-term-list/m-p/759641#M80878</guid>
      <dc:creator>JacquesR</dc:creator>
      <dc:date>2021-08-05T07:35:20Z</dc:date>
    </item>
    <item>
      <title>Re: HPTMINE multi-term list</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/HPTMINE-multi-term-list/m-p/759755#M80880</link>
      <description>Have you seen the example here: &lt;A href="https://documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/tmhpprcref/tmhpprcref_hptmine_examples07.htm" target="_blank"&gt;https://documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/tmhpprcref/tmhpprcref_hptmine_examples07.htm&lt;/A&gt; &lt;BR /&gt;&lt;BR /&gt;Which version of miner are you using?&lt;BR /&gt;Best regards, Jos</description>
      <pubDate>Thu, 05 Aug 2021 16:54:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/HPTMINE-multi-term-list/m-p/759755#M80880</guid>
      <dc:creator>JosvanderVelden</dc:creator>
      <dc:date>2021-08-05T16:54:49Z</dc:date>
    </item>
    <item>
      <title>Re: HPTMINE multi-term list</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/HPTMINE-multi-term-list/m-p/759829#M80884</link>
      <description>&lt;P&gt;Thanks Jos&lt;/P&gt;
&lt;P&gt;Simple as that: no spaces around the colons, i.e. :3:Noun&lt;/P&gt;
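&lt;P&gt;For anyone who finds this later, here is a minimal sketch of the DATA step from my question with the spaces around the colons removed. The path is the same elided placeholder as in my original post and I have kept only a few of the terms. The layout follows my reading of the example Jos linked, so treat it as an assumption and check it against the documentation for your own version:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;/* Write the multi-term list, one entry per line, as term:3:role */
DATA _NULL_;
    FILE "c:\Users\..\multi_word phrases.txt";
    /* No spaces on either side of the colons */
    PUT "Not Recorded:3:Noun";
    PUT "not recorded:3:Noun";
    PUT "Potassium Permanganate:3:Noun";
    PUT "potassium permanganate:3:Noun";
    PUT "sodium valproate:3:Noun";
RUN;&lt;/CODE&gt;&lt;/PRE&gt;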
&lt;P&gt;Pity that&amp;nbsp;&lt;A href="https://documentation.sas.com/doc/en/tmhpprcref/14.2/tmhpprcref_hptmine_sect008.htm#tmhpprcref.hptmine.par_mutt" target="_blank"&gt;https://documentation.sas.com/doc/en/tmhpprcref/14.2/tmhpprcref_hptmine_sect008.htm#tmhpprcref.hptmine.par_mutt&lt;/A&gt;&amp;nbsp;shows the spaces.&lt;/P&gt;</description>
      <pubDate>Thu, 05 Aug 2021 22:47:22 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/HPTMINE-multi-term-list/m-p/759829#M80884</guid>
      <dc:creator>JacquesR</dc:creator>
      <dc:date>2021-08-05T22:47:22Z</dc:date>
    </item>
  </channel>
</rss>

