<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to get TFIDF table from Text Parser in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-TFIDF-table-from-Text-Parser/m-p/147144#M1471</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;The &amp;lt;nodename&amp;gt;_validout and &amp;lt;nodename&amp;gt;_testout tables contain the TF-IDF weightings for the validation and test sets, respectively.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Wed, 17 Sep 2014 21:01:15 GMT</pubDate>
    <dc:creator>JamesCoxPhD</dc:creator>
    <dc:date>2014-09-17T21:01:15Z</dc:date>
    <item>
      <title>How to get TFIDF table from Text Parser</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-TFIDF-table-from-Text-Parser/m-p/147141#M1468</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I am currently using SAS EM 12.1. I would like to get the term frequency-inverse document frequency (TF-IDF) table from the Text Parser.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I can see that the exported transaction data set is that table. It has three columns: a term index, a document index, and the weight of that term in that document.&amp;nbsp; However, each term is represented as an index, not as the actual word.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Is there a lookup table that maps each term's index to the actual word?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 16 Sep 2014 19:28:53 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-TFIDF-table-from-Text-Parser/m-p/147141#M1468</guid>
      <dc:creator>bachfan</dc:creator>
      <dc:date>2014-09-16T19:28:53Z</dc:date>
    </item>
    <item>
      <title>Re: How to get TFIDF table from Text Parser</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-TFIDF-table-from-Text-Parser/m-p/147142#M1469</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;You are exactly right: the transaction table is the TF-IDF table.&amp;nbsp; If you want to see it as term|role combinations, you can do something like the following in code or in a code node (assuming that this is the first diagram and the first text filter node on that diagram):&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;%let filternode_name=emws1.textfilternode;&lt;/P&gt;&lt;P&gt;%let viewname=&amp;lt;whatever data set you want to create&amp;gt;;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;proc sql noprint;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp; /* Join the term strings (a) to the output table (b) on the term number. */&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp; create view &amp;amp;viewname as&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; select ktrim(term) || '|' || role as _item_, b.*&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; from &amp;amp;filternode_name._term_strings as a, &amp;amp;filternode_name._out_parent as b&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; where b._termnum_=a.key;&lt;/P&gt;&lt;P&gt;quit;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 16 Sep 2014 19:46:07 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-TFIDF-table-from-Text-Parser/m-p/147142#M1469</guid>
      <dc:creator>JamesCoxPhD</dc:creator>
      <dc:date>2014-09-16T19:46:07Z</dc:date>
    </item>
    <item>
      <title>Re: How to get TFIDF table from Text Parser</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-TFIDF-table-from-Text-Parser/m-p/147143#M1470</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Great. I used proc contents and found many more data sets.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;But I have a follow-up question. I got great results using an SVM model with the TF-IDF matrix as the input variables for classification.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Now I need a scoring data set that will go through parsing and filtering. But I do not see a way to get the TF-IDF matrix for the score data set, which the SVM will subsequently use, because there is only one transaction data set coming out of the text filter node. Is this doable?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;IMG alt="tfidf.jpg" class="jive-image-thumbnail jive-image" src="https://communities.sas.com/legacyfs/online/7436_tfidf.jpg" width="450" /&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 17 Sep 2014 20:58:19 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-TFIDF-table-from-Text-Parser/m-p/147143#M1470</guid>
      <dc:creator>bachfan</dc:creator>
      <dc:date>2014-09-17T20:58:19Z</dc:date>
    </item>
    <item>
      <title>Re: How to get TFIDF table from Text Parser</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-TFIDF-table-from-Text-Parser/m-p/147144#M1471</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;The &amp;lt;nodename&amp;gt;_validout and &amp;lt;nodename&amp;gt;_testout tables contain the TF-IDF weightings for the validation and test sets, respectively.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 17 Sep 2014 21:01:15 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-TFIDF-table-from-Text-Parser/m-p/147144#M1471</guid>
      <dc:creator>JamesCoxPhD</dc:creator>
      <dc:date>2014-09-17T21:01:15Z</dc:date>
    </item>
    <item>
      <title>Re: How to get TFIDF table from Text Parser / Filter</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-TFIDF-table-from-Text-Parser/m-p/225761#M3189</link>
      <description>&lt;P&gt;None of the data sets seem to be the same in SAS EM 13.1. Any hints on where the links between the nodes are? I can see several possibilities, but I have never worked with SAS at this level.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Jacob&lt;/P&gt;</description>
      <pubDate>Wed, 16 Sep 2015 05:21:20 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-TFIDF-table-from-Text-Parser/m-p/225761#M3189</guid>
      <dc:creator>ironfrown</dc:creator>
      <dc:date>2015-09-16T05:21:20Z</dc:date>
    </item>
    <item>
      <title>Re: How to get TFIDF table from Text Parser / Filter</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-TFIDF-table-from-Text-Parser/m-p/225986#M3191</link>
      <description>&lt;P&gt;Actually, the answer was in the tiny picture attached to one of the previous messages. The TF-IDF matrix, in its sparse representation, can be found in the TRANSACTION data set returned from the Text Filter, provided the weights have been set to TF-IDF.&lt;/P&gt;&lt;P&gt;Jacob&lt;/P&gt;</description>
      <pubDate>Thu, 17 Sep 2015 02:56:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-TFIDF-table-from-Text-Parser/m-p/225986#M3191</guid>
      <dc:creator>ironfrown</dc:creator>
      <dc:date>2015-09-17T02:56:35Z</dc:date>
    </item>
  </channel>
</rss>

