<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Determine # of Text Topics in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Determine-of-Text-Topics/m-p/287819#M4290</link>
    <description>&lt;P&gt;For Text Topic node, the default setting for # of topics is 25. I would like to know how to determine the right number. For example, can I base on the # of clusters I get from Text Cluster node to decide how many topics I should set for Text Topic node?&lt;/P&gt;</description>
    <pubDate>Thu, 28 Jul 2016 14:32:11 GMT</pubDate>
    <dc:creator>aha123</dc:creator>
    <dc:date>2016-07-28T14:32:11Z</dc:date>
    <item>
      <title>Determine # of Text Topics</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Determine-of-Text-Topics/m-p/287819#M4290</link>
      <description>&lt;P&gt;For Text Topic node, the default setting for # of topics is 25. I would like to know how to determine the right number. For example, can I base on the # of clusters I get from Text Cluster node to decide how many topics I should set for Text Topic node?&lt;/P&gt;</description>
      <pubDate>Thu, 28 Jul 2016 14:32:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Determine-of-Text-Topics/m-p/287819#M4290</guid>
      <dc:creator>aha123</dc:creator>
      <dc:date>2016-07-28T14:32:11Z</dc:date>
    </item>
    <item>
      <title>Re: Determine # of Text Topics</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Determine-of-Text-Topics/m-p/287877#M4292</link>
      <description>&lt;P&gt;Hi.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I don't think the TT node provides much in the way of guidance, but you might have a look at the HP Text Miner node (HPDM tab)&amp;nbsp;if you have access to it.&amp;nbsp;It gives various options for selecting the number of topics based on percentage of total variance accounted for.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Here's a snippet from Help:&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;"Suppose that the maximum number of SVD dimensions that you specify for the &lt;STRONG&gt;Max SVD Dimensions&lt;/STRONG&gt; property is maxdim, and these maxdim SVD dimensions account for p% of the total variance. High resolution always generates the maximum number of SVD dimensions (maxdim). For medium resolution, the recommended number of SVD dimensions accounts for 5/6*(p% of the total variance). For low resolution, the recommended number of SVD dimensions accounts for 2/3*(p% of the total variance)."&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;(You could also try posting to&amp;nbsp;the &lt;A href="https://communities.sas.com/t5/SAS-Text-and-Content-Analytics/bd-p/text_analytics" target="_self"&gt;Text Analytics Community&lt;/A&gt;&amp;nbsp;as they are the text mining experts)&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Ray&lt;/P&gt;</description>
      <pubDate>Thu, 28 Jul 2016 16:57:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Determine-of-Text-Topics/m-p/287877#M4292</guid>
      <dc:creator>rayIII</dc:creator>
      <dc:date>2016-07-28T16:57:42Z</dc:date>
    </item>
  </channel>
</rss>

