<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How root mean squared standard deviation (RMSSTD) is calculated for Text document clustering? in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/How-root-mean-squared-standard-deviation-RMSSTD-is-calculated/m-p/362981#M9662</link>
    <description>&lt;P&gt;Each document is a K dimensional vector.&lt;/P&gt;
&lt;P&gt;Similarly, the mean of the cluster is a k dimensional&amp;nbsp;vector where each component is an average of the corresponding component for each of the m documents.&lt;/P&gt;
&lt;P&gt;A document error is the square root of the sum of the squared differences of each of its k components with each of the &amp;nbsp;k components of the &amp;nbsp;mean of the cluster.&lt;/P&gt;
&lt;P&gt;The RMSSTD is a an error for the entire cluster so to incorporate all documents from the cluster in this err caculation, it becomes the sum of the squared differences for every component of every document. There are m*k components to sum over in this case.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Russ&lt;/P&gt;</description>
    <pubDate>Wed, 31 May 2017 09:48:38 GMT</pubDate>
    <dc:creator>RussAlbright</dc:creator>
    <dc:date>2017-05-31T09:48:38Z</dc:date>
    <item>
      <title>How root mean squared standard deviation (RMSSTD) is calculated for Text document clustering?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-root-mean-squared-standard-deviation-RMSSTD-is-calculated/m-p/314459#M9659</link>
      <description>&lt;P&gt;How root mean squared standard deviation (RMSSTD) is calculated for Text document clustering? There is no mathematics is given in any of SAS documentation or Help regarding this.&lt;/P&gt;</description>
      <pubDate>Sat, 26 Nov 2016 13:06:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-root-mean-squared-standard-deviation-RMSSTD-is-calculated/m-p/314459#M9659</guid>
      <dc:creator>AbhishekVerma1985</dc:creator>
      <dc:date>2016-11-26T13:06:42Z</dc:date>
    </item>
    <item>
      <title>Re: How root mean squared standard deviation (RMSSTD) is calculated for Text document clustering?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-root-mean-squared-standard-deviation-RMSSTD-is-calculated/m-p/317188#M9660</link>
      <description>&lt;P&gt;if K is the number of dimensions used in the clustering, m is the number of docs in the cluster, and err &amp;nbsp;is &amp;nbsp;the &amp;nbsp;sum of the m*k &amp;nbsp;squared errors, then it looks like it is calculated to be&lt;BR /&gt;&lt;BR /&gt;rmstd = sqrt(err/((m-1)*K)), unless m = 1 and then the value is 0.&lt;/P&gt;</description>
      <pubDate>Wed, 07 Dec 2016 03:00:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-root-mean-squared-standard-deviation-RMSSTD-is-calculated/m-p/317188#M9660</guid>
      <dc:creator>RussAlbright</dc:creator>
      <dc:date>2016-12-07T03:00:36Z</dc:date>
    </item>
    <item>
      <title>Re: How root mean squared standard deviation (RMSSTD) is calculated for Text document clustering?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-root-mean-squared-standard-deviation-RMSSTD-is-calculated/m-p/362156#M9661</link>
      <description>&lt;P&gt;Dear Russ,&lt;/P&gt;
&lt;P&gt;It is little&amp;nbsp;confusing to me. I am not able to understand "&lt;SPAN&gt;err &amp;nbsp;is &amp;nbsp;the &amp;nbsp;sum of the m*k &amp;nbsp;squared errors&lt;/SPAN&gt;" it will be very helpful if you explain this.&lt;/P&gt;</description>
      <pubDate>Sat, 27 May 2017 06:01:52 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-root-mean-squared-standard-deviation-RMSSTD-is-calculated/m-p/362156#M9661</guid>
      <dc:creator>AbhishekVerma1985</dc:creator>
      <dc:date>2017-05-27T06:01:52Z</dc:date>
    </item>
    <item>
      <title>Re: How root mean squared standard deviation (RMSSTD) is calculated for Text document clustering?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-root-mean-squared-standard-deviation-RMSSTD-is-calculated/m-p/362981#M9662</link>
      <description>&lt;P&gt;Each document is a K dimensional vector.&lt;/P&gt;
&lt;P&gt;Similarly, the mean of the cluster is a k dimensional&amp;nbsp;vector where each component is an average of the corresponding component for each of the m documents.&lt;/P&gt;
&lt;P&gt;A document error is the square root of the sum of the squared differences of each of its k components with each of the &amp;nbsp;k components of the &amp;nbsp;mean of the cluster.&lt;/P&gt;
&lt;P&gt;The RMSSTD is a an error for the entire cluster so to incorporate all documents from the cluster in this err caculation, it becomes the sum of the squared differences for every component of every document. There are m*k components to sum over in this case.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Russ&lt;/P&gt;</description>
      <pubDate>Wed, 31 May 2017 09:48:38 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-root-mean-squared-standard-deviation-RMSSTD-is-calculated/m-p/362981#M9662</guid>
      <dc:creator>RussAlbright</dc:creator>
      <dc:date>2017-05-31T09:48:38Z</dc:date>
    </item>
  </channel>
</rss>

