<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Cluster or FastClus in SAS Procedures</title>
    <link>https://communities.sas.com/t5/SAS-Procedures/Cluster-or-FastClus/m-p/72936#M21165</link>
    <description>Thank you very much for your insights, KSharp.  I will look at the SAS doc'n for PRINCOMP.</description>
    <pubDate>Thu, 11 Nov 2010 13:43:33 GMT</pubDate>
    <dc:creator>mjbstats</dc:creator>
    <dc:date>2010-11-11T13:43:33Z</dc:date>
    <item>
      <title>Cluster or FastClus</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Cluster-or-FastClus/m-p/72928#M21157</link>
      <description>I want to use either PROC CLUSTER or PROC FASTCLUS to determine whether my data can be grouped and, if so, what the best grouping is.  A colleague ran this for me in a different stat package using dynamic k-means for 10, 8, 6, 4, 3, 2 groups and so on.  He took the output and plotted the number of groups against the RMSE for each; the point where the line inflected represented the optimal grouping.  When I run FASTCLUS or CLUSTER, I don't see an RMSE to do a similar check.  What do I use in the SAS output for these PROCs to determine when the number of clusters is the best it can be?  Is there a metric to gauge this with?&lt;BR /&gt;
&lt;BR /&gt;
Thanks.</description>
      <pubDate>Thu, 05 Feb 2009 20:50:04 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Cluster-or-FastClus/m-p/72928#M21157</guid>
      <dc:creator>GSRodney</dc:creator>
      <dc:date>2009-02-05T20:50:04Z</dc:date>
    </item>
    <item>
      <title>Re: Cluster or FastClus</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Cluster-or-FastClus/m-p/72929#M21158</link>
      <description>Hello GSRodney,&lt;BR /&gt;
&lt;BR /&gt;
Are you still uncertain about these procedures?  I am also.  I am new to clustering and am also trying to "match" results using another software program.  In the example I am attempting to match, several scenarios were run and the between/within cluster variance ratio for each was calculated.  Where those ratios seem to hit a point of diminishing returns (in that additional clusters no longer differentiate the groups well enough relative to the within-cluster variance), an optimal number of clusters begins to appear.  Choosing the actual number of clusters is a somewhat subjective process.&lt;BR /&gt;
&lt;BR /&gt;
BTW, the between/within ratios seem to have been calculated offline with Excel--my application involves fewer than 1,000 clustered values and only 1 dependent variable.&lt;BR /&gt;
&lt;BR /&gt;
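For anyone wanting to reproduce the Excel step, here is a minimal sketch in Python (not SAS), assuming one variable and cluster labels that are already known; the function names and sample values are made up for illustration. It computes the between- and within-cluster sums of squares and combines them into a pseudo F ratio, (B / (k - 1)) / (W / (n - k)).&lt;BR /&gt;

```python
def between_within(values, labels):
    """Return (between_ss, within_ss) for one variable and given cluster labels."""
    grand_mean = sum(values) / len(values)
    groups = {}
    for v, lab in zip(values, labels):
        groups.setdefault(lab, []).append(v)
    between_ss = 0.0
    within_ss = 0.0
    for members in groups.values():
        mean = sum(members) / len(members)
        # Between: cluster size times squared distance of cluster mean from grand mean.
        between_ss += len(members) * (mean - grand_mean) ** 2
        # Within: squared distances of members from their own cluster mean.
        within_ss += sum((v - mean) ** 2 for v in members)
    return between_ss, within_ss


def pseudo_f(values, labels):
    """Pseudo F ratio: (B / (k - 1)) / (W / (n - k)); needs at least 2 clusters."""
    n = len(values)
    k = len(set(labels))
    between_ss, within_ss = between_within(values, labels)
    return (between_ss / (k - 1)) / (within_ss / (n - k))


# Made-up sample values with two candidate groupings (illustration only).
values = [1.0, 1.2, 0.8, 5.0, 5.3, 4.9, 9.0, 9.1, 8.8]
two_groups = [0, 0, 0, 0, 0, 0, 1, 1, 1]
three_groups = [0, 0, 0, 1, 1, 1, 2, 2, 2]
print("2 clusters:", pseudo_f(values, two_groups))
print("3 clusters:", pseudo_f(values, three_groups))
```

A larger ratio means the clusters are better separated relative to their internal spread, so comparing it across candidate numbers of clusters is one way to look for the point of diminishing returns.&lt;BR /&gt;
&lt;BR /&gt;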
Anyway, if you have any additional insight on cluster analysis, measures for choosing the number of clusters, and SAS procs, please share!&lt;BR /&gt;
&lt;BR /&gt;
Thanks.</description>
      <pubDate>Mon, 08 Nov 2010 15:42:07 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Cluster-or-FastClus/m-p/72929#M21158</guid>
      <dc:creator>mjbstats</dc:creator>
      <dc:date>2010-11-08T15:42:07Z</dc:date>
    </item>
    <item>
      <title>Re: Cluster or FastClus</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Cluster-or-FastClus/m-p/72931#M21160</link>
      <description>Hi. I remember there is likely a statistical estimator (but I forgot which one &lt;span class="lia-unicode-emoji" title=":disappointed_face:"&gt;😞&lt;/span&gt;) for deciding how many clusters to use.&lt;BR /&gt;
Before using PROC CLUSTER or PROC FASTCLUS, I recommend using PROC PRINCOMP and PROC GPLOT to plot Prin1 against Prin2 to help decide how many clusters you want.&lt;BR /&gt;
There is also no single best criterion for deciding the number of clusters; different methods yield different clusters.&lt;BR /&gt;
&lt;BR /&gt;
&lt;BR /&gt;
Ksharp</description>
      <pubDate>Tue, 09 Nov 2010 08:13:17 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Cluster-or-FastClus/m-p/72931#M21160</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2010-11-09T08:13:17Z</dc:date>
    </item>
    <item>
      <title>Re: Cluster or FastClus</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Cluster-or-FastClus/m-p/72932#M21161</link>
      <description>Hi,&lt;BR /&gt;
&lt;BR /&gt;
The statistic you want is the CCC, which stands for cubic clustering criterion. PROC CLUSTER measures the distances between the points and produces the CCC and pseudo R-square statistics. PROC FASTCLUS basically implements the k-means algorithm.&lt;BR /&gt;
&lt;BR /&gt;
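To illustrate the k-means idea and the RMSE-versus-number-of-clusters check from the original question, here is a minimal sketch in Python (not SAS, and not how FASTCLUS is implemented internally). It assumes a single variable, a made-up data set, and a simple deterministic starting rule; the function names are hypothetical.&lt;BR /&gt;

```python
import math


def kmeans_1d(values, k, max_iter=100):
    """Plain k-means (Lloyd's algorithm) on one variable; returns (centers, labels)."""
    ordered = sorted(values)
    n = len(ordered)
    # Assumed starting rule: k evenly spaced order statistics (illustration only).
    centers = [ordered[(2 * i + 1) * n // (2 * k)] for i in range(k)]
    labels = []
    for _ in range(max_iter):
        # Assign each value to its nearest center.
        labels = [min(range(k), key=lambda j: (v - centers[j]) ** 2) for v in values]
        # Recompute each center as the mean of its members (keep it if empty).
        new_centers = []
        for j in range(k):
            members = [v for v, lab in zip(values, labels) if lab == j]
            new_centers.append(sum(members) / len(members) if members else centers[j])
        if new_centers == centers:
            break
        centers = new_centers
    return centers, labels


def within_rmse(values, centers, labels):
    """Root mean squared distance from each value to its cluster center."""
    sse = sum((v - centers[lab]) ** 2 for v, lab in zip(values, labels))
    return math.sqrt(sse / len(values))


# Made-up data with three visible groups.
data = [1.0, 1.2, 0.8, 5.0, 5.3, 4.9, 9.0, 9.1, 8.8]
curve = {}
for k in range(1, 6):
    centers, labels = kmeans_1d(data, k)
    curve[k] = within_rmse(data, centers, labels)
    print(k, round(curve[k], 3))
```

Plotting the printed RMSE against k and looking for the bend in the curve is the check described in the original question.&lt;BR /&gt;
&lt;BR /&gt;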
Regards,&lt;BR /&gt;
Murphy</description>
      <pubDate>Tue, 09 Nov 2010 11:47:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Cluster-or-FastClus/m-p/72932#M21161</guid>
      <dc:creator>goladin</dc:creator>
      <dc:date>2010-11-09T11:47:36Z</dc:date>
    </item>
    <item>
      <title>Re: Cluster or FastClus</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Cluster-or-FastClus/m-p/72933#M21162</link>
      <description>OK, now to show my ignorance (if I haven't already).  I have no experience with PRINCOMP.  Why run it, and what do the "1" and "2" you referenced estimate?</description>
      <pubDate>Wed, 10 Nov 2010 15:41:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Cluster-or-FastClus/m-p/72933#M21162</guid>
      <dc:creator>mjbstats</dc:creator>
      <dc:date>2010-11-10T15:41:54Z</dc:date>
    </item>
    <item>
      <title>Re: Cluster or FastClus</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Cluster-or-FastClus/m-p/72934#M21163</link>
      <description>Hello,&lt;BR /&gt;
&lt;BR /&gt;
Can you elaborate on the CCC, and what it means?  Also the Pseudo R-Square...&lt;BR /&gt;
&lt;BR /&gt;
I thought K-means was OK for my application, but admit to some fogginess re: hierarchical vs. disjoint clustering methods.&lt;BR /&gt;
&lt;BR /&gt;
(I chose FASTCLUS because I thought I wanted disjoint and the ease of specifying number of clusters--but better understanding doesn't mean best procedure for my simple data.)&lt;BR /&gt;
&lt;BR /&gt;
Thank you!</description>
      <pubDate>Wed, 10 Nov 2010 15:45:17 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Cluster-or-FastClus/m-p/72934#M21163</guid>
      <dc:creator>mjbstats</dc:creator>
      <dc:date>2010-11-10T15:45:17Z</dc:date>
    </item>
    <item>
      <title>Re: Cluster or FastClus</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Cluster-or-FastClus/m-p/72935#M21164</link>
      <description>Hi.&lt;BR /&gt;
Don't say that. I am also a beginner with SAS statistical methods.&lt;BR /&gt;
PROC PRINCOMP performs principal component analysis, one of the oldest multivariate methods. It can summarize multivariable data with two principal components (Prin1 and Prin2) computed from the covariance or correlation matrix.&lt;BR /&gt;
Then use these two components as the x-axis and y-axis and plot the observations in that coordinate system.&lt;BR /&gt;
You will find that some observations lie very close together and some very far apart.&lt;BR /&gt;
I recommend looking up the SAS documentation for PROC PRINCOMP.&lt;BR /&gt;
&lt;BR /&gt;
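As a rough illustration of the idea (in Python, not SAS, and not what PROC PRINCOMP does internally), the sketch below computes the first two principal components of a small made-up data set from its covariance matrix using power iteration, then projects each observation onto them. All names and numbers are assumptions for illustration.&lt;BR /&gt;

```python
import math


def covariance_matrix(rows):
    """Sample covariance matrix and centered data for a list of observation rows."""
    n = len(rows)
    p = len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(p)]
    centered = [[r[j] - means[j] for j in range(p)] for r in rows]
    cov = [[sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(p)]
           for a in range(p)]
    return cov, centered


def top_component(matrix, iterations=500):
    """Power iteration: dominant eigenvector and its eigenvalue (symmetric matrix)."""
    p = len(matrix)
    vec = [1.0 / math.sqrt(p)] * p
    for _ in range(iterations):
        nxt = [sum(matrix[a][b] * vec[b] for b in range(p)) for a in range(p)]
        norm = math.sqrt(sum(x * x for x in nxt))
        if norm == 0.0:
            break
        vec = [x / norm for x in nxt]
    value = sum(vec[a] * matrix[a][b] * vec[b] for a in range(p) for b in range(p))
    return vec, value


def first_two_scores(rows):
    """Project observations onto the first two principal components."""
    cov, centered = covariance_matrix(rows)
    p = len(cov)
    v1, l1 = top_component(cov)
    # Deflate: subtract the first component so the next run finds the second.
    deflated = [[cov[a][b] - l1 * v1[a] * v1[b] for b in range(p)] for a in range(p)]
    v2, l2 = top_component(deflated)
    scores = [(sum(c[j] * v1[j] for j in range(p)),
               sum(c[j] * v2[j] for j in range(p))) for c in centered]
    return scores, (l1, l2)


# Made-up observations on three variables, forming two loose groups.
rows = [
    [1.0, 2.0, 0.5], [1.2, 1.8, 0.4], [0.9, 2.2, 0.6],
    [6.0, 7.1, 3.0], [6.3, 6.8, 3.2], [5.8, 7.3, 2.9],
]
scores, eigenvalues = first_two_scores(rows)
for prin1, prin2 in scores:
    print(round(prin1, 3), round(prin2, 3))
```

Plotting the two printed columns against each other gives the scatter plot described above, where observations that belong together show up as nearby points.&lt;BR /&gt;
&lt;BR /&gt;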
P.S. These two components account for the largest share of the variance in the data.&lt;BR /&gt;
&lt;BR /&gt;
&lt;BR /&gt;
Ksharp</description>
      <pubDate>Thu, 11 Nov 2010 01:45:06 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Cluster-or-FastClus/m-p/72935#M21164</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2010-11-11T01:45:06Z</dc:date>
    </item>
    <item>
      <title>Re: Cluster or FastClus</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Cluster-or-FastClus/m-p/72936#M21165</link>
      <description>Thank you very much for your insights, KSharp.  I will look at the SAS doc'n for PRINCOMP.</description>
      <pubDate>Thu, 11 Nov 2010 13:43:33 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Cluster-or-FastClus/m-p/72936#M21165</guid>
      <dc:creator>mjbstats</dc:creator>
      <dc:date>2010-11-11T13:43:33Z</dc:date>
    </item>
    <item>
      <title>Re: Cluster or FastClus</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Cluster-or-FastClus/m-p/72937#M21166</link>
      <description>BTW, I have found SAS Technical Report A-108, Cubic Clustering Criterion, and Usage Note 22540:  "How can I tell how many clusters...?" to be very useful.</description>
      <pubDate>Thu, 11 Nov 2010 15:49:04 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Cluster-or-FastClus/m-p/72937#M21166</guid>
      <dc:creator>mjbstats</dc:creator>
      <dc:date>2010-11-11T15:49:04Z</dc:date>
    </item>
  </channel>
</rss>

