<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Interpreting negative CCC values in a Cluster Analysis in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Interpreting-negative-CCC-values-in-a-Cluster-Analysis/m-p/131723#M6894</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Small correction: The CCC statistic is based on research by Warren Sarle, not Warren Kuhfeld.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Mon, 18 Jun 2012 14:06:20 GMT</pubDate>
    <dc:creator>Rick_SAS</dc:creator>
    <dc:date>2012-06-18T14:06:20Z</dc:date>
    <item>
      <title>Interpreting negative CCC values in a Cluster Analysis</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Interpreting-negative-CCC-values-in-a-Cluster-Analysis/m-p/131721#M6892</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I understand the idea od the CCC is to compare the R&lt;SUP&gt;2&lt;/SUP&gt; you get for a given set of clusters with the R&lt;SUP&gt;2&lt;/SUP&gt; you would get by clustering a unfoirmly distributed set of points in a&lt;EM&gt; p&lt;/EM&gt; dimensional space. However what if I get negative values in the CCC plot but the peaks in the CCC plot still indicate a number of clusters that explains a good deal of variation (as evidenced by the corresponding R&lt;SUP style="font-size: 10pt;"&gt;2 &lt;/SUP&gt;value with that number of clusters in the Cluster History table)? Please advise. Thanks!&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 03 Jun 2012 23:54:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Interpreting-negative-CCC-values-in-a-Cluster-Analysis/m-p/131721#M6892</guid>
      <dc:creator>GarlandJaeger</dc:creator>
      <dc:date>2012-06-03T23:54:37Z</dc:date>
    </item>
    <item>
      <title>Re: Interpreting negative CCC values in a Cluster Analysis</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Interpreting-negative-CCC-values-in-a-Cluster-Analysis/m-p/131722#M6893</link>
      <description>&lt;P&gt;The CCC is a statistic created by Warren Sarle of SAS nearly 30 years ago.&amp;nbsp; It is documented in Technical Report A-108.&amp;nbsp; On page 48 he writes, "If all values of the CCC are negative and decreasing for two or more clusters, the distribution is probably unimodal or long-tailed."&amp;nbsp; He goes on to say that very negative values may be due to outliers, which he recommends removing (not my recommended best practice).&amp;nbsp; In my experience, the CCC is a heuristic that needs to be triangulated with the approximate R2 as well as the distribution of the cluster frequencies.&amp;nbsp; For the CCC and R2, you want to look at their distribution across a set of solutions (e.g., wrap FASTCLUS in a macro and run solutions from 3 to 30) and examine solutions that have max values for those statistics, even when the CCC is negative.&amp;nbsp; Clusters that are highly irregularly distributed or have 1 or 2 clusters that are large with several small clusters are not appropriate and do not lead to good solutions.&amp;nbsp; In addition, it's important to note that FASTCLUS is a k-means algorithm, meaning that the clusters it produces are compact and spherical in shape.&amp;nbsp; If the shape of your clusters is irregular, you may want to consider a different algorithm, e.g., a nonparametric approach.&lt;/P&gt;</description>
      <pubDate>Mon, 03 Jul 2017 11:48:56 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Interpreting-negative-CCC-values-in-a-Cluster-Analysis/m-p/131722#M6893</guid>
      <dc:creator>xtc283</dc:creator>
      <dc:date>2017-07-03T11:48:56Z</dc:date>
    </item>
    <item>
      <title>Re: Interpreting negative CCC values in a Cluster Analysis</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Interpreting-negative-CCC-values-in-a-Cluster-Analysis/m-p/131723#M6894</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Small correction: The CCC statistic is based on research by Warren Sarle, not Warren Kuhfeld.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 18 Jun 2012 14:06:20 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Interpreting-negative-CCC-values-in-a-Cluster-Analysis/m-p/131723#M6894</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2012-06-18T14:06:20Z</dc:date>
    </item>
    <item>
      <title>Re: Interpreting negative CCC values in a Cluster Analysis</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Interpreting-negative-CCC-values-in-a-Cluster-Analysis/m-p/131724#M6895</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I always confuse those two myself.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 19 Apr 2013 18:11:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Interpreting-negative-CCC-values-in-a-Cluster-Analysis/m-p/131724#M6895</guid>
      <dc:creator>WarrenKuhfeld</dc:creator>
      <dc:date>2013-04-19T18:11:42Z</dc:date>
    </item>
  </channel>
</rss>

