<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: replicability  and stability of clustering  by K means using HP nodes sas Miner in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787855#M38620</link>
    <description>Thanks Reeza for he quick reply. I did choose the global peak value for estimation criterion and choose the number of cluster between 2 and 10 and the result give me 6 clusters. how I can now is my cluster are stable is there any way to know the number of iterations to find the stability? btw I choose the Euclidean distance for the similarity distance.</description>
    <pubDate>Thu, 30 Dec 2021 21:36:41 GMT</pubDate>
    <dc:creator>HLuffy</dc:creator>
    <dc:date>2021-12-30T21:36:41Z</dc:date>
    <item>
      <title>replicability  and stability of clustering  by K means using HP nodes sas Miner</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787845#M38617</link>
      <description>&lt;P&gt;I create a n clusters using SAS miner &lt;STRONG&gt;HP cluster&lt;/STRONG&gt; nodes( &lt;STRONG&gt;K means&lt;/STRONG&gt; ). but every time I try to replicate the same clusters it give a different clusters. even using EG with different initializations give me different clusters.&amp;nbsp; my questions are :&lt;/P&gt;&lt;P&gt;1- Is&amp;nbsp; there a way to fix this clusters and make it my work &lt;STRONG&gt;replicable&lt;/STRONG&gt;?&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;P&gt;2- If I can't fixe my clusters is there a way to test the &lt;STRONG&gt;stability&lt;/STRONG&gt; of my clusters using for example an overlap rate and said after 75 % we can said that the clusters are stable?&amp;nbsp;&lt;/P&gt;&lt;P&gt;3- I couldn't find any straight forward answer for the stability of the clusters and how it's important.&amp;nbsp;can we speak about the stability of clustering in this situation? is it very important to test the stability before use the clusters? which measures can do that ? is there any nodes in sas miner can do that?&lt;/P&gt;&lt;P&gt;I'm a little bit lost with this question of the stability. thank you for your understanding !!&lt;/P&gt;</description>
      <pubDate>Thu, 30 Dec 2021 20:54:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787845#M38617</guid>
      <dc:creator>HLuffy</dc:creator>
      <dc:date>2021-12-30T20:54:55Z</dc:date>
    </item>
    <item>
      <title>Re: replicability  and stability of clustering  by K means using HP nodes sas Miner</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787851#M38618</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;K-Means clustering doesn't have a single unique solution, more so, there's a set of possible solutions and it's about picking one that makes the most sense for your use case. Especially if you change the initialization parameters then the clusters will be different.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If your clusters are unstable it means your clusters are possibly not unique enough and you should reduce the number of clusters to get a more stable solution. How did you pick the number of clusters?&lt;/P&gt;</description>
      <pubDate>Thu, 30 Dec 2021 21:15:19 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787851#M38618</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2021-12-30T21:15:19Z</dc:date>
    </item>
    <item>
      <title>Re: replicability  and stability of clustering  by K means using HP nodes sas Miner</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787855#M38620</link>
      <description>Thanks Reeza for he quick reply. I did choose the global peak value for estimation criterion and choose the number of cluster between 2 and 10 and the result give me 6 clusters. how I can now is my cluster are stable is there any way to know the number of iterations to find the stability? btw I choose the Euclidean distance for the similarity distance.</description>
      <pubDate>Thu, 30 Dec 2021 21:36:41 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787855#M38620</guid>
      <dc:creator>HLuffy</dc:creator>
      <dc:date>2021-12-30T21:36:41Z</dc:date>
    </item>
    <item>
      <title>Re: replicability  and stability of clustering  by K means using HP nodes sas Miner</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787856#M38621</link>
      <description>I'm not aware of a stability measure (it may exist, just not aware) but it's pretty subject dependent as well I suspect. &lt;BR /&gt;For the # of clusters, did you look at the graphs and use the elbow method to determine the optimal # of clusters?&lt;BR /&gt;And just as an FYI stability isn't always possible in a clustering model and you'll almost never get 100% stability with real data. &lt;BR /&gt;</description>
      <pubDate>Thu, 30 Dec 2021 21:44:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787856#M38621</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2021-12-30T21:44:42Z</dc:date>
    </item>
    <item>
      <title>Re: replicability  and stability of clustering  by K means using HP nodes sas Miner</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787862#M38624</link>
      <description>the number was determined by ABC criterion. but my pb it's not in the number of the cluster but in the clusters they change after every repetition. is is the number of the cluster that cause that? I know that I can had 100 % stability but is there any heuristic rules or academics way after a certain percentage take decision about the stability?&lt;BR /&gt;</description>
      <pubDate>Thu, 30 Dec 2021 22:13:07 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787862#M38624</guid>
      <dc:creator>HLuffy</dc:creator>
      <dc:date>2021-12-30T22:13:07Z</dc:date>
    </item>
    <item>
      <title>Re: replicability  and stability of clustering  by K means using HP nodes sas Miner</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787864#M38625</link>
      <description>&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="HLuffy_0-1640904456754.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/67031iDF04BF77B05790D6/image-size/medium?v=v2&amp;amp;px=400" role="button" title="HLuffy_0-1640904456754.png" alt="HLuffy_0-1640904456754.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;here's a picture of selection # of clusters using ABC selection&lt;/P&gt;</description>
      <pubDate>Thu, 30 Dec 2021 22:48:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787864#M38625</guid>
      <dc:creator>HLuffy</dc:creator>
      <dc:date>2021-12-30T22:48:12Z</dc:date>
    </item>
    <item>
      <title>Re: replicability  and stability of clustering  by K means using HP nodes sas Miner</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787871#M38627</link>
      <description>Sorry, not super familiar with the output of EM Miner for Clustering. I definitely cannot interpret a graph without axes, context or titles.</description>
      <pubDate>Thu, 30 Dec 2021 22:44:04 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787871#M38627</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2021-12-30T22:44:04Z</dc:date>
    </item>
    <item>
      <title>Re: replicability  and stability of clustering  by K means using HP nodes sas Miner</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787875#M38628</link>
      <description>I updated the graph if that can help you to understand . ABC Statistics — displays the aligned box criterion statistics. The horizontal axis is the number of clusters and the vertical axis is the gap between the error measure from the reference data and the input data. The vertical line indicates the estimated number of clusters for the data.&lt;BR /&gt;Estimation Criterion — specifies the estimation criterion used in the aligned box criterion method. Global Peak Value uses the maximum peak value across all peak values in the gap statistics.</description>
      <pubDate>Thu, 30 Dec 2021 23:06:53 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787875#M38628</guid>
      <dc:creator>HLuffy</dc:creator>
      <dc:date>2021-12-30T23:06:53Z</dc:date>
    </item>
    <item>
      <title>Re: replicability  and stability of clustering  by K means using HP nodes sas Miner</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787935#M38635</link>
      <description>Sorry, this is beyond my current recall for K-Means.</description>
      <pubDate>Fri, 31 Dec 2021 17:34:34 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787935#M38635</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2021-12-31T17:34:34Z</dc:date>
    </item>
    <item>
      <title>Re: replicability  and stability of clustering  by K means using HP nodes sas Miner</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787947#M38636</link>
      <description>Thank you so much &lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13879"&gt;@Reeza&lt;/a&gt; !</description>
      <pubDate>Fri, 31 Dec 2021 18:32:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/replicability-and-stability-of-clustering-by-K-means-using-HP/m-p/787947#M38636</guid>
      <dc:creator>HLuffy</dc:creator>
      <dc:date>2021-12-31T18:32:50Z</dc:date>
    </item>
  </channel>
</rss>

