<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Seed Initialization Method for Hierarchical Clustering in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Seed-Initialization-Method-for-Hierarchical-Clustering/m-p/239749#M3520</link>
    <description>&lt;P&gt;Dear all,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am a bit confuse about how cluster node in sas miner handle k-means and hierarchical clustering.&lt;/P&gt;&lt;P&gt;I read book 'Data Mining using SAS Enterprise Miner', it says that "The number of Cluster option actually determines if you would like to perform either hierarchical or partitive clustering(K-means). Hierarchical clustering can be performed by selecting the Automatic option.... Conversly, selecting the User Specify option perfors partitive clustering ". When we choose Number of clusters 'Automatic', we can choose clustering method under selection criterion either ward, average or centroid.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;My confusion here is, if I choose Number of clusters 'Automatic' which mean that I perform Hierarchical Clustering then what is the use of Seed Initialization Method ? I have tried that different Seed Initialization Method gave different number of clusters on automatic mode. It is confusing since as I understand, hierarchical clustering doesn't initialization seed.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Anybody can explain about it? Thank you&lt;/P&gt;</description>
    <pubDate>Thu, 17 Dec 2015 14:31:36 GMT</pubDate>
    <dc:creator>octavianidevi</dc:creator>
    <dc:date>2015-12-17T14:31:36Z</dc:date>
    <item>
      <title>Seed Initialization Method for Hierarchical Clustering</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Seed-Initialization-Method-for-Hierarchical-Clustering/m-p/239749#M3520</link>
      <description>&lt;P&gt;Dear all,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am a bit confuse about how cluster node in sas miner handle k-means and hierarchical clustering.&lt;/P&gt;&lt;P&gt;I read book 'Data Mining using SAS Enterprise Miner', it says that "The number of Cluster option actually determines if you would like to perform either hierarchical or partitive clustering(K-means). Hierarchical clustering can be performed by selecting the Automatic option.... Conversly, selecting the User Specify option perfors partitive clustering ". When we choose Number of clusters 'Automatic', we can choose clustering method under selection criterion either ward, average or centroid.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;My confusion here is, if I choose Number of clusters 'Automatic' which mean that I perform Hierarchical Clustering then what is the use of Seed Initialization Method ? I have tried that different Seed Initialization Method gave different number of clusters on automatic mode. It is confusing since as I understand, hierarchical clustering doesn't initialization seed.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Anybody can explain about it? Thank you&lt;/P&gt;</description>
      <pubDate>Thu, 17 Dec 2015 14:31:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Seed-Initialization-Method-for-Hierarchical-Clustering/m-p/239749#M3520</guid>
      <dc:creator>octavianidevi</dc:creator>
      <dc:date>2015-12-17T14:31:36Z</dc:date>
    </item>
    <item>
      <title>Re: Seed Initialization Method for Hierarchical Clustering</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Seed-Initialization-Method-for-Hierarchical-Clustering/m-p/239835#M3521</link>
      <description>&lt;P&gt;First if this book is Randall Matignon book then this book is old and based on Miner v4. It still may be helpful to understand more about Miner, but i am not sure if it is the best resource to learn.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;A href="https://communities.sas.com/t5/SAS-Communities-Library/Tip-Guidelines-for-Choosing-a-Clustering-Method-in-the-Cluster/ta-p/223483?utm_campaign=Data+Management&amp;amp;utm_medium=social&amp;amp;utm_source=GOOGLE_PLUS_PAGE" target="_self"&gt;Tip: Guidelines for Choosing a Clustering Method in the Cluster Node&lt;/A&gt;&amp;nbsp;,&amp;nbsp;&lt;/STRONG&gt;&amp;nbsp;this articale give good explanation of&amp;nbsp;&lt;SPAN&gt;the &lt;STRONG&gt;automatic&lt;/STRONG&gt; selection process.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;It is not&amp;nbsp;&lt;SPAN&gt;k-means and hierarchical clustering. What happen is based on the number of seeds (50 by default) training data are distributed to the closest seed. Then the means of these intial clusters are calculated. After that the&amp;nbsp;hierarchical clustering consolidate these clusters and within the CCC is calculated. Finally the final number of cluster provide the K&amp;nbsp;in K-means and&amp;nbsp;the clusters are obtained using a k-means algorithm.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;So yes, the number of seeds affect the final number of cluster in the&amp;nbsp;automatic selection process.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 17 Dec 2015 20:36:28 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Seed-Initialization-Method-for-Hierarchical-Clustering/m-p/239835#M3521</guid>
      <dc:creator>mohamed_zaki</dc:creator>
      <dc:date>2015-12-17T20:36:28Z</dc:date>
    </item>
    <item>
      <title>Re: Seed Initialization Method for Hierarchical Clustering</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Seed-Initialization-Method-for-Hierarchical-Clustering/m-p/306423#M4570</link>
      <description>Hi Mohamed, correct me if I am wrong&lt;BR /&gt;&lt;BR /&gt;Let's say the ward algorithm is completed and using the CCC criterion we find out that K=5. Since ward method acts by merging clusters (starting by assigning one cluster to each observation) then at some point there must have been k-clusters. After K-1 repetitions of merging we are left we 1 cluster, at this point the ward algorithm has done its job and K-mean is initiated.&lt;BR /&gt;&lt;BR /&gt;My question is: Does the K-mean algorithm to create the final number of K-clusters takes into accout the K clusters that were created K-1 repetitions before the end of the ward method or does is randomly chooses K observations as seeds for the final K clusters&lt;BR /&gt;&lt;BR /&gt;Thank you</description>
      <pubDate>Fri, 21 Oct 2016 19:29:15 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Seed-Initialization-Method-for-Hierarchical-Clustering/m-p/306423#M4570</guid>
      <dc:creator>VanDalucas</dc:creator>
      <dc:date>2016-10-21T19:29:15Z</dc:date>
    </item>
  </channel>
</rss>

