<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Proc fastclus: Relative importance of Variables within Clusters in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-fastclus-Relative-importance-of-Variables-within-Clusters/m-p/405371#M21137</link>
    <description>&lt;P&gt;First thing to consider from the documentation:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;PROC FASTCLUS uses algorithms that place a larger influence on variables with larger variance, so it might be necessary to standardize the variables before performing the cluster analysis.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;So did you examine your variables before the clustering to identify the variance, or differences in variance, between your variables?&lt;/P&gt;</description>
    <pubDate>Wed, 18 Oct 2017 21:59:41 GMT</pubDate>
    <dc:creator>ballardw</dc:creator>
    <dc:date>2017-10-18T21:59:41Z</dc:date>
    <item>
      <title>Proc fastclus: Relative importance of Variables within Clusters</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-fastclus-Relative-importance-of-Variables-within-Clusters/m-p/405255#M21133</link>
      <description>&lt;P&gt;All,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I’m using&amp;nbsp; &lt;STRONG&gt;Proc fastclus&lt;/STRONG&gt; in SAS to perform a cluster analysis.&amp;nbsp; I’m trying to figure out a way to determine the relative importance of variables within a cluster.&amp;nbsp; So, what&amp;nbsp;&lt;SPAN&gt;variables&lt;/SPAN&gt;&amp;nbsp;are the primary drivers within a cluster or variables have the most predictive power, so to speak. And rank order.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Here is an &amp;nbsp;example .&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Using the variables A, B, C, D, &amp;nbsp;E, &amp;nbsp;F I build a cluster model with 3 segments.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&amp;nbsp;&amp;nbsp; proc fastclus data= DATA_SET maxc=3  out=CUSTER_Results ;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; var A B C D E F ;run;&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I want to rank order the variables by relative importance within &amp;nbsp;each cluster.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;U&gt;Cluster 1 &lt;/U&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Rank order of variable Importance:&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;B – Primary Driver of segment (most predictive)&lt;/LI&gt;
&lt;LI&gt;D&lt;/LI&gt;
&lt;LI&gt;A&lt;/LI&gt;
&lt;LI&gt;C&lt;/LI&gt;
&lt;LI&gt;E&lt;/LI&gt;
&lt;LI&gt;F – Least predictive&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;U&gt;Cluster 2 &lt;/U&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Rank order of variable Importance:&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;A – Primary Driver of segment (most predictive&lt;/LI&gt;
&lt;LI&gt;C&lt;/LI&gt;
&lt;LI&gt;F&lt;/LI&gt;
&lt;LI&gt;E&lt;/LI&gt;
&lt;LI&gt;B&lt;/LI&gt;
&lt;LI&gt;D – Least predictive&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;U&gt;Cluster 3&lt;/U&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Rank order of variable Importance:&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;D – Primary Driver of segment (most predictive&lt;/LI&gt;
&lt;LI&gt;A&lt;/LI&gt;
&lt;LI&gt;C&lt;/LI&gt;
&lt;LI&gt;B&lt;/LI&gt;
&lt;LI&gt;F&lt;/LI&gt;
&lt;LI&gt;E – Least predictive&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;Is there an option for &lt;STRONG&gt;proc fastclus&lt;/STRONG&gt; which will do this automatically? In not, any recommendations on how to determine the predictive rank order?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 18 Oct 2017 15:59:59 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-fastclus-Relative-importance-of-Variables-within-Clusters/m-p/405255#M21133</guid>
      <dc:creator>Mgarret</dc:creator>
      <dc:date>2017-10-18T15:59:59Z</dc:date>
    </item>
    <item>
      <title>Re: Proc fastclus: Relative importance of Variables within Clusters</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-fastclus-Relative-importance-of-Variables-within-Clusters/m-p/405371#M21137</link>
      <description>&lt;P&gt;First thing to consider from the documentation:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;PROC FASTCLUS uses algorithms that place a larger influence on variables with larger variance, so it might be necessary to standardize the variables before performing the cluster analysis.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;So did you examine your variables before the clustering to identify the variance, or differences in variance, between your variables?&lt;/P&gt;</description>
      <pubDate>Wed, 18 Oct 2017 21:59:41 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-fastclus-Relative-importance-of-Variables-within-Clusters/m-p/405371#M21137</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2017-10-18T21:59:41Z</dc:date>
    </item>
  </channel>
</rss>

