BookmarkSubscribeRSS Feed
Mgarret
Obsidian | Level 7

All,

 

I’m using  Proc fastclus in SAS to perform a cluster analysis.  I’m trying to figure out a way to determine the relative importance of variables within a cluster.  So, what variables are the primary drivers within a cluster or variables have the most predictive power, so to speak. And rank order. 

 

Here is an  example . 

 

Using the variables A, B, C, D,  E,  F I build a cluster model with 3 segments. 

 

 

   proc fastclus data= DATA_SET maxc=3  out=CUSTER_Results ;
      var A B C D E F ;run;

 

I want to rank order the variables by relative importance within  each cluster.

 

Cluster 1

Rank order of variable Importance:

  1. B – Primary Driver of segment (most predictive)
  2. D
  3. A
  4. C
  5. E
  6. F – Least predictive

Cluster 2

Rank order of variable Importance:

  1. A – Primary Driver of segment (most predictive
  2. C
  3. F
  4. E
  5. B
  6. D – Least predictive

 

Cluster 3

Rank order of variable Importance:

  1. D – Primary Driver of segment (most predictive
  2. A
  3. C
  4. B
  5. F
  6. E – Least predictive

Is there an option for proc fastclus which will do this automatically? In not, any recommendations on how to determine the predictive rank order?

 

Thanks. 

1 REPLY 1
ballardw
Super User

First thing to consider from the documentation:

 

PROC FASTCLUS uses algorithms that place a larger influence on variables with larger variance, so it might be necessary to standardize the variables before performing the cluster analysis.

 

So did you examine your variables before the clustering to identify the variance, or differences in variance, between your variables?

Ready to join fellow brilliant minds for the SAS Hackathon?

Build your skills. Make connections. Enjoy creative freedom. Maybe change the world. Registration is now open through August 30th. Visit the SAS Hackathon homepage.

Register today!
What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 1488 views
  • 0 likes
  • 2 in conversation