Hi all,
Are there any more advanced and robust methods in Base SAS, besides PROC VARCLUS or principal components, that can be used for variable reduction?
I am trying to run a cluster analysis with over a hundred variables, so I was wondering if there is anything that can help reduce the number of variables and also identify the strongest discriminators in my data.
Kind regards
Hi Chemicalab,
PROC PRINCOMP and PROC VARCLUS are the go-to methods in Base SAS, as you mention.
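For reference, a minimal sketch of how the two procedures might be called for variable reduction. The data set name WORK.MYDATA, the variable list x1-x100, the MAXEIGEN= cutoff of 0.7, and the choice of 10 components are all placeholders assumed for illustration, not values from this thread, so adjust them to your data.

```sas
/* Assumed placeholders: WORK.MYDATA and x1-x100 stand in for your own data. */

/* Variable clustering: keep splitting clusters until the second eigenvalue
   of every cluster is at or below 0.7 (an assumed cutoff).
   SHORT trims the printed output; HI keeps the clusters hierarchical. */
proc varclus data=work.mydata maxeigen=0.7 short hi;
   var x1-x100;
run;

/* Principal components: keep the first 10 components (an assumed number)
   and write their scores, Prin1-Prin10, to WORK.PC_SCORES.
   STD scales the scores to unit variance. */
proc princomp data=work.mydata out=work.pc_scores n=10 std;
   var x1-x100;
run;
```

With PROC VARCLUS, a common way to reduce the list is to keep one representative per cluster, typically the variable with the lowest "1-R**2 Ratio" in the cluster summary table. With PROC PRINCOMP, the cluster analysis would instead run on the component scores rather than on the original variables.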
A different approach, if you have access to SAS Enterprise Miner: calculate variable importance using a tree-based model node, then check the resulting importance for each of your variables.
Please note that these nodes have the variable selection option set to Yes by default. This means that if you connect any of these nodes to a Cluster node, only the most important variables (relative variable importance greater than or equal to 0.05) will be passed on. A few considerations below.
I hope it helps,
Thanks,
Miguel
Unfortunately I don't have EM, so I guess I will have to go with PROC PRINCOMP or PROC VARCLUS. Thank you for the reply.