BookmarkSubscribeRSS Feed
Alexi
Calcite | Level 5

How would I identify important variables before doing any cluster analysis, I have a big data base of customer (purchase,loyalty..), 150 feature.

I want to work in two step: 1- Eliminate feature which is unuseful for clustering, e.g. an equivalent of correlation analyse between target and feature in supervised clustering. 2-Eliminate correlated feature, use cluster of variables, PCA. Then i do the classification.

I'm very well in the second step, but i have no idea for the first step, what is the equivalent in unsupervised clustering? which exploratory analysis can help me?

2 REPLIES 2
Ksharp
Super User
Check variable cluster analysis.

proc varclus

MelodieRush
SAS Employee

You can also use the Variable Cluster Node under Explore2017-04-07_13-24-40.jpg

Catch the SAS Global Forum keynotes, announcements, and tech content!
sasglobalforum.com | #SASGF



hackathon24-white-horiz.png

2025 SAS Hackathon: There is still time!

Good news: We've extended SAS Hackathon registration until Sept. 12, so you still have time to be part of our biggest event yet – our five-year anniversary!

Register Now

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 1485 views
  • 0 likes
  • 3 in conversation