BookmarkSubscribeRSS Feed
VanDalucas
Obsidian | Level 7

Hi,

 

I am using the Variable clustering node in order to choose some variables as input to clustering node. Should I also choose an unsigned variable i.e. a variable from CLUS 0 or should I totally ignore these variables and only choose one variable form CLUS1,CLUS2,... ?

 

Thanks a lot!

4 REPLIES 4
rayIII
SAS Employee

I would tend to ignore them unless you have a really strong reason to include them. They could be constants for example. 

 

It sounds like you are using interactive selection, but if you switch to Best Variables you can see that EM rejects the CLUS0 variables. 

 

Hope this helps.

 

Ray

VanDalucas
Obsidian | Level 7

Thank you very much Rey,

By the way, how many Variables do you usually input in cluster node? 2 to 7 is ok right?

rayIII
SAS Employee

Sure, no problem. While sometimes many more variables are used as inputs there is nothing wrong with clustering a small number. The iris data are often used as an example and it only has 4 inputs. 

 

https://support.sas.com/documentation/cdl/en/statug/63033/HTML/default/viewer.htm#statug_cluster_sec...

 

Ray

VanDalucas
Obsidian | Level 7

Thanks again Ray!

Ready to join fellow brilliant minds for the SAS Hackathon?

Build your skills. Make connections. Enjoy creative freedom. Maybe change the world. Registration is now open through August 30th. Visit the SAS Hackathon homepage.

Register today!
How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 4 replies
  • 1532 views
  • 2 likes
  • 2 in conversation