Hi,
I am using the Variable clustering node in order to choose some variables as input to clustering node. Should I also choose an unsigned variable i.e. a variable from CLUS 0 or should I totally ignore these variables and only choose one variable form CLUS1,CLUS2,... ?
Thanks a lot!
I would tend to ignore them unless you have a really strong reason to include them. They could be constants for example.
It sounds like you are using interactive selection, but if you switch to Best Variables you can see that EM rejects the CLUS0 variables.
Hope this helps.
Ray
Thank you very much Rey,
By the way, how many Variables do you usually input in cluster node? 2 to 7 is ok right?
Sure, no problem. While sometimes many more variables are used as inputs there is nothing wrong with clustering a small number. The iris data are often used as an example and it only has 4 inputs.
Ray
Thanks again Ray!
Don’t miss the livestream kicking off May 7. It’s free. It’s easy. And it’s the best seat in the house.
Join us virtually with our complimentary SAS Innovate Digital Pass. Watch live or on-demand in multiple languages, with translations available to help you get the most out of every session.
Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.
Find more tutorials on the SAS Users YouTube channel.