I used proc modeclus code with m=1 and k=2 to 10 by1 that is with different values of k but I could not decide which solution was the best.
Hello,
PROC MODECLUS has a (approximate) nonparametric test for the number of clusters.
Read this:
SAS/STAT 14.1 User's Guide
The MODECLUS Procedure
Details tab: Significance Tests
http://support.sas.com/documentation/cdl/en/statug/68162/HTML/default/viewer.htm#statug_modeclus_det...
Examples tab: Example 78.3 Cluster Analysis with Significance Tests
http://support.sas.com/documentation/cdl/en/statug/68162/HTML/default/viewer.htm#statug_modeclus_exa...
Read also:
Introduction to Clustering Procedures
The Number of Clusters
http://support.sas.com/documentation/cdl/en/statug/68162/HTML/default/viewer.htm#statug_introclus_se...
The last few paragraphs of the latter page deal with nonparametric clustering (and more in particular PROC MODECLUS).
It may be worthwhile to determine the number of clusters by using multiple methods and seeing if they agree.
Good luck,
Koen
Hello,
PROC MODECLUS has a (approximate) nonparametric test for the number of clusters.
Read this:
SAS/STAT 14.1 User's Guide
The MODECLUS Procedure
Details tab: Significance Tests
http://support.sas.com/documentation/cdl/en/statug/68162/HTML/default/viewer.htm#statug_modeclus_det...
Examples tab: Example 78.3 Cluster Analysis with Significance Tests
http://support.sas.com/documentation/cdl/en/statug/68162/HTML/default/viewer.htm#statug_modeclus_exa...
Read also:
Introduction to Clustering Procedures
The Number of Clusters
http://support.sas.com/documentation/cdl/en/statug/68162/HTML/default/viewer.htm#statug_introclus_se...
The last few paragraphs of the latter page deal with nonparametric clustering (and more in particular PROC MODECLUS).
It may be worthwhile to determine the number of clusters by using multiple methods and seeing if they agree.
Good luck,
Koen
SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!
ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.
Find more tutorials on the SAS Users YouTube channel.