SAS Data Science

Building models with SAS Enterprise Miner, SAS Factory Miner, SAS Viya (Machine Learning), SAS Visual Text Analytics, with point-and-click interfaces or programming
BookmarkSubscribeRSS Feed
manojkuchi
Calcite | Level 5

manojkuchi_0-1616440600998.png

How Does SAS calculate Dunn Index or Davies-Bouldin Index for K-means clustering?
Kindly help.
2 REPLIES 2
sbxkoenk
SAS Super FREQ

Hello,

 

I have done a lot of k-means clustering in the recent past and I have to say that, unfortunately, the DaviesBouldin index (DBI) and Dunn's separation index are not standardly available in proc FastClus (SAS/Stat) nor in proc kClus (Visual Statistics). Nor is the Silhouette score (another technique often used for evaluating clustering algorithms and/or determining the optimal number of clusters). SAS uses many typical statistical measures like RMSSTD (root-mean-square standard deviation) and the like for these purposes.

 

For DBI, you will need to start coding. You can probably use the distance measure (variable) output by the k-means clustering algo (distance of observation to the cluster-center as well as the cluster assignment / cluster membership are available in the out= data set).

There is also a useful procedure called PROC DISTANCE (SAS/Stat). Just mentioning it in case you would need it.

 

Good luck,

Koen

 

sbxkoenk
SAS Super FREQ

In addition to my previous response, ... this paper may also be of interest to you:

 

Paper 3409-2019 (SAS Global Forum 2019)
How to Evaluate Different Clustering Results
Ralph Abbey, SAS Institute Inc.

https://www.sas.com/content/dam/SAS/support/en/sas-global-forum-proceedings/2019/3409-2019.pdf

 

Cheers,

Koen

 

sas-innovate-white.png

Our biggest data and AI event of the year.

Don’t miss the livestream kicking off May 7. It’s free. It’s easy. And it’s the best seat in the house.

Join us virtually with our complimentary SAS Innovate Digital Pass. Watch live or on-demand in multiple languages, with translations available to help you get the most out of every session.

 

Register now!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 2311 views
  • 0 likes
  • 2 in conversation