SAS Data Science

Building models with SAS Enterprise Miner, SAS Factory Miner, SAS Viya (Machine Learning), SAS Visual Text Analytics, with point-and-click interfaces or programming
BookmarkSubscribeRSS Feed
sasuser_621
Calcite | Level 5

Apologies for the simple question. Is there a way to implement Partition Around Medoids (PAM) clustering, or k-medoids generally, in SAS?

 

I have looked at the official SAS documentation on clustering (also attached). There does not seem to be any procedure that uses k-medoids for clustering, unless I overlooked it.

If there is some SAS code to implement this, or to implement algorithms that apply PAM (e.g. CLARA from Kaufmann & Rousseeuw 1990, CLARANS from Ng & Han 1994), that would be a huge help.

Thank you!

1 REPLY 1
DougWielenga
SAS Employee

I have looked at the official SAS documentation on clustering (also attached). There does not seem to be any procedure that uses k-medoids for clustering, unless I overlooked it.

 

The k-medoids algorithm is similar to k-means but uses actual observations rather than 'center of mass' to identify the middle of each cluster.   Some authors suggest this makes the k-medoid approach more robust against outliers than k-means, but its requirement to compute all pairwise differences between each point in the cluster means that this particular approach does not scale well and is therefore note likely a good choice for most data mining problems. 

 

SAS Enterprise Miner uses observations which are sufficiently separated to create the initial cluster seeds before assigning the observations to clusters, but the final seeds are not required to be actual values.   I do not know of a way to constrain it to mimic the k-medoid approach but neither do I see that it would have a clear advantage in most cases over k-means which you can do with the FASTCLUS procedure or with the Cluster node which uses both FASTCLUS to get initial seeds and CLUSTER to group those seeds.

 

Hope this helps!

Doug

sas-innovate-white.png

Our biggest data and AI event of the year.

Don’t miss the livestream kicking off May 7. It’s free. It’s easy. And it’s the best seat in the house.

Join us virtually with our complimentary SAS Innovate Digital Pass. Watch live or on-demand in multiple languages, with translations available to help you get the most out of every session.

 

Register now!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 3947 views
  • 0 likes
  • 2 in conversation