Hue
Fluorite | Level 6

Dear all,

I am working on customer segmentation. My data includes 4 numeric variables and 7 categorical variables.

I want to use a clustering method to classify the customers.

If I convert the categorical variables to dummy variables and then run the K-means algorithm on the new data, will that distort the clustering result?

Could you advise me on how to cluster data with both numeric and categorical variables in SAS?

Thanks for your help!

Hue

3 REPLIES
Ksharp
Super User

If your data mixes numeric and character variables, check the generalized logit model (a multinomial model).

abuanuazu
Fluorite | Level 6
Thank you for your solution to the problem.
ilknurkabul
SAS Employee

Clustering methods in EM (SAS Enterprise Miner) convert nominal variables into dummy variables. There are different encoding techniques available in the Cluster node. You can try those options and check the compactness of the resulting clusters. If you are using procs, you can also try PROC CLUSTER with distance/similarity measures (such as Jaccard).
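
To make the dummy-variable and Jaccard idea above concrete, here is a minimal sketch in plain Python (not SAS code, and not how EM or PROC CLUSTER implement it internally). The customer records, the variable names `region`, `plan`, and `channel`, and the helper functions `one_hot` and `jaccard_distance` are all hypothetical and exist only for illustration. It dummy-codes three nominal variables and computes the Jaccard distance between customers on those dummies.

```python
def one_hot(records, nominal_keys):
    """Dummy-code nominal variables: one 0/1 column per observed level."""
    levels = {k: sorted({r[k] for r in records}) for k in nominal_keys}
    columns = [(k, v) for k in nominal_keys for v in levels[k]]
    rows = [[1 if r[k] == v else 0 for k, v in columns] for r in records]
    return columns, rows


def jaccard_distance(a, b):
    """1 - (columns where both are 1) / (columns where at least one is 1)."""
    both = sum(1 for x, y in zip(a, b) if x == 1 and y == 1)
    either = sum(1 for x, y in zip(a, b) if x == 1 or y == 1)
    return 0.0 if either == 0 else 1.0 - both / either


# Hypothetical customers with nominal variables only.
customers = [
    {"region": "north", "plan": "basic", "channel": "web"},
    {"region": "north", "plan": "basic", "channel": "store"},
    {"region": "south", "plan": "premium", "channel": "store"},
]

columns, dummies = one_hot(customers, ["region", "plan", "channel"])

d01 = jaccard_distance(dummies[0], dummies[1])  # differ in one variable
d02 = jaccard_distance(dummies[0], dummies[2])  # differ in all three
print(d01, d02)
```

Customers that share more category levels end up closer together. Note that this sketch covers only the nominal variables; the numeric variables from the original question would still need their own scaling and a distance measure that can combine both types.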

