Building models with SAS Enterprise Miner, SAS Factory Miner, SAS Visual Data Mining and Machine Learning or just with programming

Clustering with both Numerics and Categories data in SAS EM 13.1

Reply
Occasional Contributor Hue
Occasional Contributor
Posts: 5

Clustering with both Numerics and Categories data in SAS EM 13.1

Dear all,

 

I am making Customer segmentation. My data included numeric data (4 variables) and categories data (7 variables).

I used clustering method to classified customer.

If I transfer categories data to dummy and then use K-Means algorithms for new data, It make falsifying clustering result or not?

Could you advice to me the way that I can use to cluster for both Numeric and Categories data on SAS?

 

Thanks for your help!

Hue

 

Super User
Posts: 9,691

Re: Clustering with both Numerics and Categories data in SAS EM 13.1

If you mixed up numeric and character variable , Check Generalized Logits Model (a multinomial model ).

Occasional Contributor
Posts: 13

Re: Clustering with both Numerics and Categories data in SAS EM 13.1

Thank you for your solution to the problem.
SAS Employee
Posts: 1

Re: Clustering with both Numerics and Categories data in SAS EM 13.1

Clustering methods in EM convert nominal variables into dummy variables. There are different encoding techniques available in the cluster node.  You can try those options and check the compactness of clusters.  If you are using procs, you can also try PROC CLUSTER by using distance/similarity measures (such as jaccard). 

Ask a Question
Discussion stats
  • 3 replies
  • 261 views
  • 2 likes
  • 4 in conversation