BookmarkSubscribeRSS Feed
JeremySmith
Calcite | Level 5

I've been working on a clustering project where I cluster customers based on their thresholds. 

 

For example

 

Distance from centroid:        Avg TH for Cluster:          Target:     

0.0617                                       8977                                     8000

 

 

The goal of this project is to be able to predict thresholds for a customers based on their cluster average threshold and the distance from that cluster's centroid.

 

For the above example, when scaled the values would look like:

 

Distance from centroid:        Avg TH for Cluster:          Target:     

0.89                                           8977                                     8000

 

(0.89) * 8977 = 8000

 

However, there is an obvious scaling issue between the distance from centroid values and the average threshold per cluster value. 

I've explored Proc Transgres and OPSCAL, however I am not sure how I can incorporate the multiplication of the two values to get the target value.

1 REPLY 1
ballardw
Super User
By "Transgres" did you mean Transreg?

Perhaps STDIZE before the clustering?

hackathon24-white-horiz.png

2025 SAS Hackathon: There is still time!

Good news: We've extended SAS Hackathon registration until Sept. 12, so you still have time to be part of our biggest event yet – our five-year anniversary!

Register Now

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 1 reply
  • 544 views
  • 0 likes
  • 2 in conversation