Hi all,

I have a customer data set with approximately 5 million records, with data on customers collected over the past 10 to 15 years. My goal is to divide the customers into RFM bins. The recency, frequency, and monetary value variables are all extremely skewed.

For example, for recency, about 60-70% of customers have recency 1, 5-10% have recency between 2 and 5, 2% between 5 and 10, and so on, with about 0.1% above 100.

Monetary value is similar. It ranges from 0 to 10,000,000: about 30% of customers spent less than $5, about 20% spent between 5 and 10, 30% between 10 and 100, 20% between 100 and 1,000, 10% between 100 and 10,000, 8% between 10,000 and 100,000, and so on, with about 0.01% above 1,000,000. A similar scenario holds for the recency variable.

I need to decide how the splits should be done. I have access to SAS EG. Any idea or solution is much appreciated.

Thank you so much in advance for your time.

- Avinash