BookmarkSubscribeRSS Feed
Ann_90
Calcite | Level 5

Details about the data source: credit card churn data 

It has only 5% churn data. Because of this rare event, tried changing the decision weights at the data source.

Here is a screenshot of the tree:

I have taken care of the quasi-separation/separation issues  by excluding them from the metadata node which is then connected to the decision tree node.

1. However, the results does not show a proper split between churn and non-churn. How to overcome this problem? Kindly help. Thanks

 

2. Also, wanted to check if choose to change the decision weights at the data source, should the assessment measure in the decision tree properties(panel) only be 'DECISION'? or can I use other assessment measures? What does the LIFT measure mean? I noticed that using the lift measure helped to improve the distribution of the churn and non-churn in my best nodes. But am not sure what the LIFT measure means.

 

3. Under the split search option for decision trees, should 'use decision' be set to Yes since decision weights were selected at the data source?

 

Kindly help with the above doubts! really appreciate it!

 

Ann_90_1-1585879526478.png

 

Ann_90_2-1585879552751.png

 

 

 

Ready to join fellow brilliant minds for the SAS Hackathon?

Build your skills. Make connections. Enjoy creative freedom. Maybe change the world. Registration is now open through August 30th. Visit the SAS Hackathon homepage.

Register today!
How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 0 replies
  • 523 views
  • 0 likes
  • 1 in conversation