Hi Everyone,
I have build a decision tree for each month identifying customers who go bad. From April to May the gini coefficient jumps from 30% to around 50%. I think it is largely drive by one node in the decision tree which is high risk where the bads in that node go from 16% to around 50% from April to May. Would this line of thinking be correct or is the gini coefficient for this node around 0 because 50% of the obs are bad and the other 50 are good? I am looking at the equation for gini coefficient in the link below and am not wrapping my head around it.
Any help would be appreciated.
Thanks
April 27 – 30 | Gaylord Texan | Grapevine, Texas
Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and lock in 2025 pricing—just $495!
Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.
Find more tutorials on the SAS Users YouTube channel.