BookmarkSubscribeRSS Feed
Ronein
Meteorite | Level 14
Hello
I am building a credit score model
What is acceptable difference in Gini between in-sample( train data) to out of time ? For example; Gini in in-sample is 80% and Gini in out of time data is 82%. Is it good or bad? I afraid that 2% difference means model is not good?
2 REPLIES 2
Ksharp
Super User
Do you want to perform Superiority Testing or Noninferiority Testing ?
But that concept is not from Credit Score, is from bistatistic .

https://communities.sas.com/t5/Statistical-Procedures/How-to-run-Power-analysis-to-capture-sales-lif...

http://support.sas.com/kb/48/616.html
https://support.sas.com/kb/50/700.html

sbxkoenk
SAS Super FREQ

@Ronein wrote:
Hello
I am building a credit score model
What is acceptable difference in Gini between in-sample( train data) to out of time ? For example; Gini in in-sample is 80% and Gini in out of time data is 82%. Is it good or bad? I afraid that 2% difference means model is not good?

Gini in-sample (training data) = 80%

Gini out-of-sample and out-of-time = 82%

The Gini coefficient measures separation power. Similar to how it is used in economics to measure inequality. However for credit risk, the higher the Gini, the better.

 

So, I don't see the problem ... unless you fear a "too good to be true"-type of error.

 

Ciao, Koen

hackathon24-white-horiz.png

The 2025 SAS Hackathon Kicks Off on June 11!

Watch the live Hackathon Kickoff to get all the essential information about the SAS Hackathon—including how to join, how to participate, and expert tips for success.

YouTube LinkedIn

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 1132 views
  • 1 like
  • 3 in conversation