BookmarkSubscribeRSS Feed
Ronein
Onyx | Level 15
Hello
I am building a credit score model
What is acceptable difference in Gini between in-sample( train data) to out of time ? For example; Gini in in-sample is 80% and Gini in out of time data is 82%. Is it good or bad? I afraid that 2% difference means model is not good?
2 REPLIES 2
Ksharp
Super User
Do you want to perform Superiority Testing or Noninferiority Testing ?
But that concept is not from Credit Score, is from bistatistic .

https://communities.sas.com/t5/Statistical-Procedures/How-to-run-Power-analysis-to-capture-sales-lif...

http://support.sas.com/kb/48/616.html
https://support.sas.com/kb/50/700.html

sbxkoenk
SAS Super FREQ

@Ronein wrote:
Hello
I am building a credit score model
What is acceptable difference in Gini between in-sample( train data) to out of time ? For example; Gini in in-sample is 80% and Gini in out of time data is 82%. Is it good or bad? I afraid that 2% difference means model is not good?

Gini in-sample (training data) = 80%

Gini out-of-sample and out-of-time = 82%

The Gini coefficient measures separation power. Similar to how it is used in economics to measure inequality. However for credit risk, the higher the Gini, the better.

 

So, I don't see the problem ... unless you fear a "too good to be true"-type of error.

 

Ciao, Koen

sas-innovate-2026-white.png



April 27 – 30 | Gaylord Texan | Grapevine, Texas

Registration is open

Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and lock in 2025 pricing—just $495!

Register now

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 1500 views
  • 1 like
  • 3 in conversation