BookmarkSubscribeRSS Feed
lcritch
Calcite | Level 5

I am running a series of random forest models using proc hpforest. I am having a difficult time understanding NEGATIVE OOB gini coefficients from the variable importance output. Is one not to interpret any variables with negative OOB gini coefficients as meaningfully related to the target variable? Thus, should I interpret only those variables with positive OOB gini coefficients? 

 

Also, to get to a final model should I be dropping variables from the model with negative (inbag) gini coefficients and/or negative (inbag) margin coefficients (as was suggested in previous posts)?

 

I can't find a good explanation to all this, so any help is appreciated! Thank you.

hackathon24-white-horiz.png

2025 SAS Hackathon: There is still time!

Good news: We've extended SAS Hackathon registration until Sept. 12, so you still have time to be part of our biggest event yet – our five-year anniversary!

Register Now

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 0 replies
  • 1521 views
  • 1 like
  • 1 in conversation