How is variable importance calculated for a gradient boosting node in ...

PadmaroopaK · Posted 09-11-2020 09:45 AM

I have a simple gradient boosting model (maximum branch = 2, maximum depth = 1, i.e. AdaBoost-style stumps) in e-miner (v14.1) with a binary target and mostly interval inputs (~500 variables). I will be keeping variables whose variable importance is > 0.05 on both the training and validation datasets. However, I am trying to understand the mathematics behind how the "variable importance" is calculated. I read the documentation (decision tree variable importance), but it's very vague. I was wondering if anyone could shed light on how it is calculated with a simple example? It would be very helpful.

pink_poodle · Posted 09-18-2020 08:10 PM

Feature importance for a single decision tree: the amount by which each attribute's split point improves the performance measure, weighted by the number of observations the node is responsible for. The performance measure may be the purity (Gini index) used to select the split points, or another, more specific error function.
Overall feature importance: the per-tree feature importances averaged across all of the decision trees within the model.
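
To make the description above concrete, here is a minimal Python sketch (plain Python, not SAS code). It assumes Gini impurity as the performance measure and uses two made-up one-split trees; the variable names `age` and `income` and the label lists are hypothetical, and this is the generic textbook calculation rather than what Enterprise Miner necessarily does internally.

```python
from collections import defaultdict

def gini(labels):
    """Gini impurity of a list of 0/1 labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    p = sum(labels) / n
    return 2.0 * p * (1.0 - p)

def split_gain(parent, left, right):
    """Impurity decrease from one split, weighted by node sizes (counts)."""
    return (len(parent) * gini(parent)
            - len(left) * gini(left)
            - len(right) * gini(right))

def tree_importance(splits):
    """Sum the weighted gains per feature within one tree.
    splits: list of (feature, parent_labels, left_labels, right_labels)."""
    totals = defaultdict(float)
    for feature, parent, left, right in splits:
        totals[feature] += split_gain(parent, left, right)
    return dict(totals)

def model_importance(trees):
    """Average per-tree importances; a feature unused by a tree counts as 0."""
    features = {f for tree in trees for f in tree}
    return {f: sum(tree.get(f, 0.0) for tree in trees) / len(trees)
            for f in features}

# Hypothetical data: two depth-1 trees, each with a single split.
tree_1 = tree_importance([("age", [0, 0, 1, 1], [0, 0], [1, 1])])
tree_2 = tree_importance([("income", [0, 0, 1, 1], [0, 0, 1], [1])])

importance = model_importance([tree_1, tree_2])
print(importance)  # age is about 1.0, income about 0.333
```

The perfect split on `age` removes all impurity in its tree, so it ends up with the larger averaged importance.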

gcjfernandez · Posted 10-02-2020 02:10 AM

The Gradient Boosting node in SAS EM provides two approaches to evaluating the importance of a variable: split-based and observation-based.
The split-based approach uses the reduction in the sum of squares from splitting a node, summing over all nodes.
The observation-based approach uses the increase in a fit statistic that results from making the values of a variable uninformative.
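
The observation-based idea can be sketched in a few lines of Python (not SAS code). Everything here is an assumption for illustration: the fit statistic is average squared error, a variable is made uninformative by replacing it with its mean, and `predict` is a hypothetical stand-in for a fitted two-stump model. The node may use a different statistic and a different way of neutralizing a variable.

```python
def squared_error(predict, rows, targets):
    """Average squared error of a prediction function on a dataset."""
    return sum((predict(r) - t) ** 2 for r, t in zip(rows, targets)) / len(rows)

def neutralize(rows, name):
    """Copy of rows with one variable replaced by its mean,
    so that it no longer distinguishes between observations."""
    mean = sum(r[name] for r in rows) / len(rows)
    return [{**r, name: mean} for r in rows]

def observation_importance(predict, rows, targets, names):
    """Increase in the fit statistic when each variable is neutralized."""
    base = squared_error(predict, rows, targets)
    return {n: squared_error(predict, neutralize(rows, n), targets) - base
            for n in names}

# Hypothetical fitted model: the sum of two stumps.
def predict(row):
    score = 0.8 if row["x1"] > 2.5 else 0.2    # stump on x1
    score += 0.1 if row["x2"] > 0.5 else -0.1  # stump on x2
    return score

# Hypothetical data: x1 separates the target, x2 is unrelated to it.
rows = [{"x1": 1, "x2": 0}, {"x1": 2, "x2": 1},
        {"x1": 3, "x2": 0}, {"x1": 4, "x2": 1}]
targets = [0, 0, 1, 1]

obs_importance = observation_importance(predict, rows, targets, ["x1", "x2"])
print(obs_importance)  # x1 is about 0.30, x2 about 0.0
```

Neutralizing `x1` raises the error from 0.05 to 0.35, while neutralizing `x2` leaves it unchanged, so only `x1` registers as important in this toy case.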
Measures of variable importance generally underestimate the importance of correlated variables.

Two correlated variables could make a similar contribution to a model. The total contribution is usually divided between them, and neither variable acquires the rank it deserves.

Eliminating either variable generally increases the contribution attributed to the other.
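
A toy piece of arithmetic in Python illustrates the point. The per-stump reductions below are invented numbers, and the sketch assumes that when `x2` is dropped, `x1` takes over exactly the same splits with the same reductions, which is only plausible when the two variables are near-duplicates.

```python
from collections import defaultdict

def total_by_variable(stumps):
    """Sum the split reductions credited to each variable.
    stumps: list of (splitting_variable, reduction) pairs, one per tree."""
    totals = defaultdict(float)
    for variable, reduction in stumps:
        totals[variable] += reduction
    return dict(totals)

# Hypothetical model with both correlated variables available:
# the stumps happen to alternate between x1 and x2.
both = [("x1", 1.0), ("x2", 0.6), ("x1", 0.4), ("x2", 0.2)]

# Assumed outcome after removing x2: x1 takes over every split.
only_x1 = [("x1", reduction) for _, reduction in both]

with_both = total_by_variable(both)
without_x2 = total_by_variable(only_x1)

print(with_both)   # x1 about 1.4, x2 about 0.8
print(without_x2)  # x1 about 2.2
```

With both variables present, the shared contribution of roughly 2.2 is divided between them; once one is removed, the other is credited with all of it.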

PadmaroopaK · Posted 10-05-2020 10:09 AM

Thank you for your response.

I am looking at the split-based approach in my model. I find the "reduction in the sum of squares from splitting a node" explanation a little abstract. Is there any SAS white paper, or any way to see the actual calculation for at least one variable? I am interested in seeing the back-end calculation that produces those numbers.

Thanks!

gcjfernandez · Posted 10-05-2020 10:42 AM

I am attaching a screenshot from the SAS Enterprise Miner 14.3 Reference documentation, where you can find the official computation description.
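
For a rough feel of what a split-based calculation looks like, here is a minimal Python sketch (not SAS code, and not a reproduction of the documentation). The stumps and their response values are hypothetical. The final step, taking the square root of each total and scaling by the largest value, is an assumption about how the raw totals are turned into a relative importance; check it against the official description before relying on it.

```python
import math
from collections import defaultdict

def sse(values):
    """Sum of squared deviations from the node mean."""
    if not values:
        return 0.0
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)

def sse_reduction(parent, left, right):
    """Reduction in the sum of squares achieved by one split."""
    return sse(parent) - sse(left) - sse(right)

# Hypothetical depth-1 trees: (splitting variable, parent, left, right).
# In boosting, later trees are fit to residuals, hence the third stump.
stumps = [
    ("x1", [0, 0, 1, 1], [0, 0], [1, 1]),
    ("x2", [0, 0, 1, 1], [0, 0, 1], [1]),
    ("x1", [0.5, -0.5, 0.5, -0.5], [0.5, 0.5], [-0.5, -0.5]),
]

# Sum the reductions over every node that splits on each variable.
raw = defaultdict(float)
for variable, parent, left, right in stumps:
    raw[variable] += sse_reduction(parent, left, right)

# Assumed normalization: square root, then scale so the largest is 1.
rooted = {v: math.sqrt(total) for v, total in raw.items()}
largest = max(rooted.values())
relative = {v: r / largest for v, r in rooted.items()}

print(dict(raw))  # x1 is 2.0, x2 about 0.333
print(relative)   # x1 is 1.0, x2 about 0.408
```

With maximum depth = 1 every tree contributes exactly one split, so the sum over nodes is simply a sum over trees.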

PadmaroopaK · Posted 10-05-2020 10:11 AM

Is there a way for me to see this back-end computation in e-miner?

pink_poodle · Posted 10-05-2020 10:24 AM

Here are some formulas and background:
https://documentation.sas.com/?docsetId=casml&docsetTarget=viyaml_treesplit_details02.htm&docsetVers...
