<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How is variable importance calculated for a gradient boosting node in e-miner? in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/How-is-variable-importance-calculated-for-a-gradient-boosting/m-p/688887#M8458</link>
    <description>&lt;P&gt;I&amp;nbsp; am attaching the screenshot from SAS Enterprise miner Reference documentation 14.3 where you can find the official computation description.&lt;/P&gt;</description>
    <pubDate>Mon, 05 Oct 2020 14:42:21 GMT</pubDate>
    <dc:creator>gcjfernandez</dc:creator>
    <dc:date>2020-10-05T14:42:21Z</dc:date>
    <item>
      <title>How is variable importance calculated for a gradient boosting node in e-miner?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-is-variable-importance-calculated-for-a-gradient-boosting/m-p/683241#M8417</link>
      <description>&lt;P&gt;I have a simple gradient boosting model (maximum branch = 2 , maximum depth = 1 {Adaboost} ) in e-miner(v 14.1) with binary target and mostly interval inputs(~500 variables). I will be choosing variables if the variable importance &amp;gt; 0.05 for both training and validation datasets. However, I am trying to understand the mathematics behind how the "variable importance" is calculated. I read the documentation (&lt;A href="https://documentation.sas.com/?docsetId=emref&amp;amp;docsetTarget=n0cx4ud03paymdn1kargegadueml.htm&amp;amp;docsetVersion=14.3&amp;amp;locale=en#" target="_self"&gt;decision tree variable importance&lt;/A&gt;&amp;nbsp;) but its very vague. I was wondering if anyone could shed light on how it is calculated with a simple example? It will be very helpful.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 11 Sep 2020 13:45:33 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-is-variable-importance-calculated-for-a-gradient-boosting/m-p/683241#M8417</guid>
      <dc:creator>PadmaroopaK</dc:creator>
      <dc:date>2020-09-11T13:45:33Z</dc:date>
    </item>
    <item>
      <title>Re: How is variable importance calculated for a gradient boosting node in e-miner?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-is-variable-importance-calculated-for-a-gradient-boosting/m-p/685128#M8434</link>
      <description>&lt;P&gt;&lt;EM&gt;feature importance for a single decision tree&lt;/EM&gt; - the amount that each attribute split point improves the performance measure, weighted by the number of observations the node is responsible for. The performance measure may be the purity (Gini index) used to select the split points, or another more specific error function.&lt;BR /&gt;&lt;EM&gt;overall feature importance&lt;/EM&gt; - feature importances averaged across all of the the decision trees within the model.&lt;/P&gt;</description>
      <pubDate>Sat, 19 Sep 2020 00:11:56 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-is-variable-importance-calculated-for-a-gradient-boosting/m-p/685128#M8434</guid>
      <dc:creator>pink_poodle</dc:creator>
      <dc:date>2020-09-19T00:11:56Z</dc:date>
    </item>
    <item>
      <title>Re: How is variable importance calculated for a gradient boosting node in e-miner?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-is-variable-importance-calculated-for-a-gradient-boosting/m-p/688430#M8453</link>
      <description>The Gradient Boosting node in SAS EM provides two approaches to evaluating the importance of a variable: split-based and observation-based. &lt;BR /&gt;The split-based approach uses the reduction in the sum of squares from splitting a node, summing over all nodes. &lt;BR /&gt;The observation-based approach uses the increase in a fit statistic due to seeing values of a variable uninformative. &lt;BR /&gt;Measures of variable importance generally underestimate the importance of correlated variables. &lt;BR /&gt;&lt;BR /&gt;Two correlated variables could make a similar contribution to a model. The total contribution is usually divided between them, and neither variable acquires the rank it deserves. &lt;BR /&gt;&lt;BR /&gt;Eliminating either variable generally increases the contribution attributed to the other.&lt;BR /&gt;</description>
      <pubDate>Fri, 02 Oct 2020 06:10:22 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-is-variable-importance-calculated-for-a-gradient-boosting/m-p/688430#M8453</guid>
      <dc:creator>gcjfernandez</dc:creator>
      <dc:date>2020-10-02T06:10:22Z</dc:date>
    </item>
    <item>
      <title>Re: How is variable importance calculated for a gradient boosting node in e-miner?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-is-variable-importance-calculated-for-a-gradient-boosting/m-p/688876#M8455</link>
      <description>Thank you for your response.&lt;BR /&gt;&lt;BR /&gt;I am looking at the split-based approach in my model. I find that reduction in sum of squares from the splitting node explanation a little abstract. Is there any SAS white paper or any way to see that actual calculation for atleast one variable? I am interested in seeing that back end calculation that produces those numbers.&lt;BR /&gt;&lt;BR /&gt;Thanks!</description>
      <pubDate>Mon, 05 Oct 2020 14:09:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-is-variable-importance-calculated-for-a-gradient-boosting/m-p/688876#M8455</guid>
      <dc:creator>PadmaroopaK</dc:creator>
      <dc:date>2020-10-05T14:09:43Z</dc:date>
    </item>
    <item>
      <title>Re: How is variable importance calculated for a gradient boosting node in e-miner?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-is-variable-importance-calculated-for-a-gradient-boosting/m-p/688878#M8456</link>
      <description>Is there a way for me to see this back end computation in e-miner?</description>
      <pubDate>Mon, 05 Oct 2020 14:11:07 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-is-variable-importance-calculated-for-a-gradient-boosting/m-p/688878#M8456</guid>
      <dc:creator>PadmaroopaK</dc:creator>
      <dc:date>2020-10-05T14:11:07Z</dc:date>
    </item>
    <item>
      <title>Re: How is variable importance calculated for a gradient boosting node in e-miner?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-is-variable-importance-calculated-for-a-gradient-boosting/m-p/688883#M8457</link>
      <description>Here are some formulas and background:&lt;BR /&gt;&lt;A href="https://documentation.sas.com/?docsetId=casml&amp;amp;docsetTarget=viyaml_treesplit_details02.htm&amp;amp;docsetVersion=3.0&amp;amp;locale=en" target="_blank"&gt;https://documentation.sas.com/?docsetId=casml&amp;amp;docsetTarget=viyaml_treesplit_details02.htm&amp;amp;docsetVersion=3.0&amp;amp;locale=en&lt;/A&gt;</description>
      <pubDate>Mon, 05 Oct 2020 14:24:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-is-variable-importance-calculated-for-a-gradient-boosting/m-p/688883#M8457</guid>
      <dc:creator>pink_poodle</dc:creator>
      <dc:date>2020-10-05T14:24:29Z</dc:date>
    </item>
    <item>
      <title>Re: How is variable importance calculated for a gradient boosting node in e-miner?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-is-variable-importance-calculated-for-a-gradient-boosting/m-p/688887#M8458</link>
      <description>&lt;P&gt;I&amp;nbsp; am attaching the screenshot from SAS Enterprise miner Reference documentation 14.3 where you can find the official computation description.&lt;/P&gt;</description>
      <pubDate>Mon, 05 Oct 2020 14:42:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-is-variable-importance-calculated-for-a-gradient-boosting/m-p/688887#M8458</guid>
      <dc:creator>gcjfernandez</dc:creator>
      <dc:date>2020-10-05T14:42:21Z</dc:date>
    </item>
  </channel>
</rss>

