<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Variable importance in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Variable-importance/m-p/388150#M5817</link>
    <description>&lt;P&gt;The regression and tree-based models work in very different ways. &amp;nbsp;The regression builds a model that attempts to describe the relationship between the target and involves estimating parameters that are then multiplied by variable values to predict the outcome. Decision trees do not build a parametric model and must be evaluated differently. &amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;There are also many differences in how regression and decision tree models perform on different data sets. &amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; * Regression models do a better job of describing smoothly changing relationships than a decision tree which would need to create many cutpoints to accomplish the same.&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; * Decision trees handle missing values whereas regression models must have those values imputed meaning the two methods are not even working on the same training data when missing values are present in the original training data.&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; * Regression models require you to specify the exact functional form of the relationship whereas decision trees do not assume any specific relationship.&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; * Decision trees are (typically) far more flexible than regression models allowing them to automatically model complex interactions and non-linear relationships that would need to be described explicitly in a regression model.&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; * You cannot pick one of these methods as the superior method overall since it depends on the data you are fitting&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For all these reasons, you should not expect that the variable importance reported by a decision tree is going to parallel what you would see from the corresponding regression model fit to the same data. &amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;I hope this helps!&lt;BR /&gt;Doug&lt;/P&gt;</description>
    <pubDate>Tue, 15 Aug 2017 14:03:17 GMT</pubDate>
    <dc:creator>DougWielenga</dc:creator>
    <dc:date>2017-08-15T14:03:17Z</dc:date>
    <item>
      <title>Variable importance</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Variable-importance/m-p/83902#M560</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I can't reconcile the variable importance using decision trees and regression on sas on demand for academics 6.2 miner. In particular I'm&amp;nbsp; using the data PVA97NK. In otherwords, their ranking are not the same.&lt;/P&gt;&lt;P&gt;Any help is welcomed.&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 15 Apr 2012 00:00:00 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Variable-importance/m-p/83902#M560</guid>
      <dc:creator>Saszealot</dc:creator>
      <dc:date>2012-04-15T00:00:00Z</dc:date>
    </item>
    <item>
      <title>Re: Variable importance</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Variable-importance/m-p/388150#M5817</link>
      <description>&lt;P&gt;The regression and tree-based models work in very different ways. &amp;nbsp;The regression builds a model that attempts to describe the relationship between the target and involves estimating parameters that are then multiplied by variable values to predict the outcome. Decision trees do not build a parametric model and must be evaluated differently. &amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;There are also many differences in how regression and decision tree models perform on different data sets. &amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; * Regression models do a better job of describing smoothly changing relationships than a decision tree which would need to create many cutpoints to accomplish the same.&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; * Decision trees handle missing values whereas regression models must have those values imputed meaning the two methods are not even working on the same training data when missing values are present in the original training data.&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; * Regression models require you to specify the exact functional form of the relationship whereas decision trees do not assume any specific relationship.&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; * Decision trees are (typically) far more flexible than regression models allowing them to automatically model complex interactions and non-linear relationships that would need to be described explicitly in a regression model.&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; * You cannot pick one of these methods as the superior method overall since it depends on the data you are fitting&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For all these reasons, you should not expect that the variable importance reported by a decision tree is going to parallel what you would see from the corresponding regression model fit to the same data. &amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;I hope this helps!&lt;BR /&gt;Doug&lt;/P&gt;</description>
      <pubDate>Tue, 15 Aug 2017 14:03:17 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Variable-importance/m-p/388150#M5817</guid>
      <dc:creator>DougWielenga</dc:creator>
      <dc:date>2017-08-15T14:03:17Z</dc:date>
    </item>
  </channel>
</rss>

