<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Decision Trees, HPSplit -- Pruning May Be Giving Away Valuable Info? in SAS Procedures</title>
    <link>https://communities.sas.com/t5/SAS-Procedures/Decision-Trees-HPSplit-Pruning-May-Be-Giving-Away-Valuable-Info/m-p/628539#M77550</link>
    <description>&lt;P&gt;Wondering if pruning gives up too much valuable information.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Pruning.jpg" style="width: 305px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/36479i5DBA79C05C918CBB/image-size/large?v=v2&amp;amp;px=999" role="button" title="Pruning.jpg" alt="Pruning.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Yes, the model becomes smaller, and supposedly more understandable.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;But all those tips being whacked contain at least pairs of variables that ought to be acknowledged.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;What are your thoughts on this conundrum?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Nicholas Kormanik&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Sun, 01 Mar 2020 10:01:55 GMT</pubDate>
    <dc:creator>NKormanik</dc:creator>
    <dc:date>2020-03-01T10:01:55Z</dc:date>
    <item>
      <title>Decision Trees, HPSplit -- Pruning May Be Giving Away Valuable Info?</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Decision-Trees-HPSplit-Pruning-May-Be-Giving-Away-Valuable-Info/m-p/628539#M77550</link>
      <description>&lt;P&gt;Wondering if pruning gives up too much valuable information.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Pruning.jpg" style="width: 305px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/36479i5DBA79C05C918CBB/image-size/large?v=v2&amp;amp;px=999" role="button" title="Pruning.jpg" alt="Pruning.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Yes, the model becomes smaller, and supposedly more understandable.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;But all those tips being whacked contain at least pairs of variables that ought to be acknowledged.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;What are your thoughts on this conundrum?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Nicholas Kormanik&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 01 Mar 2020 10:01:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Decision-Trees-HPSplit-Pruning-May-Be-Giving-Away-Valuable-Info/m-p/628539#M77550</guid>
      <dc:creator>NKormanik</dc:creator>
      <dc:date>2020-03-01T10:01:55Z</dc:date>
    </item>
    <item>
      <title>Re: Decision Trees, HPSplit -- Pruning May Be Giving Away Valuable Info?</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Decision-Trees-HPSplit-Pruning-May-Be-Giving-Away-Valuable-Info/m-p/628547#M77551</link>
      <description>&lt;P&gt;This is true in any modeling, the larger the model (in this case the more branches), the better it will fit (apparently).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;But there is also a concept called overfitting, which would usually lead to smaller models, as overfitting is not good and essentially is fitting noise. Those extra branches that should get pruned may in fact be overfitting. How can you tell if something is overfit? Usually, by either crossvalidation, or by fitting the model to a training data set, and then evaluating its performance on validation (and test) data sets&lt;/P&gt;</description>
      <pubDate>Sun, 01 Mar 2020 11:48:02 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Decision-Trees-HPSplit-Pruning-May-Be-Giving-Away-Valuable-Info/m-p/628547#M77551</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2020-03-01T11:48:02Z</dc:date>
    </item>
    <item>
      <title>Re: Decision Trees, HPSplit -- Pruning May Be Giving Away Valuable Info?</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Decision-Trees-HPSplit-Pruning-May-Be-Giving-Away-Valuable-Info/m-p/628595#M77558</link>
      <description>To be a contrarian, you can also be getting rid of the noise.</description>
      <pubDate>Sun, 01 Mar 2020 22:23:40 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Decision-Trees-HPSplit-Pruning-May-Be-Giving-Away-Valuable-Info/m-p/628595#M77558</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2020-03-01T22:23:40Z</dc:date>
    </item>
  </channel>
</rss>

