<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic SAS Enterprise Miner - Decision Tree and Gradient Boosting Run issue in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/SAS-Enterprise-Miner-Decision-Tree-and-Gradient-Boosting-Run/m-p/134258#M1215</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hello, everyone&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Recently I started using SAS EM and applied Decision Tree/Gradient Boosting model to my data. The data's response rate is 1%. Anyhow, for some reason, the Decision Tree run was suspended for &lt;SPAN style="font-family: 'Arial','sans-serif'; color: #000066;"&gt;Run time error &lt;SPAN style="color: #000000;"&gt;and Gradient Boosting run results shows no lift, just ramdom distribution of data&lt;/SPAN&gt;.&amp;nbsp; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #000066;"&gt;After I oversampled the data and make the new sample's response rate to be 10% or more, the decision tree and gradient boosting run were successful. &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #000066;"&gt;My question is, is there any parameter in Decision Tree and Gradient Boosting, after I change it, they can run successfully when the sample data's response rate is small, say 1% or below? Thanks,&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #000066;"&gt;Jimmy&lt;/SPAN&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Wed, 15 May 2013 17:49:26 GMT</pubDate>
    <dc:creator>jcpenny2002</dc:creator>
    <dc:date>2013-05-15T17:49:26Z</dc:date>
    <item>
      <title>SAS Enterprise Miner - Decision Tree and Gradient Boosting Run issue</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/SAS-Enterprise-Miner-Decision-Tree-and-Gradient-Boosting-Run/m-p/134258#M1215</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hello, everyone&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Recently I started using SAS EM and applied Decision Tree/Gradient Boosting model to my data. The data's response rate is 1%. Anyhow, for some reason, the Decision Tree run was suspended for &lt;SPAN style="font-family: 'Arial','sans-serif'; color: #000066;"&gt;Run time error &lt;SPAN style="color: #000000;"&gt;and Gradient Boosting run results shows no lift, just ramdom distribution of data&lt;/SPAN&gt;.&amp;nbsp; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #000066;"&gt;After I oversampled the data and make the new sample's response rate to be 10% or more, the decision tree and gradient boosting run were successful. &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #000066;"&gt;My question is, is there any parameter in Decision Tree and Gradient Boosting, after I change it, they can run successfully when the sample data's response rate is small, say 1% or below? Thanks,&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #000066;"&gt;Jimmy&lt;/SPAN&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 15 May 2013 17:49:26 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/SAS-Enterprise-Miner-Decision-Tree-and-Gradient-Boosting-Run/m-p/134258#M1215</guid>
      <dc:creator>jcpenny2002</dc:creator>
      <dc:date>2013-05-15T17:49:26Z</dc:date>
    </item>
    <item>
      <title>Re: SAS Enterprise Miner - Decision Tree and Gradient Boosting Run issue</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/SAS-Enterprise-Miner-Decision-Tree-and-Gradient-Boosting-Run/m-p/388557#M5856</link>
      <description>&lt;P&gt;Your question is a common one and is discussed in part in SAS Note 47965 available at&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; &lt;A href="http://support.sas.com/kb/47/965.html" target="_self"&gt;http://support.sas.com/kb/47/965.html&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In general, it is always good to check running a Tree model if your Gradient Boosting node is not running since Gradient Boosting models. The strategies described in the note above will likely help you with your original data without oversampling. &amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;For Gradient Boosting, t&lt;SPAN&gt;ry the following:&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; - lowering the Minimum Categorical Size property to 2 or 3&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; - changing the Missing Values property&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;To investigate further, open the Gradient Boosting node results and click on&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;&lt;STRONG&gt;View&lt;/STRONG&gt;&amp;nbsp;--&amp;gt;&amp;nbsp;&lt;STRONG&gt;SAS Results&lt;/STRONG&gt;&amp;nbsp;--&amp;gt;&amp;nbsp;&lt;STRONG&gt;Log&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;to view the log from your Gradient Boosting node and look for notes similar to the following:&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;...&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;NOTE: Will not search for split on variable A.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;NOTE: Too few acceptable cases.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;NOTE: Option MINCATSIZE=5 may apply.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;NOTE: Will not search for split on variable B.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;NOTE: Too few acceptable cases.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;NOTE: Option MINCATSIZE=5 may apply&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;....&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;We have seen that message appear when some of the samples had an insufficient number of events and non-events and Gradient Boosting was unable to iterate.&amp;nbsp;&amp;nbsp;Sampling is used at different points to determine split values, and then the model is fit to the whole data set.&amp;nbsp;&amp;nbsp; If there are not enough events, then SAS Enterprise Miner cannot determine where the splits should occur.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Are other models able to run such as a regression model or decision tree? &amp;nbsp;If so, examine your regression results to see whether there are many near-zero standard errors. &amp;nbsp;Some customers have the opposite problem - infinite standard errors. &amp;nbsp;For more information about this problem, please review &amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp;&lt;STRONG&gt;Usage Note 22599: Understanding and correcting complete or quasi-complete separation problems&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;BR /&gt;&amp;nbsp; &amp;nbsp;&lt;A href="http://support.sas.com/kb/22/599.html" target="new" rel="nofollow noopener noreferrer noopener noreferrer"&gt;http://support.sas.com/kb/22/599.html&lt;/A&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;I hope this helps!&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Doug&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 16 Aug 2017 18:22:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/SAS-Enterprise-Miner-Decision-Tree-and-Gradient-Boosting-Run/m-p/388557#M5856</guid>
      <dc:creator>DougWielenga</dc:creator>
      <dc:date>2017-08-16T18:22:10Z</dc:date>
    </item>
  </channel>
</rss>

