<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Gradient boosting does not produce any output, but random forest and neural network does in EM in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Gradient-boosting-does-not-produce-any-output-but-random-forest/m-p/386986#M5734</link>
    <description>&lt;P&gt;You appear to have some extremely small categories. &amp;nbsp;Open the Gradient Boosting node results and click on&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;&lt;STRONG&gt;View&lt;/STRONG&gt; --&amp;gt; &lt;STRONG&gt;SAS Results&lt;/STRONG&gt; --&amp;gt; &lt;STRONG&gt;Log&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;to view the log from your Gradient Boosting node and look for notes similar to the following:&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;...&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;NOTE: Will not search for split on variable A.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;NOTE: Too few acceptable cases.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;NOTE: Option MINCATSIZE=5 may apply.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;NOTE: Will not search for split on variable B.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;NOTE: Too few acceptable cases.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;NOTE: Option MINCATSIZE=5 may apply&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;....&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;We have seen that message appear when some of the samples had an insufficient number of events and non-events and Gradient Boosting was unable to iterate.&amp;nbsp;&amp;nbsp;Sampling is used at different points to determine split values, and then the model is fit to the whole data set.&amp;nbsp;&amp;nbsp; If there are not enough events, then SAS Enterprise Miner cannot determine where the splits should occur.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;You mention that other models are able to run - examine your regression results to see whether there are many near-zero standard errors.&amp;nbsp;&amp;nbsp;Those values support the suggestion that there is not enough variability in the covariates with respect to the number of events and non-events.&amp;nbsp;&amp;nbsp;Some customers have the opposite problem - infinite standard errors. &amp;nbsp;For more information about this problem, please review &amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp;&lt;STRONG&gt;Usage Note 22599: Understanding and correcting complete or quasi-complete separation problems&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;BR /&gt;&amp;nbsp; &amp;nbsp;&lt;A href="http://support.sas.com/kb/22/599.html" target="new"&gt;http://support.sas.com/kb/22/599.html&lt;/A&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Another possibility is that there are too many missing values in the data, or that the&amp;nbsp;missing values are distributed in such a way that no splits can be found with the existing settings. &amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;Some options, depending on what you think is appropriate, might include:&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; - lowering the Minimum Categorical Size property to 2 or 3&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; - changing the Missing Values property&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; - lowering the Leaf Fraction property value&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;I hope this helps!&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Doug&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 10 Aug 2017 15:10:30 GMT</pubDate>
    <dc:creator>DougWielenga</dc:creator>
    <dc:date>2017-08-10T15:10:30Z</dc:date>
    <item>
      <title>Gradient boosting does not produce any output, but random forest and neural network does in EM</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Gradient-boosting-does-not-produce-any-output-but-random-forest/m-p/266026#M3945</link>
      <description>&lt;P&gt;I have a dataset that has event rate like:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;'1':&amp;nbsp; 5.14%&lt;/P&gt;
&lt;P&gt;'2': 2.92%&lt;/P&gt;
&lt;P&gt;'3': 3.68%&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I used decision nodes with inverse priors. Random forest and neural network are able to produce the output, but not Gradient boosting.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I changed the default to: shrinkage:0.01, leaf fraction: 0.05, depth: 5, branches: 3&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;But still no luck.&lt;/P&gt;</description>
      <pubDate>Mon, 25 Apr 2016 13:06:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Gradient-boosting-does-not-produce-any-output-but-random-forest/m-p/266026#M3945</guid>
      <dc:creator>munitech4u</dc:creator>
      <dc:date>2016-04-25T13:06:36Z</dc:date>
    </item>
    <item>
      <title>Re: Gradient boosting does not produce any output, but random forest and neural network does in EM</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Gradient-boosting-does-not-produce-any-output-but-random-forest/m-p/386986#M5734</link>
      <description>&lt;P&gt;You appear to have some extremely small categories. &amp;nbsp;Open the Gradient Boosting node results and click on&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;&lt;STRONG&gt;View&lt;/STRONG&gt; --&amp;gt; &lt;STRONG&gt;SAS Results&lt;/STRONG&gt; --&amp;gt; &lt;STRONG&gt;Log&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;to view the log from your Gradient Boosting node and look for notes similar to the following:&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;...&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;NOTE: Will not search for split on variable A.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;NOTE: Too few acceptable cases.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;NOTE: Option MINCATSIZE=5 may apply.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;NOTE: Will not search for split on variable B.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;NOTE: Too few acceptable cases.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;NOTE: Option MINCATSIZE=5 may apply&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;....&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;We have seen that message appear when some of the samples had an insufficient number of events and non-events and Gradient Boosting was unable to iterate.&amp;nbsp;&amp;nbsp;Sampling is used at different points to determine split values, and then the model is fit to the whole data set.&amp;nbsp;&amp;nbsp; If there are not enough events, then SAS Enterprise Miner cannot determine where the splits should occur.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;You mention that other models are able to run - examine your regression results to see whether there are many near-zero standard errors.&amp;nbsp;&amp;nbsp;Those values support the suggestion that there is not enough variability in the covariates with respect to the number of events and non-events.&amp;nbsp;&amp;nbsp;Some customers have the opposite problem - infinite standard errors. &amp;nbsp;For more information about this problem, please review &amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp;&lt;STRONG&gt;Usage Note 22599: Understanding and correcting complete or quasi-complete separation problems&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;BR /&gt;&amp;nbsp; &amp;nbsp;&lt;A href="http://support.sas.com/kb/22/599.html" target="new"&gt;http://support.sas.com/kb/22/599.html&lt;/A&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Another possibility is that there are too many missing values in the data, or that the&amp;nbsp;missing values are distributed in such a way that no splits can be found with the existing settings. &amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;Some options, depending on what you think is appropriate, might include:&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; - lowering the Minimum Categorical Size property to 2 or 3&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; - changing the Missing Values property&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; - lowering the Leaf Fraction property value&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;I hope this helps!&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Doug&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 10 Aug 2017 15:10:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Gradient-boosting-does-not-produce-any-output-but-random-forest/m-p/386986#M5734</guid>
      <dc:creator>DougWielenga</dc:creator>
      <dc:date>2017-08-10T15:10:30Z</dc:date>
    </item>
  </channel>
</rss>

