<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Question about Gradient Boosting on large dataset_SAS EM in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Question-about-Gradient-Boosting-on-large-dataset-SAS-EM/m-p/432174#M6624</link>
    <description>&lt;P&gt;Hello&amp;nbsp;YG1992 -&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;A first step is to check whether either of these notes is relevant to your situation when your AUC is 0.5.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="http://support.sas.com/kb/61/607.html" target="_blank"&gt;61607 - Gradient Boosting finds no splits, "Will not search for split .. too few acceptable cases" notes are displayed in the node log&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="http://support.sas.com/kb/57/674.html" target="_self"&gt;57674 - A "no chart data" message is displayed in an empty plot, or "too few acceptable cases .. will not search for split on variable ." notes occur&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Have a great day.&lt;/P&gt;</description>
    <pubDate>Tue, 30 Jan 2018 13:08:18 GMT</pubDate>
    <dc:creator>MikeStockstill</dc:creator>
    <dc:date>2018-01-30T13:08:18Z</dc:date>
    <item>
      <title>Question about Gradient Boosting on large dataset_SAS EM</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Question-about-Gradient-Boosting-on-large-dataset-SAS-EM/m-p/432117#M6623</link>
      <description>&lt;P&gt;Hi everyone,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have several large populations with millions of observations, and my task is two-class classification. If I apply gradient boosting directly to the whole dataset (train:validate = 70:30), then I &lt;FONT color="#ff0000"&gt;always get an AUC of 0.5 and always the same predicted probabilities of class 1 and 2 for each observation&lt;/FONT&gt;; if I draw a sample of 100k or 200k first and apply gradient boosting with the same hyper-parameter settings, the results are relatively normal, with some AUCs higher than 0.5 and different probabilities of class 1 and 2 for each observation.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I would like to ask the SAS EM programmers here: could you please explain this situation? I guess the algorithm just stops updating any parameters at the very beginning, but I don't know the exact reason. Last but not least: there is &lt;FONT color="#ff0000"&gt;no error message&lt;/FONT&gt; when running GBDT on either the large or the small datasets.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you very much.&lt;/P&gt;</description>
      <pubDate>Tue, 30 Jan 2018 09:29:03 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Question-about-Gradient-Boosting-on-large-dataset-SAS-EM/m-p/432117#M6623</guid>
      <dc:creator>YG1992</dc:creator>
      <dc:date>2018-01-30T09:29:03Z</dc:date>
    </item>
    <item>
      <title>Re: Question about Gradient Boosting on large dataset_SAS EM</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Question-about-Gradient-Boosting-on-large-dataset-SAS-EM/m-p/432174#M6624</link>
      <description>&lt;P&gt;Hello&amp;nbsp;YG1992 -&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;A first step is to check whether either of these notes is relevant to your situation when your AUC is 0.5.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="http://support.sas.com/kb/61/607.html" target="_blank"&gt;61607 - Gradient Boosting finds no splits, "Will not search for split .. too few acceptable cases" notes are displayed in the node log&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="http://support.sas.com/kb/57/674.html" target="_self"&gt;57674 - A "no chart data" message is displayed in an empty plot, or "too few acceptable cases .. will not search for split on variable ." notes occur&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Have a great day.&lt;/P&gt;</description>
      <pubDate>Tue, 30 Jan 2018 13:08:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Question-about-Gradient-Boosting-on-large-dataset-SAS-EM/m-p/432174#M6624</guid>
      <dc:creator>MikeStockstill</dc:creator>
      <dc:date>2018-01-30T13:08:18Z</dc:date>
    </item>
  </channel>
</rss>

