<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic SAS DataMiner- Ensemble node in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/SAS-DataMiner-Ensemble-node/m-p/206302#M2796</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Greetings everybody,&lt;/P&gt;&lt;P&gt;I'm working on small academic project with Sas DataMiner and I noticed that while I use Ensemble Node to merge results of 4 different classifiers I get "weaker" results than separate classifiers.&lt;/P&gt;&lt;P&gt;My diagram looks like this:&lt;/P&gt;&lt;P&gt;&lt;IMG alt="Ensemble Node.png" class="jive-image-thumbnail jive-image" src="https://communities.sas.com/legacyfs/online/10856_Ensemble Node.png" width="450" /&gt;&lt;/P&gt;&lt;P&gt;The problem is visible on ROC diagram:&lt;/P&gt;&lt;P&gt;&lt;IMG alt="Weaker results.png" class="jive-image-thumbnail jive-image" src="https://communities.sas.com/legacyfs/online/10857_Weaker results.png" width="450" /&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Any ideas? I read that &lt;/P&gt;&lt;P&gt;"It is important to note that the ensemble model that is created from either approach can be more accurate than the individual models only if the individual models differ."&lt;/P&gt;&lt;P&gt;Is it connected with my problem? And why?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Mon, 15 Jun 2015 13:28:15 GMT</pubDate>
    <dc:creator>Vardens</dc:creator>
    <dc:date>2015-06-15T13:28:15Z</dc:date>
    <item>
      <title>SAS DataMiner- Ensemble node</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/SAS-DataMiner-Ensemble-node/m-p/206302#M2796</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Greetings everybody,&lt;/P&gt;&lt;P&gt;I'm working on small academic project with Sas DataMiner and I noticed that while I use Ensemble Node to merge results of 4 different classifiers I get "weaker" results than separate classifiers.&lt;/P&gt;&lt;P&gt;My diagram looks like this:&lt;/P&gt;&lt;P&gt;&lt;IMG alt="Ensemble Node.png" class="jive-image-thumbnail jive-image" src="https://communities.sas.com/legacyfs/online/10856_Ensemble Node.png" width="450" /&gt;&lt;/P&gt;&lt;P&gt;The problem is visible on ROC diagram:&lt;/P&gt;&lt;P&gt;&lt;IMG alt="Weaker results.png" class="jive-image-thumbnail jive-image" src="https://communities.sas.com/legacyfs/online/10857_Weaker results.png" width="450" /&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Any ideas? I read that &lt;/P&gt;&lt;P&gt;"It is important to note that the ensemble model that is created from either approach can be more accurate than the individual models only if the individual models differ."&lt;/P&gt;&lt;P&gt;Is it connected with my problem? And why?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 15 Jun 2015 13:28:15 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/SAS-DataMiner-Ensemble-node/m-p/206302#M2796</guid>
      <dc:creator>Vardens</dc:creator>
      <dc:date>2015-06-15T13:28:15Z</dc:date>
    </item>
    <item>
      <title>Re: SAS DataMiner- Ensemble node</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/SAS-DataMiner-Ensemble-node/m-p/206303#M2797</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi Vardens,&lt;/P&gt;&lt;P&gt;Unfortunately you can't know if your Ensemble is going to be better than your models until you try it.&lt;/P&gt;&lt;P&gt;From your plot, it looks like your Ensemble is overfitted. A quick suggestion, connect all the models and the ensemble to the model comparison. Try to identify if there is a model that could be throwing off the Ensemble, and re-run the ensemble node without that model.&lt;/P&gt;&lt;P&gt;Another alternative, do ensembles of 4, 3, and 2 models. I would not try every single combination, but models that have good fit statistics and might be discordant. Not sure if there is a statistical way to test discordance, I usually do try and error.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Let us know how it went!&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Miguel&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 15 Jun 2015 14:20:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/SAS-DataMiner-Ensemble-node/m-p/206303#M2797</guid>
      <dc:creator>M_Maldonado</dc:creator>
      <dc:date>2015-06-15T14:20:31Z</dc:date>
    </item>
    <item>
      <title>Re: SAS DataMiner- Ensemble node</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/SAS-DataMiner-Ensemble-node/m-p/206304#M2798</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Is Ensemble also learning? &lt;/P&gt;&lt;P&gt;I thought it just count the votes:&lt;/P&gt;&lt;P&gt;Let's say I have 4 models: three votes 'yes',, last one 'no'. Shouldn't it answet 'yes'?&lt;/P&gt;&lt;P&gt;And what happends if I have 2x 'yes' and twice 'no'?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 15 Jun 2015 15:04:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/SAS-DataMiner-Ensemble-node/m-p/206304#M2798</guid>
      <dc:creator>Vardens</dc:creator>
      <dc:date>2015-06-15T15:04:21Z</dc:date>
    </item>
    <item>
      <title>Re: SAS DataMiner- Ensemble node</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/SAS-DataMiner-Ensemble-node/m-p/206305#M2799</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;By default the Ensemble node averages the predicted probabilities of your models.&lt;/P&gt;&lt;P&gt;If you have a class target, you can specify Posterior Probability as Voting. Voting can be done in two ways:&lt;/P&gt;&lt;P&gt;-average. The posterior probabilities of an event are averaged, and the event with higher average predicted probabilities is selected.&lt;/P&gt;&lt;P&gt;-proportion. The proportion of predicted events is selected. Priority is given to the descending level.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;To answer your specific question, consider this example. For a given observation these are the posterior probabilities of four models for the levels Yes and No:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;TABLE border="1" class="jiveBorder" height="124" style="border: 1px solid #000000; width: 433px; height: 114px;"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TH style="text-align: center; color: #ffffff; padding: 2px; background-color: #6690bc;" valign="middle"&gt;Model&lt;/TH&gt;&lt;TH style="text-align: center; background-color: #6690bc; color: #ffffff; padding: 2px;" valign="middle"&gt;&lt;STRONG&gt;Prob of Yes&lt;/STRONG&gt;&lt;/TH&gt;&lt;TH style="text-align: center; background-color: #6690bc; color: #ffffff; padding: 2px;" valign="middle"&gt;&lt;STRONG&gt;Prob of No&lt;/STRONG&gt;&lt;/TH&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD style="padding: 2px;"&gt;Model 1&lt;/TD&gt;&lt;TD style="padding: 2px;"&gt;0.6&lt;/TD&gt;&lt;TD style="padding: 2px;"&gt;0.4&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD style="padding: 2px;"&gt;Model 2&lt;/TD&gt;&lt;TD style="padding: 2px;"&gt;0.7&lt;/TD&gt;&lt;TD style="padding: 2px;"&gt;0.3&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD style="padding: 2px;"&gt;Model 3&lt;/TD&gt;&lt;TD style="padding: 2px;"&gt;0.1&lt;/TD&gt;&lt;TD style="padding: 2px;"&gt;0.9&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD style="padding: 2px;"&gt;Model 4&lt;/TD&gt;&lt;TD style="padding: 2px;"&gt;0.15&lt;/TD&gt;&lt;TD style="padding: 2px;"&gt;0.85&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Ensemble Voting by Average&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;The average posterior probabilities are 0.3875 for Yes and 0.6125 for No. For this example, the predicted level would be No.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Ensemble Voting by Proportion&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;This ensemble assigns a predicted probability of 0.5 since two out of four models predict each level. In this tie case, the event Yes is given priority because targets are formatted with descending order by default in Enterprise Miner.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I hope this helps!&lt;/P&gt;&lt;P&gt;Miguel&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 15 Jun 2015 18:51:24 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/SAS-DataMiner-Ensemble-node/m-p/206305#M2799</guid>
      <dc:creator>M_Maldonado</dc:creator>
      <dc:date>2015-06-15T18:51:24Z</dc:date>
    </item>
    <item>
      <title>Re: SAS DataMiner- Ensemble node</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/SAS-DataMiner-Ensemble-node/m-p/206306#M2800</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thank you. It helped me a lot&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 16 Jun 2015 19:13:16 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/SAS-DataMiner-Ensemble-node/m-p/206306#M2800</guid>
      <dc:creator>Vardens</dc:creator>
      <dc:date>2015-06-16T19:13:16Z</dc:date>
    </item>
    <item>
      <title>Re: SAS DataMiner- Ensemble node</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/SAS-DataMiner-Ensemble-node/m-p/206307#M2801</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Dear Miguel,&lt;/P&gt;&lt;P&gt;&lt;A __default_attr="807455" __jive_macro_name="user" class="jive_macro jive_macro_user" data-objecttype="3" href="https://communities.sas.com/"&gt;&lt;/A&gt;&lt;/P&gt;&lt;P&gt;I don't understand the second option (proportion). How the ensemble will select yes class.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 17 Jun 2015 12:01:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/SAS-DataMiner-Ensemble-node/m-p/206307#M2801</guid>
      <dc:creator>husseinmazaar</dc:creator>
      <dc:date>2015-06-17T12:01:37Z</dc:date>
    </item>
    <item>
      <title>Re: SAS DataMiner- Ensemble node</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/SAS-DataMiner-Ensemble-node/m-p/206308#M2802</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;For that specific example, 2 out of 4 models select yes, and the other 2 out of 4 select no. For this tie case, where both have 50 % voted probability, Yes is selected because descending levels (alphabetically descending) are given priority by default. You can change this order in the metadata definition.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 17 Jun 2015 12:55:40 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/SAS-DataMiner-Ensemble-node/m-p/206308#M2802</guid>
      <dc:creator>M_Maldonado</dc:creator>
      <dc:date>2015-06-17T12:55:40Z</dc:date>
    </item>
  </channel>
</rss>

