<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How to get Test Data results of Ensemble of Models (Decision Trees etc.) in bagging technique? in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-Test-Data-results-of-Ensemble-of-Models-Decision/m-p/539447#M7694</link>
    <description>&lt;P&gt;I am working with a Train dataset (which I partitioned into 65% training data and 35% validation data to avoid overfitting) and a separate Test dataset. Both pass through the same model pipeline. Part of the process flow is shown below. (I have marked one unit of the ensemble, containing the subsamples and decision trees, to make it easier to follow.)&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="snapshot.jpg" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/27587i5D64610D011EE787/image-size/large?v=v2&amp;amp;px=999" role="button" title="snapshot.jpg" alt="snapshot.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;When I run this flow, I get results only for the training data, as shown below.&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="only_train.jpg" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/27588iD4C8CA33A8FCF5BE/image-size/large?v=v2&amp;amp;px=999" role="button" title="only_train.jpg" alt="only_train.jpg" /&gt;&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;I do not see any results for the validation or test data sets (neither the ROC curve nor the cumulative lift). 
I then tried connecting the nodes as follows:&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="modified.jpg" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/27589iAD7B08161E3F390A/image-size/large?v=v2&amp;amp;px=999" role="button" title="modified.jpg" alt="modified.jpg" /&gt;&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;This gave the following results, in which the train, validate and test results are all displayed.&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="ROC Chart _ train_period_workstation_purchas.jpg" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/27590i1E51BABE20EC1ADF/image-size/large?v=v2&amp;amp;px=999" role="button" title="ROC Chart _ train_period_workstation_purchas.jpg" alt="ROC Chart _ train_period_workstation_purchas.jpg" /&gt;&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;But I am not sure whether these results are correct for the validation and training data sets, because in the modified flow I connected the Impute node directly to the decision trees. I could not add Sample nodes for the test data set, and I am not able to change the source data for the Sample nodes (they automatically take the train data set as the source). Could anybody please help me connect the nodes properly so that I get results for the train, validation and test data? I am a beginner in SAS.&lt;BR /&gt;&lt;BR /&gt;Best regards,&lt;BR /&gt;Chandrima&lt;/P&gt;</description>
    <pubDate>Thu, 28 Feb 2019 18:53:37 GMT</pubDate>
    <dc:creator>Chandrima</dc:creator>
    <dc:date>2019-02-28T18:53:37Z</dc:date>
    <item>
      <title>How to get Test Data results of Ensemble of Models (Decision Trees etc.) in bagging technique?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-Test-Data-results-of-Ensemble-of-Models-Decision/m-p/539447#M7694</link>
      <description>&lt;P&gt;I am working with a Train dataset (which I partitioned into 65% training data and 35% validation data to avoid overfitting) and a separate Test dataset. Both pass through the same model pipeline. Part of the process flow is shown below. (I have marked one unit of the ensemble, containing the subsamples and decision trees, to make it easier to follow.)&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="snapshot.jpg" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/27587i5D64610D011EE787/image-size/large?v=v2&amp;amp;px=999" role="button" title="snapshot.jpg" alt="snapshot.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;When I run this flow, I get results only for the training data, as shown below.&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="only_train.jpg" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/27588iD4C8CA33A8FCF5BE/image-size/large?v=v2&amp;amp;px=999" role="button" title="only_train.jpg" alt="only_train.jpg" /&gt;&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;I do not see any results for the validation or test data sets (neither the ROC curve nor the cumulative lift). 
I then tried connecting the nodes as follows:&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="modified.jpg" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/27589iAD7B08161E3F390A/image-size/large?v=v2&amp;amp;px=999" role="button" title="modified.jpg" alt="modified.jpg" /&gt;&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;This gave the following results, in which the train, validate and test results are all displayed.&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="ROC Chart _ train_period_workstation_purchas.jpg" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/27590i1E51BABE20EC1ADF/image-size/large?v=v2&amp;amp;px=999" role="button" title="ROC Chart _ train_period_workstation_purchas.jpg" alt="ROC Chart _ train_period_workstation_purchas.jpg" /&gt;&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;But I am not sure whether these results are correct for the validation and training data sets, because in the modified flow I connected the Impute node directly to the decision trees. I could not add Sample nodes for the test data set, and I am not able to change the source data for the Sample nodes (they automatically take the train data set as the source). Could anybody please help me connect the nodes properly so that I get results for the train, validation and test data? I am a beginner in SAS.&lt;BR /&gt;&lt;BR /&gt;Best regards,&lt;BR /&gt;Chandrima&lt;/P&gt;</description>
      <pubDate>Thu, 28 Feb 2019 18:53:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-Test-Data-results-of-Ensemble-of-Models-Decision/m-p/539447#M7694</guid>
      <dc:creator>Chandrima</dc:creator>
      <dc:date>2019-02-28T18:53:37Z</dc:date>
    </item>
    <item>
      <title>Re: How to get Test Data results of Ensemble of Models (Decision Trees etc.) in bagging technique?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-Test-Data-results-of-Ensemble-of-Models-Decision/m-p/540515#M7704</link>
      <description>&lt;P&gt;I think the issue is with the Sample node. From the SAS Enterprise Miner (EM) reference help:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="verdana,geneva"&gt;The Sample node must be preceded by a node that exports at least one Raw, Train, Transaction, Document, Test, or Score data set. The Input Data node normally precedes the Sample node.&lt;FONT color="#0000FF"&gt; If there is more than one predecessor data set, then the Sample node automatically selects one of the data sets for sampling. The other predecessor data sets are not exported to successor nodes in the process flow.&lt;/FONT&gt;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="verdana,geneva"&gt;To partition the sample into training, validation, and test data sets, follow the Sample node with a Data Partition node. In general, any node can follow a Sample node.&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You can try using a Control Point node after the Sample node to combine the sampled training partition back with the validation and test partitions before modeling.&lt;/P&gt;
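&lt;P&gt;To illustrate the idea outside Enterprise Miner, here is a minimal Python sketch (not EM code, standard library only). It assumes hypothetical toy data and a one-split "stump" standing in for each decision tree; the names make_rows, fit_stump and ensemble_predict are made up for this example. Only the training partition is resampled for bagging, while the validation and test partitions are carried through unchanged and scored by the finished ensemble, which is roughly what rejoining the partitions before modeling should achieve.&lt;/P&gt;
&lt;PRE&gt;
```python
import random


def make_rows(n, rng):
    """Hypothetical toy data: one feature x, label 1 when x >= 0.5."""
    rows = []
    for _ in range(n):
        x = rng.random()
        rows.append((x, int(x >= 0.5)))
    return rows


def bootstrap(rows, rng):
    """Draw len(rows) rows with replacement (one bagging subsample)."""
    return [rng.choice(rows) for _ in rows]


def fit_stump(rows):
    """Pick the threshold that classifies the most rows correctly."""
    best_threshold, best_correct = None, -1
    for threshold, _ in rows:
        correct = sum(int(x >= threshold) == y for x, y in rows)
        if correct > best_correct:
            best_threshold, best_correct = threshold, correct
    return best_threshold


def ensemble_predict(thresholds, x):
    """Majority vote over all stumps in the ensemble."""
    votes = sum(int(x >= t) for t in thresholds)
    return int(2 * votes >= len(thresholds))


def accuracy(thresholds, rows):
    """Share of rows the ensemble labels correctly."""
    hits = sum(ensemble_predict(thresholds, x) == y for x, y in rows)
    return hits / len(rows)


rng = random.Random(0)
partitions = {
    "train": make_rows(65, rng),      # resampled for bagging
    "validate": make_rows(35, rng),   # never resampled, only scored
    "test": make_rows(50, rng),       # never resampled, only scored
}

# Bootstrap samples are drawn from the training partition only.
thresholds = [fit_stump(bootstrap(partitions["train"], rng)) for _ in range(25)]

# Every partition is still available, so each one can be assessed.
results = {name: accuracy(thresholds, rows) for name, rows in partitions.items()}
print(results)
```
&lt;/PRE&gt;
&lt;P&gt;Each partition gets its own accuracy because none of them was dropped along the way. The 65 and 35 row counts mirror the split in the question; the test size of 50 and the 25 stumps are arbitrary choices.&lt;/P&gt;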
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 05 Mar 2019 17:34:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-Test-Data-results-of-Ensemble-of-Models-Decision/m-p/540515#M7704</guid>
      <dc:creator>WendyCzika</dc:creator>
      <dc:date>2019-03-05T17:34:39Z</dc:date>
    </item>
    <item>
      <title>Re: How to get Test Data results of Ensemble of Models (Decision Trees etc.) in bagging technique?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-Test-Data-results-of-Ensemble-of-Models-Decision/m-p/540612#M7705</link>
      <description>&lt;P&gt;Thanks so much for your prompt solution. It really worked, and the results now come out fine.&lt;BR /&gt;&lt;BR /&gt;Best regards,&lt;BR /&gt;Chandrima&lt;/P&gt;</description>
      <pubDate>Tue, 05 Mar 2019 22:54:19 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-to-get-Test-Data-results-of-Ensemble-of-Models-Decision/m-p/540612#M7705</guid>
      <dc:creator>Chandrima</dc:creator>
      <dc:date>2019-03-05T22:54:19Z</dc:date>
    </item>
  </channel>
</rss>

