<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Seeming Problem with HPSPLIT in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Seeming-Problem-with-HPSPLIT/m-p/384104#M19983</link>
    <description>&lt;P&gt;I thought training data was used to train/validate the model but TEST data was used to determine predictive ability. Training data can allow for over fitting which is why it's a three ways split for data, Training, Validation and Test Data. The Validation data is used for model selection so if it changes, it may change the model selected.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;But you'd probably wait for a SAS rep to answer your question, my experience with EM is limited &lt;span class="lia-unicode-emoji" title=":winking_face:"&gt;😉&lt;/span&gt;&lt;/P&gt;</description>
    <pubDate>Sun, 30 Jul 2017 21:16:13 GMT</pubDate>
    <dc:creator>Reeza</dc:creator>
    <dc:date>2017-07-30T21:16:13Z</dc:date>
    <item>
      <title>Seeming Problem with HPSPLIT</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Seeming-Problem-with-HPSPLIT/m-p/384084#M19982</link>
      <description>&lt;P&gt;Without going into too much detail, I want to say that I've encountered what seems to be a problem with HPSPLIT.&amp;nbsp; I first ran this procedure using a dataset that was divided (using variable "divide") into a training subsample (divide = 1) and a validation subsample (divide = 0).&amp;nbsp; I included the statement:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; partition rolevar=divide(TRAIN='1' VALIDATE='0');&lt;/P&gt;&lt;P&gt;which is supposed to tell SAS to using the training data to estimate a classification tree and the validation data to validate it.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;To check the results I got, I created a new dataset.&amp;nbsp; I made a new data set containing only the training data.&amp;nbsp; I did this by using&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; if divide = 1;&lt;/P&gt;&lt;P&gt;to subsample the original large data.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;When I ran HPSPLIT on just the training data alone (and without the "partition" statement), I got a different tree.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Why should the absence of the validation data in my second run of HPSPLIT affect the results?&amp;nbsp; It does not seem right.&amp;nbsp; I expected to get the same tree both ways.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Dennis H.&lt;/P&gt;</description>
      <pubDate>Sun, 30 Jul 2017 16:44:03 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Seeming-Problem-with-HPSPLIT/m-p/384084#M19982</guid>
      <dc:creator>oakHILLS68</dc:creator>
      <dc:date>2017-07-30T16:44:03Z</dc:date>
    </item>
    <item>
      <title>Re: Seeming Problem with HPSPLIT</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Seeming-Problem-with-HPSPLIT/m-p/384104#M19983</link>
      <description>&lt;P&gt;I thought training data was used to train/validate the model but TEST data was used to determine predictive ability. Training data can allow for over fitting which is why it's a three ways split for data, Training, Validation and Test Data. The Validation data is used for model selection so if it changes, it may change the model selected.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;But you'd probably wait for a SAS rep to answer your question, my experience with EM is limited &lt;span class="lia-unicode-emoji" title=":winking_face:"&gt;😉&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 30 Jul 2017 21:16:13 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Seeming-Problem-with-HPSPLIT/m-p/384104#M19983</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2017-07-30T21:16:13Z</dc:date>
    </item>
  </channel>
</rss>

