<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: [Miner] When should I do data partition? in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Miner-When-should-I-do-data-partition/m-p/310215#M4666</link>
    <description>&lt;P&gt;It doesn't actually matter. If the variable ends up being used in the final model, the final scoring code will account for it.&lt;/P&gt;
&lt;P&gt;This is one of the nice features of SAS EM: it can replicate the process from the start, including variable transformations.&lt;/P&gt;</description>
    <pubDate>Tue, 08 Nov 2016 19:32:55 GMT</pubDate>
    <dc:creator>Reeza</dc:creator>
    <dc:date>2016-11-08T19:32:55Z</dc:date>
    <item>
      <title>[Miner] When should I do data partition?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Miner-When-should-I-do-data-partition/m-p/310122#M4662</link>
      <description>&lt;P&gt;Hello everyone, I am a student learning SAS Miner for the first time this semester. I am working on a team project and I want to create a new variable from the original data source. I plan to use the Transform Variables node to enter SAS code that adds the variable&lt;/P&gt;&lt;P&gt;Net_Gain = Capital_Gain - Capital Lost.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#FF0000"&gt;Should I place the Data Partition node before or after the Transform Variables node?&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1. If Data Partition --&amp;gt; Transform Variables: will the new variable be created only for the training set?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;2. If Transform Variables --&amp;gt; Data Partition: does that mean the new variable is created for the whole data set? Will this affect scoring accuracy, given that the score data doesn't have the Net_Gain variable?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you very much,&lt;/P&gt;&lt;BR /&gt;&lt;IMG src="https://communities.sas.com/t5/image/serverpage/image-id/13117i9FA923510659B8CA/image-size/large?v=1.0&amp;amp;px=600" border="0" alt="1.png" title="1.png" /&gt;</description>
      <pubDate>Tue, 08 Nov 2016 15:21:27 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Miner-When-should-I-do-data-partition/m-p/310122#M4662</guid>
      <dc:creator>ADChau</dc:creator>
      <dc:date>2016-11-08T15:21:27Z</dc:date>
    </item>
    <item>
      <title>Re: [Miner] When should I do data partition?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Miner-When-should-I-do-data-partition/m-p/310215#M4666</link>
      <description>&lt;P&gt;It doesn't actually matter. If the variable ends up being used in the final model, the final scoring code will account for it.&lt;/P&gt;
&lt;P&gt;This is one of the nice features of SAS EM: it can replicate the process from the start, including variable transformations.&lt;/P&gt;</description>
      <pubDate>Tue, 08 Nov 2016 19:32:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Miner-When-should-I-do-data-partition/m-p/310215#M4666</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2016-11-08T19:32:55Z</dc:date>
    </item>
  </channel>
</rss>

