<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Enterprise Miner  - dataset splits - manual steps in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Enterprise-Miner-dataset-splits-manual-steps/m-p/426943#M6548</link>
    <description>&lt;P&gt;Hello&amp;nbsp;elsolo21 -&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;gt;&amp;nbsp;Do the majority of the work in a SAS code node - I have done this but I don't know how to export these 3 sets out to be able to use it with the rest of the model. Everything is 'stuck' in the workspace.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Since you have already completed the majority of the work in a SAS Code node, you can use these SAS Code node macro variables to create the three data sources:&amp;nbsp;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;amp;EM_EXPORT_TRAIN&lt;BR /&gt;&amp;nbsp; &amp;amp;EM_EXPORT_VALIDATE&lt;BR /&gt;&amp;nbsp; &amp;amp;EM_EXPORT_TEST&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P&gt;Those choices are displayed in the SAS Code node Code Editor window.&amp;nbsp; Click the Macro Variables subtab, and scroll down to the Exports section.&amp;nbsp; You can click-and-drag each choice down into the Training Code section, and the ampersand (&amp;amp;) is automatically added for you.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Example 1 -&amp;nbsp; if you have one big data set and want to break it into three data sources, then try this code:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp;data&amp;nbsp;&amp;amp;EM_EXPORT_TRAIN&amp;nbsp; &amp;nbsp;&amp;amp;EM_EXPORT_VALIDATE&amp;nbsp; &amp;nbsp; &amp;amp;EM_EXPORT_TEST;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; set mybigdata;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; if &amp;lt;condition 1 is true&amp;gt; then output&amp;nbsp;&lt;SPAN&gt;&amp;amp;EM_EXPORT_TRAIN;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; else if &amp;lt;condition 2 is true&amp;gt; then output&amp;nbsp;&amp;amp;EM_EXPORT_VALIDATE;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; else if &amp;lt;condition 3 is true&amp;gt; then output&amp;nbsp;&amp;amp;EM_EXPORT_TEST;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; run;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp;Example 2 - if you have three data sets that already satisfy each condition, then try this code:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; data &amp;amp;EM_EXPORT_TRAIN;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;set mycondition1data;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;run;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; data&amp;nbsp;&amp;nbsp;&amp;amp;EM_EXPORT_VALIDATE;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;set mycondition2data;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;run;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; data&amp;nbsp;&amp;amp;EM_EXPORT_TEST;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;set mycondition3data;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;run;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Connect from that SAS Code node to your modeling nodes, and they should have access to the three data sources.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Have a great day.&lt;/P&gt;</description>
    <pubDate>Thu, 11 Jan 2018 18:14:23 GMT</pubDate>
    <dc:creator>MikeStockstill</dc:creator>
    <dc:date>2018-01-11T18:14:23Z</dc:date>
    <item>
      <title>Enterprise Miner  - dataset splits - manual steps</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Enterprise-Miner-dataset-splits-manual-steps/m-p/426923#M6546</link>
      <description>&lt;P&gt;Hi, I have a dataset with an 'ID' column that isn't quite unique (there's one for each year).&amp;nbsp; I need to find the unique values, split the dataset b/w train/validate/test and then bring in all the associated columns back in for all 3.&amp;nbsp; I've tried two methods but have gotten stuck with both:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1.&amp;nbsp; Do the majority of the work in a SAS code node - I have done this but I don't know how to export these 3 sets out to be able to use it with the rest of the model.&amp;nbsp; Everything is 'stuck' in the workspace.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;2.&amp;nbsp; Create a data source of just the unique ID's then use a data partition node for the 3 dataset.&amp;nbsp; Then&amp;nbsp;use a merge node with the original complete dataset and the data partition node. This only merges the training set, however, at least from what I can tell.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;There's probably a much more intuitive option 3 I'm not thinking of.&amp;nbsp; Thanks!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 11 Jan 2018 17:06:33 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Enterprise-Miner-dataset-splits-manual-steps/m-p/426923#M6546</guid>
      <dc:creator>elsolo21</dc:creator>
      <dc:date>2018-01-11T17:06:33Z</dc:date>
    </item>
    <item>
      <title>Re: Enterprise Miner  - dataset splits - manual steps</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Enterprise-Miner-dataset-splits-manual-steps/m-p/426943#M6548</link>
      <description>&lt;P&gt;Hello&amp;nbsp;elsolo21 -&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;gt;&amp;nbsp;Do the majority of the work in a SAS code node - I have done this but I don't know how to export these 3 sets out to be able to use it with the rest of the model. Everything is 'stuck' in the workspace.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Since you have already completed the majority of the work in a SAS Code node, you can use these SAS Code node macro variables to create the three data sources:&amp;nbsp;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;amp;EM_EXPORT_TRAIN&lt;BR /&gt;&amp;nbsp; &amp;amp;EM_EXPORT_VALIDATE&lt;BR /&gt;&amp;nbsp; &amp;amp;EM_EXPORT_TEST&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P&gt;Those choices are displayed in the SAS Code node Code Editor window.&amp;nbsp; Click the Macro Variables subtab, and scroll down to the Exports section.&amp;nbsp; You can click-and-drag each choice down into the Training Code section, and the ampersand (&amp;amp;) is automatically added for you.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Example 1 -&amp;nbsp; if you have one big data set and want to break it into three data sources, then try this code:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp;data&amp;nbsp;&amp;amp;EM_EXPORT_TRAIN&amp;nbsp; &amp;nbsp;&amp;amp;EM_EXPORT_VALIDATE&amp;nbsp; &amp;nbsp; &amp;amp;EM_EXPORT_TEST;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; set mybigdata;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; if &amp;lt;condition 1 is true&amp;gt; then output&amp;nbsp;&lt;SPAN&gt;&amp;amp;EM_EXPORT_TRAIN;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; else if &amp;lt;condition 2 is true&amp;gt; then output&amp;nbsp;&amp;amp;EM_EXPORT_VALIDATE;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; else if &amp;lt;condition 3 is true&amp;gt; then output&amp;nbsp;&amp;amp;EM_EXPORT_TEST;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; run;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp;Example 2 - if you have three data sets that already satisfy each condition, then try this code:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; data &amp;amp;EM_EXPORT_TRAIN;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;set mycondition1data;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;run;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; data&amp;nbsp;&amp;nbsp;&amp;amp;EM_EXPORT_VALIDATE;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;set mycondition2data;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;run;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; data&amp;nbsp;&amp;amp;EM_EXPORT_TEST;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;set mycondition3data;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;run;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Connect from that SAS Code node to your modeling nodes, and they should have access to the three data sources.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Have a great day.&lt;/P&gt;</description>
      <pubDate>Thu, 11 Jan 2018 18:14:23 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Enterprise-Miner-dataset-splits-manual-steps/m-p/426943#M6548</guid>
      <dc:creator>MikeStockstill</dc:creator>
      <dc:date>2018-01-11T18:14:23Z</dc:date>
    </item>
    <item>
      <title>Re: Enterprise Miner  - dataset splits - manual steps</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Enterprise-Miner-dataset-splits-manual-steps/m-p/426954#M6549</link>
      <description>&lt;P&gt;Thank you!&amp;nbsp; I was very close to your second solution.&amp;nbsp; I was getting that the train dataset already existed.&amp;nbsp; That was because of an older node I forgot to delete.&amp;nbsp; This was very helpful!&lt;/P&gt;</description>
      <pubDate>Thu, 11 Jan 2018 18:49:24 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Enterprise-Miner-dataset-splits-manual-steps/m-p/426954#M6549</guid>
      <dc:creator>elsolo21</dc:creator>
      <dc:date>2018-01-11T18:49:24Z</dc:date>
    </item>
  </channel>
</rss>

