<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Assign Observation to Train and Validation using the Values for a Particular Column in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Assign-Observation-to-Train-and-Validation-using-the-Values-for/m-p/537047#M7664</link>
    <description>&lt;P&gt;I have already prepared my training and validation data sets using time-based considerations, which required a specific approach. I created a separate variable ('TrainingOrValidation') that indicates whether each observation belongs to the 'Training' or the 'Validation' set. Is there any way in SAS Enterprise Miner to assign observations based on the values in this column? I don't want Enterprise Miner to do the split itself, as shown here; instead, I am looking for a way to tell it which observations belong to training and which belong to validation.&amp;nbsp;&lt;BR /&gt;I would really appreciate any help.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;Thanks&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Data Partition based upon the Value in Column Variable.PNG" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/27304iB98C1F417440F972/image-size/large?v=v2&amp;amp;px=999" role="button" title="Data Partition based upon the Value in Column Variable.PNG" alt="Data Partition based upon the Value in Column Variable.PNG" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
    <pubDate>Wed, 20 Feb 2019 12:07:06 GMT</pubDate>
    <dc:creator>ggfggrr</dc:creator>
    <dc:date>2019-02-20T12:07:06Z</dc:date>
    <item>
      <title>Assign Observation to Train and Validation using the Values for a Particular Column</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Assign-Observation-to-Train-and-Validation-using-the-Values-for/m-p/537047#M7664</link>
      <description>&lt;P&gt;I have already prepared my training and validation data sets using time-based considerations, which required a specific approach. I created a separate variable ('TrainingOrValidation') that indicates whether each observation belongs to the 'Training' or the 'Validation' set. Is there any way in SAS Enterprise Miner to assign observations based on the values in this column? I don't want Enterprise Miner to do the split itself, as shown here; instead, I am looking for a way to tell it which observations belong to training and which belong to validation.&amp;nbsp;&lt;BR /&gt;I would really appreciate any help.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;Thanks&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Data Partition based upon the Value in Column Variable.PNG" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/27304iB98C1F417440F972/image-size/large?v=v2&amp;amp;px=999" role="button" title="Data Partition based upon the Value in Column Variable.PNG" alt="Data Partition based upon the Value in Column Variable.PNG" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 20 Feb 2019 12:07:06 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Assign-Observation-to-Train-and-Validation-using-the-Values-for/m-p/537047#M7664</guid>
      <dc:creator>ggfggrr</dc:creator>
      <dc:date>2019-02-20T12:07:06Z</dc:date>
    </item>
    <item>
      <title>Re: Assign Observation to Train and Validation using the Values for a Particular Column</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Assign-Observation-to-Train-and-Validation-using-the-Values-for/m-p/537224#M7665</link>
      <description>&lt;P&gt;I think you would need to do something like this in a SAS Code node in place of the Data Partition node:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;data &amp;amp;em_export_train &amp;amp;em_export_validate;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp;set &amp;amp;em_import_data;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp;if strip(&lt;SPAN&gt;TrainingOrValidation)&lt;/SPAN&gt;='Training' then output &amp;amp;em_export_train;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp;else if strip(&lt;SPAN&gt;TrainingOrValidation)&lt;/SPAN&gt;='Validation' then output &amp;amp;em_export_validate;&lt;/P&gt;
&lt;P&gt;run;&lt;/P&gt;</description>
      <pubDate>Wed, 20 Feb 2019 21:45:14 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Assign-Observation-to-Train-and-Validation-using-the-Values-for/m-p/537224#M7665</guid>
      <dc:creator>WendyCzika</dc:creator>
      <dc:date>2019-02-20T21:45:14Z</dc:date>
    </item>
    <item>
      <title>Re: Assign Observation to Train and Validation using the Values for a Particular Column</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Assign-Observation-to-Train-and-Validation-using-the-Values-for/m-p/537247#M7666</link>
      <description>&lt;P&gt;Thanks for your quick help. Could you kindly help me understand the following?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1. I would like to understand how these data sets are named and how that helps in splitting the data. Are the data set names&amp;nbsp;&lt;SPAN&gt;em_export_train and&amp;nbsp;em_export_validate automatically understood by SAS to mean that these observations belong to training and validation, respectively?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Can these names be anything?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;2. Do I still need a Data Partition node after the SAS Code node, or can I connect the SAS Code node directly to Variable Clustering/Interactive Grouping/Scorecard?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Thanks again&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Kind regards,&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Mari&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 20 Feb 2019 22:45:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Assign-Observation-to-Train-and-Validation-using-the-Values-for/m-p/537247#M7666</guid>
      <dc:creator>ggfggrr</dc:creator>
      <dc:date>2019-02-20T22:45:50Z</dc:date>
    </item>
    <item>
      <title>Re: Assign Observation to Train and Validation using the Values for a Particular Column</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Assign-Observation-to-Train-and-Validation-using-the-Values-for/m-p/537398#M7668</link>
      <description>&lt;P&gt;1. Yes, those macro variables will resolve to the correct data set names.&amp;nbsp; The only things you might need to change are the name of the variable that holds the partition indicator, which I have as&amp;nbsp;&lt;STRONG&gt;&lt;SPAN&gt;TrainingOrValidation&lt;/SPAN&gt;&lt;/STRONG&gt;,&amp;nbsp;and its values, which I have as&amp;nbsp;&lt;SPAN&gt;'Training' and 'Validation'.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;2. You do not need a Data Partition node after the SAS Code node. The SAS Code node takes the place of the Data Partition node, and you can connect it to whatever nodes come next.&lt;/SPAN&gt;&lt;/P&gt;
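&lt;P&gt;&lt;SPAN&gt;A minimal sketch of the same step with the indicator name and its values pulled into macro variables, so they are easy to change in one place. It assumes the indicator is a character variable; part_var, train_val, and valid_val are hypothetical helper names for illustration, not Enterprise Miner macro variables:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;/* Hypothetical helper macro variables; edit these to match your data */&lt;/P&gt;
&lt;P&gt;%let part_var = TrainingOrValidation;&lt;/P&gt;
&lt;P&gt;%let train_val = Training;&lt;/P&gt;
&lt;P&gt;%let valid_val = Validation;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;data &amp;amp;em_export_train &amp;amp;em_export_validate;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp;set &amp;amp;em_import_data;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp;/* Double quotes are needed so the macro variables resolve inside the literals */&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp;if strip(&amp;amp;part_var)="&amp;amp;train_val" then output &amp;amp;em_export_train;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp;else if strip(&amp;amp;part_var)="&amp;amp;valid_val" then output &amp;amp;em_export_validate;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp;/* Observations with any other value are written to neither data set */&lt;/P&gt;
&lt;P&gt;run;&lt;/P&gt;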
&lt;P&gt;&lt;SPAN&gt;Hope that helps!&lt;/SPAN&gt;&lt;/P&gt;
</description>
      <pubDate>Thu, 21 Feb 2019 14:07:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Assign-Observation-to-Train-and-Validation-using-the-Values-for/m-p/537398#M7668</guid>
      <dc:creator>WendyCzika</dc:creator>
      <dc:date>2019-02-21T14:07:37Z</dc:date>
    </item>
    <item>
      <title>Re: Assign Observation to Train and Validation using the Values for a Particular Column</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Assign-Observation-to-Train-and-Validation-using-the-Values-for/m-p/537400#M7669</link>
      <description>&lt;P&gt;1. Thanks so much. I can see these names under the 'Exported Data' field in the Properties panel. Also, as shown below, it looks like I can write code for all of the exported data sets, including the test and score data sets. Is my understanding right?&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="SAS_Code_Exported data.PNG" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/27326i90DAB1460744809D/image-size/large?v=v2&amp;amp;px=999" role="button" title="SAS_Code_Exported data.PNG" alt="SAS_Code_Exported data.PNG" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;2. I understand. That's a lot of help from you, Wendy.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Kind regards,&lt;/P&gt;&lt;P&gt;Mari&lt;/P&gt;</description>
      <pubDate>Thu, 21 Feb 2019 14:12:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Assign-Observation-to-Train-and-Validation-using-the-Values-for/m-p/537400#M7669</guid>
      <dc:creator>ggfggrr</dc:creator>
      <dc:date>2019-02-21T14:12:50Z</dc:date>
    </item>
  </channel>
</rss>

