<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Proc in SAS/STAT to make the dataset partition (machine learning) in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-in-SAS-STAT-to-make-the-dataset-partition-machine-learning/m-p/551145#M27485</link>
    <description>&lt;P&gt;Yes, the article &lt;A href="https://blogs.sas.com/content/iml/2019/01/21/training-validation-test-data-sas.html" target="_self"&gt;"Create training, validation, and test data sets in SAS"&lt;/A&gt; describes how to partition data by using the DATA step or by using PROC SURVEYSELECT. It also explains the differences between the two approaches so that you can decide which one is more appropriate for your needs.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The article assumes three data sets, but you can modify the code to remove the "test" data set.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;After you decide which method you want to use, you can define a macro wrapper (such as %PARTITION) and reuse the code multiple times.&lt;/P&gt;</description>
    <pubDate>Mon, 15 Apr 2019 17:22:49 GMT</pubDate>
    <dc:creator>Rick_SAS</dc:creator>
    <dc:date>2019-04-15T17:22:49Z</dc:date>
    <item>
      <title>Proc in SAS/STAT to make the dataset partition (machine learning)</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-in-SAS-STAT-to-make-the-dataset-partition-machine-learning/m-p/551111#M27484</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I would like to know if there is any procedure in SAS/STAT that partitions a data set for a machine learning process (training and validation data sets), something similar to the 'partition' task in SAS Enterprise Miner.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks in advance&lt;/P&gt;</description>
      <pubDate>Mon, 15 Apr 2019 16:00:07 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-in-SAS-STAT-to-make-the-dataset-partition-machine-learning/m-p/551111#M27484</guid>
      <dc:creator>juanvg1972</dc:creator>
      <dc:date>2019-04-15T16:00:07Z</dc:date>
    </item>
    <item>
      <title>Re: Proc in SAS/STAT to make the dataset partition (machine learning)</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-in-SAS-STAT-to-make-the-dataset-partition-machine-learning/m-p/551145#M27485</link>
      <description>&lt;P&gt;Yes, the article &lt;A href="https://blogs.sas.com/content/iml/2019/01/21/training-validation-test-data-sas.html" target="_self"&gt;"Create training, validation, and test data sets in SAS"&lt;/A&gt; describes how to partition data by using the DATA step or by using PROC SURVEYSELECT. It also explains the differences between the two approaches so that you can decide which one is more appropriate for your needs.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
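&lt;P&gt;As a rough sketch of the two approaches (this is not the article's code; the data set name Have, the 70/30 split, and the seed value are assumptions chosen for illustration):&lt;/P&gt;

```sas
/* Assumption: Have is a hypothetical input data set; the 70/30 split is illustrative */

/* Approach 1: DATA step. Each row is assigned independently, so the
   training set holds about 70% of the rows, not exactly 70%. */
data Train Validate;
   call streaminit(12345);                       /* fix the seed so the split is reproducible */
   set Have;
   if rand("Bernoulli", 0.7) then output Train;  /* returns 1 with probability 0.7 */
   else output Validate;
run;

/* Approach 2: PROC SURVEYSELECT. Simple random sampling selects a fixed
   70% of the rows (up to rounding). OUTALL keeps every row and adds a
   Selected indicator (1 = sampled, 0 = not sampled). */
proc surveyselect data=Have out=Part samprate=0.7 outall
                  seed=12345 method=srs;
run;
```

&lt;P&gt;With the second approach, the rows of Part that have Selected=1 could serve as the training data and the remaining rows as the validation data.&lt;/P&gt;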
&lt;P&gt;The article assumes three data sets, but you can modify the code to remove the "test" data set.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;After you decide which method you want to use, you can define a macro wrapper (such as %PARTITION) and reuse the code multiple times.&lt;/P&gt;</description>
      <pubDate>Mon, 15 Apr 2019 17:22:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-in-SAS-STAT-to-make-the-dataset-partition-machine-learning/m-p/551145#M27485</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2019-04-15T17:22:49Z</dc:date>
    </item>
    <item>
      <title>Re: Proc in SAS/STAT to make the dataset partition (machine learning)</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-in-SAS-STAT-to-make-the-dataset-partition-machine-learning/m-p/551199#M27486</link>
      <description>&lt;P&gt;Thank you very much, Rick. This is really useful.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Only one question: is there any way to get a balanced data set, for example, to control the percentage of cases of one variable?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks again&lt;/P&gt;</description>
      <pubDate>Mon, 15 Apr 2019 18:58:40 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-in-SAS-STAT-to-make-the-dataset-partition-machine-learning/m-p/551199#M27486</guid>
      <dc:creator>juanvg1972</dc:creator>
      <dc:date>2019-04-15T18:58:40Z</dc:date>
    </item>
    <item>
      <title>Re: Proc in SAS/STAT to make the dataset partition (machine learning)</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-in-SAS-STAT-to-make-the-dataset-partition-machine-learning/m-p/551238#M27489</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/52605"&gt;@juanvg1972&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;Thank you very much, Rick. This is really useful.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Only one question: is there any way to get a balanced data set, for example, to control the percentage of cases of one variable?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks again&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;You are likely getting to the point where you need to provide a more concrete example of all of the rules you might be attempting to enforce.&lt;/P&gt;
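&lt;P&gt;If the goal is for the training and validation sets to keep the same proportion of each level of a class variable, one possible reading is a stratified split. A minimal sketch, assuming a hypothetical data set Have, a hypothetical class variable Target, and an illustrative 70% sampling rate:&lt;/P&gt;

```sas
/* Assumptions: Have is a hypothetical data set, Target a hypothetical class variable */

/* The STRATA statement expects the input to be sorted by the strata variable */
proc sort data=Have out=HaveSorted;
   by Target;
run;

/* Sample 70% of the rows within each level of Target.
   OUTALL keeps every row and adds Selected (1 = sampled, 0 = not sampled). */
proc surveyselect data=HaveSorted out=Part samprate=0.7 outall
                  seed=12345 method=srs;
   strata Target;
run;

/* Split on the indicator: sampled rows go to Train, the rest to Validate */
data Train Validate;
   set Part;
   if Selected then output Train;
   else output Validate;
run;
```

&lt;P&gt;This preserves the existing class proportions in both partitions; it does not equalize the classes, which would need different rules.&lt;/P&gt;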
&lt;P&gt;PROC SURVEYSELECT with a STRATA statement might do what you are thinking of, but there are several ways to interpret your question, so actual data values would help. Note that the GROUPS= option wants a number of records, not a percent. If you only have two groups, then SAMPRATE= might be what you want: with the correct syntax (for example, the OUTALL option), every record is flagged as selected or not selected.&lt;/P&gt;</description>
      <pubDate>Mon, 15 Apr 2019 20:12:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-in-SAS-STAT-to-make-the-dataset-partition-machine-learning/m-p/551238#M27489</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2019-04-15T20:12:11Z</dc:date>
    </item>
  </channel>
</rss>

