<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Model studio - Use a variable to perform partitions in the train and test sets in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Model-studio-Use-a-variable-to-perform-partitions-in-the-train/m-p/837672#M10330</link>
    <description>&lt;P&gt;Hi all&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I've imported a dataset in model studio to build a pipeline on that. Within the dataset, I have a nominal variable which I want use to stratify both the train and test set with specific proportions (example: 30% class 1, 50% class2, 20% class3).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Now, I'm not able to find documentation explaining how to perform that stratification nor in the Data tab nor in the pipeline.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;There is the chance to define the variable with role as "Partition": but then it only allow to assign one class of the variable to one specific set only (ex. class1 to its totality to the training set..)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Does anyone know if there is a way to setting a specific stratification in both the sets using a variable from the dataset?&lt;/P&gt;</description>
    <pubDate>Mon, 10 Oct 2022 14:52:19 GMT</pubDate>
    <dc:creator>dcortell</dc:creator>
    <dc:date>2022-10-10T14:52:19Z</dc:date>
    <item>
      <title>Model studio - Use a variable to perform partitions in the train and test sets</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Model-studio-Use-a-variable-to-perform-partitions-in-the-train/m-p/837672#M10330</link>
      <description>&lt;P&gt;Hi all&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I've imported a dataset in model studio to build a pipeline on that. Within the dataset, I have a nominal variable which I want use to stratify both the train and test set with specific proportions (example: 30% class 1, 50% class2, 20% class3).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Now, I'm not able to find documentation explaining how to perform that stratification nor in the Data tab nor in the pipeline.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;There is the chance to define the variable with role as "Partition": but then it only allow to assign one class of the variable to one specific set only (ex. class1 to its totality to the training set..)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Does anyone know if there is a way to setting a specific stratification in both the sets using a variable from the dataset?&lt;/P&gt;</description>
      <pubDate>Mon, 10 Oct 2022 14:52:19 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Model-studio-Use-a-variable-to-perform-partitions-in-the-train/m-p/837672#M10330</guid>
      <dc:creator>dcortell</dc:creator>
      <dc:date>2022-10-10T14:52:19Z</dc:date>
    </item>
    <item>
      <title>Re: Model studio - Use a variable to perform partitions in the train and test sets</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Model-studio-Use-a-variable-to-perform-partitions-in-the-train/m-p/837683#M10331</link>
      <description>&lt;P&gt;Checked, but it seems with current edition of Model studio I can't choose multiple class variable levels to map to a single partition level (training/validation/test) from with-in Model Studio (VML), so I will have do this before the dataset import.&lt;/P&gt;</description>
      <pubDate>Mon, 10 Oct 2022 15:29:52 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Model-studio-Use-a-variable-to-perform-partitions-in-the-train/m-p/837683#M10331</guid>
      <dc:creator>dcortell</dc:creator>
      <dc:date>2022-10-10T15:29:52Z</dc:date>
    </item>
  </channel>
</rss>

