<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: K-fold cross validation - is it applied at all? in SAS Studio</title>
    <link>https://communities.sas.com/t5/SAS-Studio/K-fold-cross-validation-is-it-applied-at-all/m-p/758608#M10103</link>
    <description>&lt;P&gt;Hello &lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/386874"&gt;@yiyhio&lt;/a&gt; ,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;This is cross-validation for assessing / selecting the model(s), not for constructing the model(s).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;This is what the documentation says:&lt;/P&gt;
&lt;P&gt;===========&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;UL&gt;
&lt;LI class="xisDoc-item"&gt;&lt;SPAN class="xisDoc-windowItem"&gt;Validation method&lt;/SPAN&gt; — Specifies how to partition the data for assessing the models. Note that if your data is partitioned, then that partition is used and &lt;SPAN class="xisDoc-windowItem"&gt;Validation method&lt;/SPAN&gt;, &lt;SPAN class="xisDoc-windowItem"&gt;Validation data proportion&lt;/SPAN&gt;, and &lt;SPAN class="xisDoc-windowItem"&gt;Cross validation number of folds&lt;/SPAN&gt; are all ignored. Here are the possible values:
&lt;UL class="xisDoc-listUnordered"&gt;
&lt;LI class="xisDoc-item"&gt;&lt;SPAN class="xisDoc-windowItem"&gt;Partition&lt;/SPAN&gt; — Specifies using a single partition of a training set. With partition, you specify proportions to use for randomly assigning observations to each role.&lt;/LI&gt;
&lt;LI class="xisDoc-item"&gt;&lt;SPAN class="xisDoc-windowItem"&gt;K-fold cross validation&lt;/SPAN&gt; — Specifies using the &lt;EM class="xisDoc-definedTerm"&gt;k&lt;/EM&gt;-fold cross validation method. In &lt;EM class="xisDoc-definedTerm"&gt;k&lt;/EM&gt;-fold cross validation, each model evaluation requires &lt;EM class="xisDoc-definedTerm"&gt;k&lt;/EM&gt; training executions (on &lt;EM class="xisDoc-definedTerm"&gt;k-1&lt;/EM&gt; data folds) and &lt;EM class="xisDoc-definedTerm"&gt;k&lt;/EM&gt; scoring executions (on one holdout fold). This increases the evaluation time by approximately a factor of &lt;EM class="xisDoc-definedTerm"&gt;k&lt;/EM&gt;.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P class="xisDoc-paragraph"&gt;For small to medium data tables, cross validation provides, on average, a better representation of error across the whole data table. &lt;SPAN class="xisDoc-windowItem"&gt;Partition&lt;/SPAN&gt; is the default value.&lt;/P&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;===========&lt;/P&gt;
&lt;P&gt;To use k-fold cross-validation for constructing the model, see here:&lt;/P&gt;
&lt;P&gt;Cross Validation of a Forest Model&lt;BR /&gt;&lt;A href="https://go.documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/casactml/casactml_mltools_example01.htm" target="_blank"&gt;https://go.documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/casactml/casactml_mltools_example01.htm&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;The above example uses the crossValidateML action (in PROC CAS).&lt;BR /&gt;The crossValidateML Action doc:&lt;BR /&gt;&lt;A href="https://go.documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/casactml/casactml_mltools_details02.htm" target="_blank"&gt;https://go.documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/casactml/casactml_mltools_details02.htm&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Kind regards,&lt;/P&gt;
&lt;P&gt;Koen&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Sat, 31 Jul 2021 19:51:21 GMT</pubDate>
    <dc:creator>sbxkoenk</dc:creator>
    <dc:date>2021-07-31T19:51:21Z</dc:date>
    <item>
      <title>K-fold cross validation - is it applied at all?</title>
      <link>https://communities.sas.com/t5/SAS-Studio/K-fold-cross-validation-is-it-applied-at-all/m-p/758343#M10089</link>
      <description>&lt;DIV&gt;&lt;P&gt;Hi!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have trained a model in model studio using a gradient boosting method with a 5-fold cross validation. However, on the results page, on the table that gives a summary of the amount of data used for training/validation, I cannot see the connection between this and my chosen 5-fold.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Here, I chose validation method&lt;/P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="yiyhio_0-1627630591435.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/62122iFDB399ADBC0FE9B0/image-size/medium?v=v2&amp;amp;px=400" role="button" title="yiyhio_0-1627630591435.png" alt="yiyhio_0-1627630591435.png" /&gt;&lt;/span&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;And when I look at the results table for the same node, it says that it is divided into approximately 60% and 30% for the training and validation set.&lt;/P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="yiyhio_1-1627630604589.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/62123iD2F72891E153FF9C/image-size/medium?v=v2&amp;amp;px=400" role="button" title="yiyhio_1-1627630604589.png" alt="yiyhio_1-1627630604589.png" /&gt;&lt;/span&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I was wondering what this means? Does the 5-fold cross validation not apply for some reason, or does this mean something else?&lt;/P&gt;&lt;P&gt;Thank you in advance!&lt;/P&gt;&lt;/DIV&gt;</description>
      <pubDate>Fri, 30 Jul 2021 07:36:53 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Studio/K-fold-cross-validation-is-it-applied-at-all/m-p/758343#M10089</guid>
      <dc:creator>yiyhio</dc:creator>
      <dc:date>2021-07-30T07:36:53Z</dc:date>
    </item>
    <item>
      <title>Re: K-fold cross validation - is it applied at all?</title>
      <link>https://communities.sas.com/t5/SAS-Studio/K-fold-cross-validation-is-it-applied-at-all/m-p/758608#M10103</link>
      <description>&lt;P&gt;Hello &lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/386874"&gt;@yiyhio&lt;/a&gt; ,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;This is cross-validation for assessing / selecting the model(s), not for constructing the model(s).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;This is what the documentation says:&lt;/P&gt;
&lt;P&gt;===========&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;UL&gt;
&lt;LI class="xisDoc-item"&gt;&lt;SPAN class="xisDoc-windowItem"&gt;Validation method&lt;/SPAN&gt; — Specifies how to partition the data for assessing the models. Note that if your data is partitioned, then that partition is used and &lt;SPAN class="xisDoc-windowItem"&gt;Validation method&lt;/SPAN&gt;, &lt;SPAN class="xisDoc-windowItem"&gt;Validation data proportion&lt;/SPAN&gt;, and &lt;SPAN class="xisDoc-windowItem"&gt;Cross validation number of folds&lt;/SPAN&gt; are all ignored. Here are the possible values:
&lt;UL class="xisDoc-listUnordered"&gt;
&lt;LI class="xisDoc-item"&gt;&lt;SPAN class="xisDoc-windowItem"&gt;Partition&lt;/SPAN&gt; — Specifies using a single partition of a training set. With partition, you specify proportions to use for randomly assigning observations to each role.&lt;/LI&gt;
&lt;LI class="xisDoc-item"&gt;&lt;SPAN class="xisDoc-windowItem"&gt;K-fold cross validation&lt;/SPAN&gt; — Specifies using the &lt;EM class="xisDoc-definedTerm"&gt;k&lt;/EM&gt;-fold cross validation method. In &lt;EM class="xisDoc-definedTerm"&gt;k&lt;/EM&gt;-fold cross validation, each model evaluation requires &lt;EM class="xisDoc-definedTerm"&gt;k&lt;/EM&gt; training executions (on &lt;EM class="xisDoc-definedTerm"&gt;k-1&lt;/EM&gt; data folds) and &lt;EM class="xisDoc-definedTerm"&gt;k&lt;/EM&gt; scoring executions (on one holdout fold). This increases the evaluation time by approximately a factor of &lt;EM class="xisDoc-definedTerm"&gt;k&lt;/EM&gt;.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P class="xisDoc-paragraph"&gt;For small to medium data tables, cross validation provides, on average, a better representation of error across the whole data table. &lt;SPAN class="xisDoc-windowItem"&gt;Partition&lt;/SPAN&gt; is the default value.&lt;/P&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;===========&lt;/P&gt;
&lt;P&gt;To use k-fold cross-validation for constructing the model, see here:&lt;/P&gt;
&lt;P&gt;Cross Validation of a Forest Model&lt;BR /&gt;&lt;A href="https://go.documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/casactml/casactml_mltools_example01.htm" target="_blank"&gt;https://go.documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/casactml/casactml_mltools_example01.htm&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;The above example uses the crossValidateML action (in PROC CAS).&lt;BR /&gt;The crossValidateML Action doc:&lt;BR /&gt;&lt;A href="https://go.documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/casactml/casactml_mltools_details02.htm" target="_blank"&gt;https://go.documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/casactml/casactml_mltools_details02.htm&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Kind regards,&lt;/P&gt;
&lt;P&gt;Koen&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 31 Jul 2021 19:51:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Studio/K-fold-cross-validation-is-it-applied-at-all/m-p/758608#M10103</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2021-07-31T19:51:21Z</dc:date>
    </item>
    <item>
      <title>Re: K-fold cross validation - is it applied at all?</title>
      <link>https://communities.sas.com/t5/SAS-Studio/K-fold-cross-validation-is-it-applied-at-all/m-p/758609#M10104</link>
      <description>&lt;P&gt;See also my previous response!!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For questions like this, it's better to post in the board :&lt;/P&gt;
&lt;P&gt;Analytics &amp;gt; SAS Data Mining and Machine Learning.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;More (many more!) of the people in your target audience will read your question (topic).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Kind regards,&lt;/P&gt;
&lt;P&gt;Koen&lt;/P&gt;</description>
      <pubDate>Sat, 31 Jul 2021 19:56:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Studio/K-fold-cross-validation-is-it-applied-at-all/m-p/758609#M10104</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2021-07-31T19:56:39Z</dc:date>
    </item>
  </channel>
</rss>

