<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Enterprise Miner - Decision Tree - Cross Validation in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Enterprise-Miner-Decision-Tree-Cross-Validation/m-p/629078#M8168</link>
    <description>&lt;P&gt;&lt;FONT color="#000000" face="Calibri" size="3"&gt;Dear community,&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#000000" face="Calibri" size="3"&gt;I need to better understand what the property „Perform Cross Validation“ in the section „Cross Validation“ for a decision tree does in general.&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#000000" face="Calibri" size="3"&gt;For me the purpose of cross validation (CV) is not to help select a particular tree (as the final model) but rather to qualify a model (which is created by 100% of the training sample before the CV), i.e. to provide metrics such as the average MSE (average of all “sub-trees” generated by the CV) which can be useful in asserting the level of precision one can expect from the application.&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#000000" face="Calibri" size="3"&gt;Now I have run two trees separately, one with “Perform Cross Validation”=yes and one without. The trees are different, i.e. the tree with CV=yes has less leaves. According to this outcome I assume that the enterprise miner uses a specific tree created by the CV as the final model (probably the one with the smallest MSE). I.e. a tree which is trained by 100-X% instead of 100% of the initial training sample.&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#000000" face="Calibri" size="3"&gt;Or does the results of the cross validation (average MSE) are used for pruning the original tree? However in this case pruning would be executed after CV…In my case I have selected the pruning property method “assessment” in section subtree.&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#000000" face="Calibri" size="3"&gt;I already thank you for your precious assistance! As it is a general question I hope this can be answered without data, codes.&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#000000" face="Calibri" size="3"&gt;Best regards&lt;/FONT&gt;&lt;/P&gt;</description>
    <pubDate>Tue, 03 Mar 2020 10:46:06 GMT</pubDate>
    <dc:creator>JKarp_11</dc:creator>
    <dc:date>2020-03-03T10:46:06Z</dc:date>
    <item>
      <title>Enterprise Miner - Decision Tree - Cross Validation</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Enterprise-Miner-Decision-Tree-Cross-Validation/m-p/629078#M8168</link>
      <description>&lt;P&gt;&lt;FONT color="#000000" face="Calibri" size="3"&gt;Dear community,&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#000000" face="Calibri" size="3"&gt;I need to better understand what the property „Perform Cross Validation“ in the section „Cross Validation“ for a decision tree does in general.&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#000000" face="Calibri" size="3"&gt;For me the purpose of cross validation (CV) is not to help select a particular tree (as the final model) but rather to qualify a model (which is created by 100% of the training sample before the CV), i.e. to provide metrics such as the average MSE (average of all “sub-trees” generated by the CV) which can be useful in asserting the level of precision one can expect from the application.&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#000000" face="Calibri" size="3"&gt;Now I have run two trees separately, one with “Perform Cross Validation”=yes and one without. The trees are different, i.e. the tree with CV=yes has less leaves. According to this outcome I assume that the enterprise miner uses a specific tree created by the CV as the final model (probably the one with the smallest MSE). I.e. a tree which is trained by 100-X% instead of 100% of the initial training sample.&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#000000" face="Calibri" size="3"&gt;Or does the results of the cross validation (average MSE) are used for pruning the original tree? However in this case pruning would be executed after CV…In my case I have selected the pruning property method “assessment” in section subtree.&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#000000" face="Calibri" size="3"&gt;I already thank you for your precious assistance! As it is a general question I hope this can be answered without data, codes.&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#000000" face="Calibri" size="3"&gt;Best regards&lt;/FONT&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 03 Mar 2020 10:46:06 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Enterprise-Miner-Decision-Tree-Cross-Validation/m-p/629078#M8168</guid>
      <dc:creator>JKarp_11</dc:creator>
      <dc:date>2020-03-03T10:46:06Z</dc:date>
    </item>
  </channel>
</rss>

