<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Decision Tree subtree properties selection in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Decision-Tree-subtree-properties-selection/m-p/547480#M7757</link>
    <description>&lt;P&gt;I understand how a subtree can be selected for a Decision Tree.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I also see a subtree option in Gradient Boosting.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Since the maximum depth is already defined (for example, 2), the tree in one iteration can have at most 4 leaves (given maximum branch = 2).&lt;/P&gt;&lt;P&gt;As gradient boosting is a sequential algorithm, will it select the subtree before moving to the next iteration?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Please explain this in terms of gradient boosting. Thanks in advance.&lt;/P&gt;</description>
    <pubDate>Sun, 31 Mar 2019 13:23:56 GMT</pubDate>
    <dc:creator>thia1169</dc:creator>
    <dc:date>2019-03-31T13:23:56Z</dc:date>
    <item>
      <title>Decision Tree subtree properties selection</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Decision-Tree-subtree-properties-selection/m-p/480344#M7202</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I would like to ask about subtree selection for the Decision Tree.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Subtree.PNG" style="width: 273px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/21921iF37614E8AD4BA93E/image-size/large?v=v2&amp;amp;px=999" role="button" title="Subtree.PNG" alt="Subtree.PNG" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;What are the criteria for pairing the Method option (Assessment, Largest, and N) with the Assessment Measure option (Decision, Misclassification, Average Square Error, and Lift)?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;What about the Iteration plot, which offers a similar selection (Average Square Error, Misclassification, Sum of Square Error, Maximum Absolute Error, and Subtree Assessment Plot) for deciding the optimal number of leaves?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I find this quite confusing and hope someone can explain it to me. Thank you.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Potiu&lt;/P&gt;</description>
      <pubDate>Mon, 23 Jul 2018 05:39:22 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Decision-Tree-subtree-properties-selection/m-p/480344#M7202</guid>
      <dc:creator>potiu</dc:creator>
      <dc:date>2018-07-23T05:39:22Z</dc:date>
    </item>
    <item>
      <title>Re: Decision Tree subtree properties selection</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Decision-Tree-subtree-properties-selection/m-p/480445#M7205</link>
      <description>&lt;P&gt;Here is information about those properties. If you choose &lt;STRONG&gt;Assessment&lt;/STRONG&gt; for the subtree method, you should see in the Iteration plot that the selected subtree has the best value for the assessment measure you chose. You can view the other measures as well, in case you want to re-run with one of them to get a different subtree.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
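&lt;P&gt;As a rough illustration of the Assessment method ("the smallest subtree with the best assessment value"), the selection rule can be sketched in Python. This is not SAS code: the function name, the (leaves, value) pairs, and the numbers are hypothetical, and the sketch assumes an error-type measure such as misclassification rate, where lower is better.&lt;/P&gt;

```python
# Hypothetical sketch, not SAS Enterprise Miner code.
# Each candidate subtree is a (number_of_leaves, assessment_value) pair,
# where the value is an error-type measure (lower is better).

def select_subtree(candidates):
    """Return the smallest subtree among those with the best (lowest) value."""
    best_value = min(value for _, value in candidates)
    # Among subtrees tied on the best value, prefer the fewest leaves.
    return min(
        (leaves, value) for leaves, value in candidates if value == best_value
    )

# Made-up validation misclassification rates for subtrees of 1 to 5 leaves.
candidates = [(1, 0.40), (2, 0.31), (3, 0.25), (4, 0.25), (5, 0.27)]
selected = select_subtree(candidates)
print(selected)
```

&lt;P&gt;With these made-up numbers, the 3-leaf and 4-leaf subtrees tie on the best value, so the 3-leaf subtree is the one chosen. For a measure where larger is better, such as average profit, the comparison would be reversed.&lt;/P&gt;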
&lt;UL&gt;
&lt;LI&gt;
&lt;DIV id="n1aqo9lc1nj47kn1l0j2opsoq9go" class="xis-item"&gt;
&lt;DIV id="n0jruiw0pkbekbn103zjues5kyog" class="xis-paraSimpleFirst"&gt;&lt;SPAN class="xis-windowItem"&gt;Method&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;— specifies the method that you want to use to select a subtree from the fully grown tree for each possible number of leaves.&lt;/DIV&gt;
&lt;DIV id="p1nlu8t1mh1ogrn1595boesr7do4" class="xis-paraLeadIn"&gt;The following subtree methods are available:&lt;/DIV&gt;
&lt;DIV class="xis-listUnordered"&gt;
&lt;UL&gt;
&lt;LI&gt;
&lt;DIV id="n1wlu4r6ka4vjpn1xxga39dq47b0" class="xis-item"&gt;
&lt;DIV id="n157hwbslbdvc1n1k6bp5w3goojf" class="xis-paraSimpleFirst"&gt;&lt;SPAN class="xis-windowItem"&gt;Assessment&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;(default) — the smallest subtree with the best assessment value. The assessment value depends on the setting that you choose for the Assessment Measure property. Validation data set is used if available.&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV id="n0gm9cm4b1vcjtn13iniqhtfjt5a" class="xis-item"&gt;
&lt;DIV id="n0gauqfes9stz1n1kfumqatspl8g" class="xis-paraSimpleFirst"&gt;&lt;SPAN class="xis-windowItem"&gt;Largest&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;— the largest (full) tree is selected.&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV id="p1up8477ad6c82n1ieku80upq8fa" class="xis-item"&gt;
&lt;DIV id="p1vznvsun0djrun13dd3i6xk84zs" class="xis-paraSimpleFirst"&gt;&lt;SPAN class="xis-windowItem"&gt;N&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;— the largest subtree with at most N leaves is selected. Use the Number of Leaves property to specify the value of N, the number of leaves.&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV id="n0qr637qihl8nkn1grspx3zfv1y2" class="xis-item"&gt;
&lt;DIV id="n0ueo66f71mp2hn13ifvv8z8aj2p" class="xis-paraSimpleFirst"&gt;&lt;SPAN class="xis-windowItem"&gt;Number of Leaves&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;— when the Method property of the&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="xis-windowItem"&gt;Decision Tree&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;node is set to N, specifies the largest number of leaves that you want in a subtree of n leaves. Permitted values are integers greater than or equal to 1. The default value for the Number of Leaves property is 1.&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV id="p1kczckq8h6shmn13cvr0f53a9f4" class="xis-item"&gt;
&lt;DIV id="p044qhurp47ur0n1qi9x128wquah" class="xis-paraSimpleFirst"&gt;&lt;SPAN class="xis-windowItem"&gt;Assessment Measure&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;— specifies the method that you want to use to select the best tree, based on the validation data when the Method property is set to Assessment. If no validation data is available, training data is used.&lt;/DIV&gt;
&lt;DIV id="p1htamwrci52uan0z764qo0wbbyx" class="xis-paraLeadIn"&gt;The available assessment measurements are as follows:&lt;/DIV&gt;
&lt;DIV class="xis-listUnordered"&gt;
&lt;UL&gt;
&lt;LI&gt;
&lt;DIV id="n0rhld8nodniien1bwdh4vslclpu" class="xis-item"&gt;
&lt;DIV id="p0to0qjjuvq7a4n1w1bev9asgjox" class="xis-paraSimpleFirst"&gt;&lt;SPAN class="xis-windowItem"&gt;Decision&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;(default setting) — selects the tree that has the largest average profit and smallest average loss if a profit or loss matrix is defined. If no profit or loss matrix is defined, the value of the model assessment measure is reset in the training process, depending on the measurement level of the target. If the target is interval, the measure is set to Average Square Error. If the target is categorical, the measure is set to Misclassification.&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV id="n046lrhrfyt8j8n1jalv52g220go" class="xis-item"&gt;
&lt;DIV id="n0w7kb1hb30miwn12ew66jvpewj0" class="xis-paraSimpleFirst"&gt;&lt;SPAN class="xis-windowItem"&gt;Average Square Error&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;— selects the tree that has the smallest average square error.&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV id="p01ren25mj1ziqn1jrq3bo9rc6mm" class="xis-item"&gt;
&lt;DIV id="n0h3cy129qsdx8n1m4h9u2d0axhd" class="xis-paraSimpleFirst"&gt;&lt;SPAN class="xis-windowItem"&gt;Misclassification&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;— selects the tree that has the smallest misclassification rate.&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV id="p17l96dogpq2rvn1ivvw5r0kgrue" class="xis-item"&gt;
&lt;DIV id="p0yovpxjr2k1yzn1krz2mgqavn56" class="xis-paraSimpleFirst"&gt;&lt;SPAN class="xis-windowItem"&gt;Lift&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;— evaluates the tree based on the prediction of the top n% of the ranked observations. Observations are ranked based on their posterior probabilities or predicted target values. For an interval target, it is the average predicted target value of the top n% observations. For a categorical target, it is the proportion of events in the top n% of the data. When you set the Measure property to Lift, you must use the Assessment Fraction property to specify the proportion for the top n% of cases.&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/LI&gt;
&lt;/UL&gt;</description>
      <pubDate>Mon, 23 Jul 2018 14:09:16 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Decision-Tree-subtree-properties-selection/m-p/480445#M7205</guid>
      <dc:creator>WendyCzika</dc:creator>
      <dc:date>2018-07-23T14:09:16Z</dc:date>
    </item>
    <item>
      <title>Re: Decision Tree subtree properties selection</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Decision-Tree-subtree-properties-selection/m-p/547480#M7757</link>
      <description>&lt;P&gt;I understand how a subtree can be selected for a Decision Tree.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I also see a subtree option in Gradient Boosting.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Since the maximum depth is already defined (for example, 2), the tree in one iteration can have at most 4 leaves (given maximum branch = 2).&lt;/P&gt;&lt;P&gt;As gradient boosting is a sequential algorithm, will it select the subtree before moving to the next iteration?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Please explain this in terms of gradient boosting. Thanks in advance.&lt;/P&gt;</description>
      <pubDate>Sun, 31 Mar 2019 13:23:56 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Decision-Tree-subtree-properties-selection/m-p/547480#M7757</guid>
      <dc:creator>thia1169</dc:creator>
      <dc:date>2019-03-31T13:23:56Z</dc:date>
    </item>
  </channel>
</rss>

