<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Specifying splits in EM 7.1 Decision Trees in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Specifying-splits-in-EM-7-1-Decision-Trees/m-p/102864#M860</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hello All,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Is there a way to specify the number of bins that EM uses when calculating the optimal splitting value for an interval level input?&amp;nbsp; I am building a decision tree model, and would like to specify the number of divisions/bins/comparison points that EM uses to calculate the optimal value for splitting.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;This may be best explained with an example: Say var1 ranges from 1 to 100 and I want to determine the optimal binary split for this variable.&amp;nbsp; My understanding is that EM would bin var1 into a number of buckets and then check the point between each bin as a potential split point.&amp;nbsp; So if there were 100 bins, EM would check bucket1 vs buckets2-100, then buckets1-2 vs 3-100, etc... and select the split with the best logworth.&amp;nbsp; My question is: how can I specify the number of bins that are used in this procedure?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks for your time.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Chad Atkinson&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Mon, 09 Jul 2012 17:36:21 GMT</pubDate>
    <dc:creator>ChadAtkinson</dc:creator>
    <dc:date>2012-07-09T17:36:21Z</dc:date>
    <item>
      <title>Specifying splits in EM 7.1 Decision Trees</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Specifying-splits-in-EM-7-1-Decision-Trees/m-p/102864#M860</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hello All,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Is there a way to specify the number of bins that EM uses when calculating the optimal splitting value for an interval level input?&amp;nbsp; I am building a decision tree model, and would like to specify the number of divisions/bins/comparison points that EM uses to calculate the optimal value for splitting.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;This may be best explained with an example: Say var1 ranges from 1 to 100 and I want to determine the optimal binary split for this variable.&amp;nbsp; My understanding is that EM would bin var1 into a number of buckets and then check the point between each bin as a potential split point.&amp;nbsp; So if there were 100 bins, EM would check bucket1 vs buckets2-100, then buckets1-2 vs 3-100, etc... and select the split with the best logworth.&amp;nbsp; My question is: how can I specify the number of bins that are used in this procedure?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks for your time.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Chad Atkinson&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 09 Jul 2012 17:36:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Specifying-splits-in-EM-7-1-Decision-Trees/m-p/102864#M860</guid>
      <dc:creator>ChadAtkinson</dc:creator>
      <dc:date>2012-07-09T17:36:21Z</dc:date>
    </item>
    <item>
      <title>Re: Specifying splits in EM 7.1 Decision Trees</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Specifying-splits-in-EM-7-1-Decision-Trees/m-p/102865#M861</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;It may be poor form to answer my own post, but perhaps it will assist someone else.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If you use the proc arbor statement, there is an option (exhaustive=) that controls the number of bins that are used when determining the optimal split point in decision tree construction.&amp;nbsp; The default is 5000.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 20 Jul 2012 13:13:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Specifying-splits-in-EM-7-1-Decision-Trees/m-p/102865#M861</guid>
      <dc:creator>ChadAtkinson</dc:creator>
      <dc:date>2012-07-20T13:13:42Z</dc:date>
    </item>
  </channel>
</rss>

