<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: SAS EM Node Rules code is omitting data records in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/SAS-EM-Node-Rules-code-is-omitting-data-records/m-p/165525#M1833</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;OK, I finally figured that out. In addition to all of the above settings, I put the Minimum Categorical Size =&lt;SPAN style="color: red;"&gt;1 and now I have all values. &lt;/SPAN&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Wed, 02 Apr 2014 23:15:55 GMT</pubDate>
    <dc:creator>MarianaL</dc:creator>
    <dc:date>2014-04-02T23:15:55Z</dc:date>
    <item>
      <title>SAS EM Node Rules code is omitting data records</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/SAS-EM-Node-Rules-code-is-omitting-data-records/m-p/165522#M1830</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;&lt;BR /&gt;I am using interactive decision trees (due to business requirements). I noticed that the SAS code for Node Rules in omitting some of my data records in the if-then clauses. What is the reason and how should I assign/classify the omitted records? Thank you.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 02 Apr 2014 19:50:04 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/SAS-EM-Node-Rules-code-is-omitting-data-records/m-p/165522#M1830</guid>
      <dc:creator>MarianaL</dc:creator>
      <dc:date>2014-04-02T19:50:04Z</dc:date>
    </item>
    <item>
      <title>Re: SAS EM Node Rules code is omitting data records</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/SAS-EM-Node-Rules-code-is-omitting-data-records/m-p/165523#M1831</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;&lt;BR /&gt;Hi Mariana,&lt;/P&gt;&lt;P&gt;It seems to me that you have a very recent version of Enterprise Miner, as we used to call these rules the "English rules". Now we use the more accurate term "Node rules".&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If you open the results and you go to View-&amp;gt;Model-&amp;gt;Node Rules, you will only see the rules for the terminal leaves of this tree, this means, the nodes that have no further splits.&lt;BR /&gt;In the example below, the nodes 4, 5, and 6 are the terminal leaves of this tree. For this example, the node rules file only has the rules for these three nodes. Even if this file does not have the rules for node 3, you are not omitting any record, as the sum of the counts for the leaves is the total observations. In this example 547+1155+4258 are the total 5960 observations from the root node.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;IMG alt="forMariana.jpg" class="jive-image" src="https://communities.sas.com/legacyfs/online/5891_forMariana.jpg" /&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;You can still see the node rules for any node (including those that are not a terminal leaf). On interactive mode, or from the tree plot on the results, right click on Tools-&amp;gt;Display node rule (or English rule).&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Finally, notice that Node Rules is a pseudo-code, hence the old name "English" rules. They help you understand the rules for the nodes of the tree, but Node Rules file is not a piece of SAS code you can run... but close!&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I hope it helps,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;-Miguel&lt;/P&gt;&lt;P&gt;&lt;A _jive_internal="true" href="https://communities.sas.com/people/MiguelMaldonado?view=profile" style="color: #6611cc; text-indent: 0px; background-color: #ffffff;" target="_blank"&gt;SAS Profile&lt;/A&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 02 Apr 2014 21:14:33 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/SAS-EM-Node-Rules-code-is-omitting-data-records/m-p/165523#M1831</guid>
      <dc:creator>M_Maldonado</dc:creator>
      <dc:date>2014-04-02T21:14:33Z</dc:date>
    </item>
    <item>
      <title>Re: SAS EM Node Rules code is omitting data records</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/SAS-EM-Node-Rules-code-is-omitting-data-records/m-p/165524#M1832</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi Migel,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thank you for your answer. In my case, in order to understand what is happening, I am using only one level of depth, and I need to split around 10000 records into 10 groups by only one categorical (nominal) variable. The frequencies add up but not all of the categorical levels are included in the if-then clauses in the SAS Node Rules. I have been trying this for several different categorical variables, and I always have few missing levels. Since they are not interval variables, nor ordinal, they have to be grouped by target predicting probabilities. I am using Sample=None option and I tried with and without partitioning, and I always have missing levels. I think I may have to set some number of levels variable to use all of the levels somewhere. I have only 70 levels for this example. When creating the data, I did the customiztion and set &lt;STRONG style="font-size: 11pt; font-family: 'Calibri','sans-serif';"&gt;Class Levels Count Threshold&lt;/STRONG&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; font-size: 11pt;"&gt; to 100, but it did not help. &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt; &lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 02 Apr 2014 22:06:53 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/SAS-EM-Node-Rules-code-is-omitting-data-records/m-p/165524#M1832</guid>
      <dc:creator>MarianaL</dc:creator>
      <dc:date>2014-04-02T22:06:53Z</dc:date>
    </item>
    <item>
      <title>Re: SAS EM Node Rules code is omitting data records</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/SAS-EM-Node-Rules-code-is-omitting-data-records/m-p/165525#M1833</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;OK, I finally figured that out. In addition to all of the above settings, I put the Minimum Categorical Size =&lt;SPAN style="color: red;"&gt;1 and now I have all values. &lt;/SPAN&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 02 Apr 2014 23:15:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/SAS-EM-Node-Rules-code-is-omitting-data-records/m-p/165525#M1833</guid>
      <dc:creator>MarianaL</dc:creator>
      <dc:date>2014-04-02T23:15:55Z</dc:date>
    </item>
  </channel>
</rss>

