<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Consolidation of categorical inputs in SAS Academy for Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Consolidation-of-categorical-inputs/m-p/645516#M723</link>
    <description>&lt;P&gt;Re: "&lt;FONT&gt;Applied Analytics Using SAS Enterprise Miner&lt;/FONT&gt;", "&lt;FONT&gt;Lesson 5: Regression Models Using SAS Enterprise Miner&lt;/FONT&gt;"&lt;/P&gt;&lt;P&gt;I&lt;FONT&gt;s there a rule of thumb in terms of how many levels should be considered too many? Would it be reasonable to say that if using a Regression node, it would make sense to have no input taking up more than 8-10 degrees of freedom?&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT&gt;Or, for predictive modelling, is it ok to allow more levels for categorical variables?&lt;/FONT&gt;&lt;/P&gt;</description>
    <pubDate>Wed, 06 May 2020 05:19:48 GMT</pubDate>
    <dc:creator>pvareschi</dc:creator>
    <dc:date>2020-05-06T05:19:48Z</dc:date>
    <item>
      <title>Consolidation of categorical inputs</title>
      <link>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Consolidation-of-categorical-inputs/m-p/645516#M723</link>
      <description>&lt;P&gt;Re: "&lt;FONT&gt;Applied Analytics Using SAS Enterprise Miner&lt;/FONT&gt;", "&lt;FONT&gt;Lesson 5: Regression Models Using SAS Enterprise Miner&lt;/FONT&gt;"&lt;/P&gt;&lt;P&gt;I&lt;FONT&gt;s there a rule of thumb in terms of how many levels should be considered too many? Would it be reasonable to say that if using a Regression node, it would make sense to have no input taking up more than 8-10 degrees of freedom?&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT&gt;Or, for predictive modelling, is it ok to allow more levels for categorical variables?&lt;/FONT&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 06 May 2020 05:19:48 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Consolidation-of-categorical-inputs/m-p/645516#M723</guid>
      <dc:creator>pvareschi</dc:creator>
      <dc:date>2020-05-06T05:19:48Z</dc:date>
    </item>
    <item>
      <title>Re: Consolidation of categorical inputs</title>
      <link>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Consolidation-of-categorical-inputs/m-p/645675#M729</link>
      <description>&lt;P&gt;I hate to provide rule of thump when optimal solutions for most options are data specific.&lt;/P&gt;
&lt;P&gt;However, for this case SAS Enterprise miner advanced metadata advisor is using 20 as the categorical levels threshold to reject the nominal variable. You could consider this default setting as the best rule of thump.&lt;/P&gt;</description>
      <pubDate>Wed, 06 May 2020 18:33:20 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Consolidation-of-categorical-inputs/m-p/645675#M729</guid>
      <dc:creator>gcjfernandez</dc:creator>
      <dc:date>2020-05-06T18:33:20Z</dc:date>
    </item>
  </channel>
</rss>

