pvareschi
Quartz | Level 8

Re: "Applied Analytics Using SAS Enterprise Miner", "Lesson 5: Regression Models Using SAS Enterprise Miner"

Is there a rule of thumb in terms of how many levels should be considered too many? Would it be reasonable to say that if using a Regression node, it would make sense to have no input taking up more than 8-10 degrees of freedom?

Or, for predictive modelling, is it ok to allow more levels for categorical variables?

Accepted Solutions
gcjfernandez
SAS Employee

I hate to provide a rule of thumb when the optimal settings for most options are data-specific.

However, in this case the SAS Enterprise Miner Advanced Metadata Advisor uses 20 as the categorical-levels threshold for rejecting a nominal variable. You could treat this default setting as the best available rule of thumb.
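
To make the threshold concrete, here is a minimal sketch in plain Python (not SAS Enterprise Miner code) that counts the levels of each nominal input and flags those above 20. The function name `summarize_levels`, the column names, and the sample data are all hypothetical, and treating "more than 20 levels" as the cutoff is an assumption about how the threshold is applied. The degrees-of-freedom figure uses the standard fact that a nominal input with k levels, coded as dummy variables, takes k - 1 degrees of freedom in a regression.

```python
from collections import defaultdict

# 20 is the value quoted in the answer above; using ">" as the cutoff is an assumption.
LEVEL_THRESHOLD = 20


def summarize_levels(rows, nominal_columns, threshold=LEVEL_THRESHOLD):
    """Count distinct non-missing levels per nominal column and flag the wide ones."""
    levels = defaultdict(set)
    for row in rows:
        for col in nominal_columns:
            value = row.get(col)
            if value is not None:  # missing values are not counted as a level here
                levels[col].add(value)

    summary = {}
    for col in nominal_columns:
        n_levels = len(levels[col])
        summary[col] = {
            "levels": n_levels,
            # A k-level nominal input coded as dummies uses k - 1 degrees of freedom.
            "degrees_of_freedom": max(n_levels - 1, 0),
            "flag": n_levels > threshold,
        }
    return summary


# Hypothetical data: 'zip_code' has 25 levels, 'region' has 4.
rows = [{"zip_code": f"Z{i % 25:02d}", "region": "NSEW"[i % 4]} for i in range(100)]
summary = summarize_levels(rows, ["zip_code", "region"])
for name, info in summary.items():
    print(name, info)
```

With this sample data, `zip_code` is flagged and `region` is not. A flagged input is a candidate for collapsing levels or for rejection, not an automatic exclusion, since the right choice still depends on the data.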


