pvareschi
Quartz | Level 8

Re: Predictive Modeling Using Logistic Regression

Thresholding for collapsing levels (p.3-17)
When applying thresholding (page 3.17 of course text), instead of grouping all small levels into a single "OTHER", as an alternative approach, would it not make sense to try to aggregate them with the other existing levels, either based on domain knowledge/similarity in meaning (e.g. for residential status, all levels related to "renting" could be grouped together) and/or proportion of response?

1 ACCEPTED SOLUTION
gcjfernandez
SAS Employee

Re: Predictive Modeling Using Logistic Regression

My response:

I agree with your comment: rather than dumping all rare levels into a single "OTHER" group, you could use your business knowledge, or the tools available in SAS Enterprise Miner (Decision Tree node, Variable Selection node), to assign each rare level to an existing level that is similar in meaning or in proportion of response.
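To make the idea concrete, here is a minimal sketch in Python (not SAS, and not the course's method) of merging rare levels into existing levels instead of a single "OTHER" group. The function name `collapse_rare_levels`, the `min_count` threshold, the `manual_map` argument, and the toy residential-status data are all assumptions made up for illustration. A rare level is first assigned by a domain-knowledge mapping if one is given; otherwise it is assigned to the frequent level with the closest proportion of response.

```python
from collections import defaultdict

def collapse_rare_levels(levels, targets, min_count=3, manual_map=None):
    """Return a dict mapping every observed level to the level it is merged into.

    Hypothetical helper: levels with fewer than `min_count` rows are "rare".
    A rare level goes to manual_map[level] if present (domain knowledge),
    otherwise to the frequent level with the closest response proportion.
    """
    manual_map = manual_map or {}
    counts = defaultdict(int)
    events = defaultdict(int)
    for level, y in zip(levels, targets):
        counts[level] += 1
        events[level] += y  # y is a 0/1 response

    rate = {lv: events[lv] / counts[lv] for lv in counts}
    frequent = [lv for lv in counts if counts[lv] >= min_count]
    if not frequent:
        raise ValueError("no level meets min_count")

    mapping = {}
    for lv in counts:
        if counts[lv] >= min_count:
            mapping[lv] = lv                      # frequent levels are kept as-is
        elif lv in manual_map:
            mapping[lv] = manual_map[lv]          # domain-knowledge assignment
        else:
            # fall back to the frequent level with the most similar response rate
            mapping[lv] = min(frequent, key=lambda f: abs(rate[f] - rate[lv]))
    return mapping

# Toy data (invented): residential status and a binary response.
levels = ["own"] * 5 + ["rent"] * 4 + ["rent_furnished"] + ["with_parents"] * 2
targets = [0, 0, 0, 0, 1] + [1, 1, 0, 1] + [1] + [0, 0]

mapping = collapse_rare_levels(
    levels, targets, min_count=3,
    manual_map={"rent_furnished": "rent"},  # "renting" levels grouped by meaning
)
collapsed = [mapping[lv] for lv in levels]
```

Response proportions estimated on very few observations are noisy, so a mapping like this would normally be derived on the training data only and then applied unchanged to validation and scoring data.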
