BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
pvareschi
Quartz | Level 8

Re: Predictive Modeling Using Logistic Regression

Just to confirm my understanding of the following statement at the bottom of page 3.47 of the course text: "Very liberal univariate screening might be helpful when the number of clusters created in PROC VARCLUS is still relatively large".

Does "liberal univariate screening" mean that it is better to err on the side of allowing more inputs through the screening and then rely on regression selection techniques to find the best predictors?

1 ACCEPTED SOLUTION

Accepted Solutions
sasmlp
SAS Employee

If you plan to use the best subset selection method in PROC LOGISTIC, you need to get the number of predictor variables down to around 50 or else the CPU time will be fairly excessive. Therefore, if the number of clusters obtained by PROC VARCLUS is much greater than 50, then further screening methods such as the Spearman and Hoeffding correlation statistics could be used to further reduce the number of predictor variables available to PROC LOGISTIC. Very liberal univariate screening simply means reducing the number of variables down to a reasonable number. 

View solution in original post

1 REPLY 1
sasmlp
SAS Employee

If you plan to use the best subset selection method in PROC LOGISTIC, you need to get the number of predictor variables down to around 50 or else the CPU time will be fairly excessive. Therefore, if the number of clusters obtained by PROC VARCLUS is much greater than 50, then further screening methods such as the Spearman and Hoeffding correlation statistics could be used to further reduce the number of predictor variables available to PROC LOGISTIC. Very liberal univariate screening simply means reducing the number of variables down to a reasonable number. 

 

This is a knowledge-sharing community for learners in the Academy. Find answers to your questions or post here for a reply.
To ensure your success, use these getting-started resources:

Estimating Your Study Time
Reserving Software Lab Time
Most Commonly Asked Questions
Troubleshooting Your SAS-Hadoop Training Environment

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 1 reply
  • 962 views
  • 0 likes
  • 2 in conversation