BookmarkSubscribeRSS Feed
wcw2
Obsidian | Level 7

I'm running a model in Proc Logistic, modeling the probability of a negative culture (Y/N) with the dichotomous predictors drug (Y/N) and disease severity (Y/N). I also need to include study site (34 of these and many are sparsely populated) as it's a confounder. However, when I do, the model falls apart ("Quasi-complete separation of data points detected...WARNING: The maximum likelihood estimate may not exist....WARNING: The validity of the model fit is questionable."), I guess because there are so many sites. How do I approach this problem? Should I group the sites into several chunks? I don't often run multivariate models. Thank you.

3 REPLIES 3
PaigeMiller
Diamond | Level 26

Generally very sparse predictor variables are indeed a problem. You could group the sites, if there is a meaningful way to do such a grouping. Or you could try to find some continuous variable that might represent the sites. 

--
Paige Miller
wcw2
Obsidian | Level 7

OK, thanks. Yes, my plan is to just group them. Most of the population is African sites, so will try Africa/non-Africa groups.

StatDave
SAS Super FREQ

You could fit a conditional logistic model by stratifying on the sites by using the STRATA statement. Doing this will remove the need to estimate the separate parameters for the sites. See the conditional logistic example in the PROC LOGISTIC documentation. If you need to estimate the site parameters, you could try using the penalized likelihood method by adding the FIRTH option. Another possibility is exact estimation, but this is very resource intensive and might not be feasible.

Ready to join fellow brilliant minds for the SAS Hackathon?

Build your skills. Make connections. Enjoy creative freedom. Maybe change the world. Registration is now open through August 30th. Visit the SAS Hackathon homepage.

Register today!
What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 3 replies
  • 379 views
  • 0 likes
  • 3 in conversation