BookmarkSubscribeRSS Feed
Asquared
Calcite | Level 5

I have an ordinal independent variable and ordinal response variable. I used PRoc logistic and checked score test for proportional odds. It did not hold true. Therefore I resorted to Generalized logit. But it gives me the following warning:

The validity of the model fit is questionable.

 

And the log says the following:

 

There is possibly a quasi-complete separation of data points. The maximum likelihood
estimate may not exist.
WARNING: The LOGISTIC procedure continues in spite of the above warning. Results shown are based
on the last maximum likelihood iteration. The validity of the model fit is questionable.

 

 

 

What would be an appropriate way to get an outcome? Or is there any way by which i can eliminate the above errors?

1 REPLY 1
sbxkoenk
SAS Super FREQ

For background info on (quasi-)complete separation, see:

Usage Note 22599: Understanding and correcting complete or quasi-complete separation problems

http://support.sas.com/kb/22/599.html

 

But even when you have a separation condition, the resulting model can be quite good at classifying observations. Check this on a holdout dataset! Holdout dataset = independent observations with known outcome but never seen by the model while training it.

However when you have a separation condition, the resulting model cannot be interpreted. Inference about regression coefficients and odds ratios should be avoided, because maximum likelihood estimates for the model parameters do not exist. You simply treat the model as if it is produced by an uninterpretable machine learning algorithm (like neural nets).

 

What can you do to avoid the separation condition?

Collapsing levels of categorical variables and binning interval variables are commonly used techniques to deal with separation condition.

 

Good luck,

Koen

Brussels

 

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 1750 views
  • 0 likes
  • 2 in conversation