BookmarkSubscribeRSS Feed
Asquared
Calcite | Level 5

I have an ordinal independent variable and ordinal response variable. I used PRoc logistic and checked score test for proportional odds. It did not hold true. Therefore I resorted to Generalized logit. But it gives me the following warning:

The validity of the model fit is questionable.

 

And the log says the following:

 

There is possibly a quasi-complete separation of data points. The maximum likelihood
estimate may not exist.
WARNING: The LOGISTIC procedure continues in spite of the above warning. Results shown are based
on the last maximum likelihood iteration. The validity of the model fit is questionable.

 

 

 

What would be an appropriate way to get an outcome? Or is there any way by which i can eliminate the above errors?

1 REPLY 1
sbxkoenk
SAS Super FREQ

For background info on (quasi-)complete separation, see:

Usage Note 22599: Understanding and correcting complete or quasi-complete separation problems

http://support.sas.com/kb/22/599.html

 

But even when you have a separation condition, the resulting model can be quite good at classifying observations. Check this on a holdout dataset! Holdout dataset = independent observations with known outcome but never seen by the model while training it.

However when you have a separation condition, the resulting model cannot be interpreted. Inference about regression coefficients and odds ratios should be avoided, because maximum likelihood estimates for the model parameters do not exist. You simply treat the model as if it is produced by an uninterpretable machine learning algorithm (like neural nets).

 

What can you do to avoid the separation condition?

Collapsing levels of categorical variables and binning interval variables are commonly used techniques to deal with separation condition.

 

Good luck,

Koen

Brussels

 

sas-innovate-2024.png

Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.

Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 1732 views
  • 0 likes
  • 2 in conversation