BookmarkSubscribeRSS Feed
☑ This topic is solved. Need further help from the community? Please sign in and ask a new question.
UcheOkoro
Lapis Lazuli | Level 10

Hello,

 

Please, I need help figuring out why there is a great disparity in the binomial regression odds ratio between proc genmod and proc logistic. I would also like to know the more reliable result.

The data was obtained after matching. The following are my codes and results.

 

Thank you!

proc genmod data=COVID.COVID_TRAUMA_MATCHED2 descending;
class  subclass COVID_19_Positivity(DESC) ; 
model  In_hosp_mortality(event='2')= COVID_19_Positivity
/ dist=bin ;
 repeated sub=subclass /type=cs corrw covb printmle;
 lsmeans COVID_19_Positivity/ diff exp cl or;
run;


proc logistic data=COVID.COVID_TRAUMA_MATCHED2; 
CLASS COVID_19_Positivity(DESC); 
strata  subclass;
   model In_hosp_mortality(event='2')= COVID_19_Positivity;
  run;

The result from proc genmod

UcheOkoro_0-1652197634677.png

The results from proc logistic

UcheOkoro_2-1652197698509.png

 

 

1 ACCEPTED SOLUTION

Accepted Solutions
StatDave
SAS Super FREQ

It is because those are entirely different models - one a Generalized Estimating Equations logistic model using GEE (not maximum likelihood) estimation, and the other is a conditional logistic model that maximizes a conditional likelihood. Differences should be expected when fitting such different models using differing estimation algorithms. Importantly, the GEE model is a population-averaged model while the conditional logistic model is a subject-specific model. Yet another possible approach is the random-effects model as can be fit in PROC GLIMMIX which is also a subject-specific model. If you want to make inferences about population differences, then the population-averaged GEE model is more appropriate. If you want to make inferences about, say, change over time (if the repeated measures are over time), then a subject-specific approach can be used. This distinction in model types and estimation methods is discussed in many texts and papers. One paper available online is Hu et. al. (1998), "Comparison of Population-Averaged and Subject-Specific Approaches for Analyzing Repeated Binary Outcomes."

 

For these and other kinds of logistic models, see this note.

 

View solution in original post

2 REPLIES 2
StatDave
SAS Super FREQ

It is because those are entirely different models - one a Generalized Estimating Equations logistic model using GEE (not maximum likelihood) estimation, and the other is a conditional logistic model that maximizes a conditional likelihood. Differences should be expected when fitting such different models using differing estimation algorithms. Importantly, the GEE model is a population-averaged model while the conditional logistic model is a subject-specific model. Yet another possible approach is the random-effects model as can be fit in PROC GLIMMIX which is also a subject-specific model. If you want to make inferences about population differences, then the population-averaged GEE model is more appropriate. If you want to make inferences about, say, change over time (if the repeated measures are over time), then a subject-specific approach can be used. This distinction in model types and estimation methods is discussed in many texts and papers. One paper available online is Hu et. al. (1998), "Comparison of Population-Averaged and Subject-Specific Approaches for Analyzing Repeated Binary Outcomes."

 

For these and other kinds of logistic models, see this note.

 

sas-innovate-2024.png

Available on demand!

Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 366 views
  • 4 likes
  • 2 in conversation