PROC MIXED vs PROC GLIMMIX - Outcomes not the same

Lauren_Hanna · Posted 02-24-2023 11:39 AM

Hi - I have a dataset on cattle where we captured the amount of time each animal spent eating (in minutes). We have several class effects to include (breed, size, year of study, and day relative to estrus). Running this model in PROC MIXED "as is" suggests the residuals are not normally distributed. Comparing square root and log transformations in PROC MIXED suggests that the square root transformation is the better option. When I run what I believe is the same model in PROC GLIMMIX so I can get inverse-linked means and standard errors, the outcomes are not the same (with or without the nloptions statement in the code provided). The PROC GLIMMIX output almost looks like the PROC MIXED output when the variable was not transformed. I'm not sure why, and I am hoping experts here can help explain. I've attached SAS output of these scenarios to demonstrate. Code used includes:

*TIME, Min - no transformation;
proc mixed data= nobullestrusC plots = all;
class HeiferID Breed FSGrp Year DRE;
model TimeMin = Year Date Breed|DRE FSGrp|DRE / ddfm=kr;
repeated DRE / subject=HeiferID type=csh;
run;

*TIME, Min - log function transformed;
proc mixed data= nobullestrusC plots = all;
class HeiferID Breed FSGrp Year DRE;
model logTmin = Year Date Breed|DRE FSGrp|DRE / ddfm=kr;
repeated DRE / subject=HeiferID type=csh;
run;

*TIME, Min - square root function transformed;
proc mixed data= nobullestrusC plots = all;
class HeiferID Breed FSGrp Year DRE;
model sqrtTmin = Year Date Breed|DRE FSGrp|DRE / ddfm=kr;
repeated DRE / subject=HeiferID type=csh;
run;

*Time, Min - square root transformation in glimmix;
proc glimmix data = nobullestrusC plots=all;
nloptions technique = NRRIDG;
class HeiferID Breed FSGrp Year DRE;
model TimeMin = Year Date Breed|DRE FSGrp|DRE / ddfm=kr link = power(0.5);
random DRE / subject = HeiferID type = CSH residual; 
run;

I'll also note that the covariance structure type was limited going across procedures, but CSH was the best fit of ones that worked across both. Thank you for help here!

jiltao · Posted 02-24-2023 07:14 PM

The PROC MIXED approach uses the square root of y as the response variable; the PROC GLIMMIX approach models the square root of the mean (mu), while y itself stays on its original scale. The two models are different.

Thanks,

Jill
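
The distinction Jill draws can be written out explicitly. This is only a sketch: the notation is not from the thread (x_ij stands for the fixed-effect covariates, beta for the coefficients, R for the CSH covariance matrix), and it assumes the GLIMMIX run used the normal distribution, since no DIST= option was given.

```latex
\begin{align*}
\text{PROC MIXED on sqrtTmin:}\quad
  & \sqrt{Y_{ij}} = x_{ij}^{\top}\beta + e_{ij}, \qquad e_i \sim N(0, R)
  \;\Longrightarrow\; E\!\left[\sqrt{Y_{ij}}\right] = x_{ij}^{\top}\beta \\[4pt]
\text{PROC GLIMMIX, link = power(0.5):}\quad
  & Y_{ij} = \mu_{ij} + e_{ij}, \qquad \sqrt{\mu_{ij}} = x_{ij}^{\top}\beta, \qquad e_i \sim N(0, R) \\[4pt]
\text{Jensen's inequality:}\quad
  & E\!\left[\sqrt{Y_{ij}}\right] \le \sqrt{E\!\left[Y_{ij}\right]}
\end{align*}
```

In the second model the errors are still assumed normal on the original TimeMin scale, which would be consistent with the GLIMMIX residuals resembling those of the untransformed PROC MIXED fit.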

Lauren_Hanna · Posted 02-27-2023 10:44 AM

Thank you Jill. How do I need to change the PROC GLIMMIX code so that it mirrors the PROC MIXED version? Is that possible? I thought it was from some other SAS Community posts I read, but perhaps I am mistaken.

jiltao · Posted 02-27-2023 05:22 PM

You would need to use the same transformed dependent variable in PROC GLIMMIX as you did in PROC MIXED in order to get the same result in PROC GLIMMIX. For example,

proc glimmix data= nobullestrusC plots = all;
class HeiferID Breed FSGrp Year DRE;
model sqrtTmin = Year Date Breed|DRE FSGrp|DRE / ddfm=kr;
random DRE / subject=HeiferID type=csh residual;
run;

Lauren_Hanna · Posted 02-27-2023 05:51 PM

Thank you again Jill. This solution defeats the purpose of what I am trying to accomplish. I want to model the TIME variable since it has non-normal tendencies using PROC GLIMMIX so I can ensure modeling assumptions are met while also being able to find the inverse link of the predicted means and standard errors of fixed effects (i.e., having the predicted means/SE in TIME scale, not square root). Doing so in PROC GLIMMIX with the same type of transformation as the link does not improve that modeling effort (see original PDF). I have not come across a fit better than the sqrtTmin in PROC MIXED so far, but I cannot get back-transformed estimates that route. Trying to use a different distribution with that link = power(0.5) also poses problems. Do you have any recommendations in this case?
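
For readers who want original-scale estimates from the square-root PROC MIXED fit, one option is to back-transform the LS-means by hand. The following is a minimal sketch, not something proposed in the thread: the LSMEANS effect (Breed*DRE) and the data set names lsm_sqrt and lsm_back are hypothetical choices, and the standard error uses a first-order delta-method approximation.

```sas
/* Sketch only: fit the square-root model and capture LS-means via ODS */
proc mixed data=nobullestrusC;
  class HeiferID Breed FSGrp Year DRE;
  model sqrtTmin = Year Date Breed|DRE FSGrp|DRE / ddfm=kr;
  repeated DRE / subject=HeiferID type=csh;
  lsmeans Breed*DRE / cl;              /* effect chosen for illustration */
  ods output LSMeans=lsm_sqrt;         /* hypothetical data set name */
run;

/* Back-transform to minutes */
data lsm_back;                         /* hypothetical data set name */
  set lsm_sqrt;
  TimeMin_est = Estimate**2;                 /* square of the sqrt-scale LS-mean */
  TimeMin_se  = 2 * abs(Estimate) * StdErr;  /* delta method: d(x**2)/dx = 2x */
  if Lower >= 0 then do;                     /* squaring is monotone only for nonnegative limits */
    TimeMin_lower = Lower**2;
    TimeMin_upper = Upper**2;
  end;
run;
```

Note that the squared LS-mean is not the arithmetic mean of TimeMin: because E[Y] = (E[sqrt(Y)])**2 + Var(sqrt(Y)), the squared value sits below the mean on the original scale.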

jiltao · Posted 02-28-2023 09:15 AM

Then I am not sure why you wanted to "mirror" the PROC MIXED model. PROC MIXED assumes the response variable is normally distributed.

You might fit the model in PROC GLIMMIX and use the DIST= option to specify a distribution that works better than the normal for your data.

SteveDenham · Posted 03-08-2023 02:09 PM

Elapsed times often follow a gamma distribution.

SteveDenham
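
A gamma fit along these lines might look like the sketch below. It reuses the model terms from the original post; the LSMEANS statement, its effects, and the PLOTS= request are illustrative additions rather than code from the thread. LINK=LOG is written out for clarity (it is the GLIMMIX default for the gamma distribution), and the gamma distribution requires TimeMin to be strictly positive.

```sas
/* Sketch only: gamma-distributed response with a log link */
proc glimmix data=nobullestrusC plots=residualpanel;
  class HeiferID Breed FSGrp Year DRE;
  model TimeMin = Year Date Breed|DRE FSGrp|DRE
        / dist=gamma link=log ddfm=kr;
  /* R-side heterogeneous compound symmetry, as in the earlier models */
  random DRE / subject=HeiferID type=csh residual;
  /* ILINK reports means and standard errors on the TimeMin (minutes) scale */
  lsmeans Breed*DRE FSGrp*DRE / ilink cl;
run;
```

The ILINK option on LSMEANS is what supplies the inverse-linked means and standard errors asked about in the original question, without transforming the response beforehand.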

Lauren_Hanna · Posted 03-23-2023 12:43 PM

Steve - Thank you for the thoughts there. I changed the distribution to Gamma with the default link setting. The distributions of the residuals do not look different, but they are tighter around zero, with most falling within the +/- 1 range. There were a couple above +1 (1.56 was the highest), but I could not see anything in their raw form that would justify removing them. I think we can proceed with this fit and be comfortable with the outcomes.

I appreciate both of your comments here!
