BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
Julia_Mang
Fluorite | Level 6

Dear SAS community,

 

I am running a simulation study with different hierarchical models and different weighting scenarios with proc glimmix. I use e.g. for the nullmodell the following code:

 

proc glimmix data=data method=quadrature empirical=classical;
class cluster;
model ach = / dist=normal solution;
random int / subject=cluster weight=schwt solution;
by rep;
ods output ParameterEstimates=param1 CovParms=param2;
run;

 

Data have been simulated in R having for this scenario variable ach ~N(500,100) and ICC=0.1 in each of 1000 data sets.

 

When I look at the estimated outcomes for each single "by" replication, I recover a very bad coverage rate, i.e. the estimated parameters are badly included in the confidence intervall with given population values from the simulation (e.g. for the intercept of this nullmodell the coverage rate is 71%; this appears by the way for all different weighting scenarios). 

 

I use the same simulated data and procedures also with the programm MPLUS and very high coverage rates (over 95%) appear. From my research, MPLUS is using the same estimation technique as I stated in the SAS syntax with method=quadrature and empirical=classical.

 

I assume that I am specifing still something wrong, otherwise I could not explain those bad coverage rates. Can you imaging what can couse this problem? Or should I use PROC MIXED instead of PROC GLIMMIX, but I though the latter one is preferred nowadays?

 

Thank you in advance for your help and kind regards,

 

Julia Mang

1 ACCEPTED SOLUTION

Accepted Solutions
Rick_SAS
SAS Super FREQ

I don't know anything about MPLUS, so I can't help you there. It sounds like this is a research project, so you might want to discuss it with an advisor or colleague and share the code with him/her.

 

Regarding the initial guesses for estimates, there are many ways to guess the starting values. I've written a few SAS-related posts that might interest you. They do not necessarily use PROC GLMMIX, but the ideas are generally applicable to regression procedures that estimate parameters by solving an optimization problem:

Lastly, I'll mention that comparing results across software packages requires careful reading of the docs and sometimes some detective work. Each software has different defaults, different optimization methods, and different objective functions (eg, reduced restricted maximum likelihood, pseudo-likelihood, or integral approximation by adaptive quadrature or Laplace methods.) It is sometimes not possible to use one software package to exactly duplicate the results of another. There are several good papers and books that you might want to read:

Good luck!

View solution in original post

4 REPLIES 4
Rick_SAS
SAS Super FREQ

When this happens it usually means either

1. The samples are small and the asymptotic coverage probabilities do not apply

2. The simulated data were not generated by the same model that is being tested.

If (2), go back and review how you are simulating the data. Make sure that the random intercept is only generated once per cluster.

 

I am not familiar with the weighting scheme you are using, but the GLMMIX doc does say "PROC GLIMMIX uses the weights provided in the data set directly. To use the scaled weights, you need to provide them in the data set."  Check that MPLUS and GLMMIX are treating the weights in the same way.

Julia_Mang
Fluorite | Level 6

Dear Rick_SAS,

 

thank you for your answer and sorry for my delayed reply.

 

I checked all your suggestions and here is what i can give as feedback:

 

1) Each sample includes 5000 cases, which should be large enough

2) the random intercept in the data simulation is only generated once per cluster. I simulate the data with R before I run the analyses in MPlus and SAS

 

Weights are simulated and accordingly scaled also beforehand in R and used in MPlus and SAS the same way.

 

My next consideration, in order to isolate the problem, would be to read the estimate of the analyses from Mplus as a starting value into SAS and run the MC simulation again. Unfortunately, I only find the possibility for the procedure PROC glimmix to define starting values for the variances via Parms. Or do I overlook the possibility of also reading in start values for the parameter values here?

 

I would be very grateful for feedback!

 

Kind regards,

 

Julia

 

Rick_SAS
SAS Super FREQ

I don't know anything about MPLUS, so I can't help you there. It sounds like this is a research project, so you might want to discuss it with an advisor or colleague and share the code with him/her.

 

Regarding the initial guesses for estimates, there are many ways to guess the starting values. I've written a few SAS-related posts that might interest you. They do not necessarily use PROC GLMMIX, but the ideas are generally applicable to regression procedures that estimate parameters by solving an optimization problem:

Lastly, I'll mention that comparing results across software packages requires careful reading of the docs and sometimes some detective work. Each software has different defaults, different optimization methods, and different objective functions (eg, reduced restricted maximum likelihood, pseudo-likelihood, or integral approximation by adaptive quadrature or Laplace methods.) It is sometimes not possible to use one software package to exactly duplicate the results of another. There are several good papers and books that you might want to read:

Good luck!

Julia_Mang
Fluorite | Level 6

I would like to give you some feedback on my question because you took some time to answer it.

 

In the meantime I was able to find out that MPlus doesn't give empirical coverage rates as a result, but those which are based on a 5% error level for the assumption of a normal distribution. If I look at the empirical coverage rates in MPlus, I get quite exactly the same rates as in SAS.

 

Nevertheless, it has cost a lot of detective work to look behind the program syntax, which I personally find very unfortunate.

 

Many thanks again for your detailed help!

 

Best regards

Julia Mang

SAS Innovate 2025: Save the Date

 SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!

Save the date!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 4 replies
  • 1038 views
  • 2 likes
  • 2 in conversation