BookmarkSubscribeRSS Feed
G_Le_Teuff
Calcite | Level 5

Dear SAS users,

I would like to have some information about Proc glimmix in order to perform a meta-analysis using count data (incidence rate).

There are two main approach in meta analysis : fixed and random effect. The first one is based on the inverse-variance weighted.

Some examples using proc mixed for normal data exist (See Normand, Stat Med). Based on these exemples I will try to apply on cound data using proc glimmix. Two programs are possible. One based on weighted option and the second consists to fix the estimated variance of incidence rate.

proc glimmix data=count_data /*empirical*/;

class study;

model tox =  / dist=nb link=log offset=logPA2 solution;   * tox = number of adverse event - PA2 = sum of person-years;

random intercept   / subject=study ; * to test heterogeneity between study;

weight inv_varln; * weight = inverse variance;

run;

proc glimmix data=count_data;

class study;

model tox =  / dist=nb link=log offset=logPA2 solution;

random intercept   / subject=study;

random intercept   / subject=study group=study;             * I use it but not sure, the objective is to sepcify the variance of incidence rate of each study;

parms  (0)

           (0.125) (0.333) (0.031) (0.2) (0.0625)                    * I fixe the variance of ln(incidence rate) for each study;

           (0.0169) (0.005) (0.007) (0.055) (0.0476)

           (0.083) (0.033) (0.1)  (0.0625)  (0.0212)

          (1) /  hold=(2 to 15);  * the last value is the scale parameter;

run;

These two programs did not give similar results. Could someone explain me why ? While using proc mixed with weight and repeated give similar result in norma data. Do I use the variance of incidence and not the variance of ln incidence ?

I am not sure to the second random statement using group=study statement. I insert it in order to have a variance -covariance matrix with size equal to the number of studies.

Best regards

Gwénaël

18 REPLIES 18
SteveDenham
Jade | Level 19

I've never attempted anything like this, but I did notice one curiosity in your code for the second model.  You have two random statements, but they are much the same, so that the first statement results in an odd occurrence.  What about a statement that looks like:

random intercept/subject=study type=un(1):

I think this would give a diagonal matrix, with the diagonal having the variances for each study.  Perhaps lvm will comment, and help out on this one.

Steve Denham

G_Le_Teuff
Calcite | Level 5

Dear Steve , I used the code you indicated (See below) but it doesn't work.

proc glimmix data=count_data;

class study;

model tox =  / dist=nb link=log offset=logPA2 solution;

random intercept   / subject=study;

random intercept   / subject=study type=un(1);

parms  (0)

           (0.125) (0.333) (0.031) (0.2) (0.0625)                    * I fixe the variance of ln(incidence rate) for each study;

           (0.0169) (0.005) (0.007) (0.055) (0.0476)

           (0.083) (0.033) (0.1)  (0.0625)  (0.0212)

          (1) /  hold=(2 to 15);  * the last value is the scale parameter;

run;

ERROR: 3 PARMS must be given instead of 20

I tried

random _residual_ / group=study; but did not converge

random 

_residual_ / subject study;  ERROR: 3 PARMS must be given instead

random

intercept / subject=study type=un(1 ) group=study; it works but Matrix G not defnite positive

Cov Parm Subject Group Estimate Error

Intercept study 0.000499 .

UN(1,1) study study 1 0.1250 .

UN(1,1) study study 2 0.3330 .

UN(1,1) study study 3 0.03100 .

UN(1,1) study study 4 0.2000 .

UN(1,1) study study 5 0.06250 .

UN(1,1) study study 6 0.01690 .

UN(1,1) study study 7 0.005000 .

UN(1,1) study study 8 0.007000 .

UN(1,1) study study 9 0.05500 .

UN(1,1) study study 10 0.04760 .

UN(1,1) study study 11 0.08300 .

UN(1,1) study study 12 0.03300 .

UN(1,1) study study 13 0.1000 .

UN(1,1) study study 14 0.06250 .

UN(1,1) study study 15 0.02120 .

UN(1,1) study study 16 0.02500 .

UN(1,1) study study 17 0.06250 .

UN(1,1) study study 21 6E-19 .

Scale 0.1579 0.08449

Solutions for Fixed Effects

Standard

Effect Estimate Error DF t Value Pr > |t|

Intercept -2.6138 0.1188 0 -22.00 .

So I am wondering how fix the covariance matrix using random statement

Gwénaël

SteveDenham
Jade | Level 19

I worry about the three parms instead of 20 error. I think it is coming from the following: there are three parms: first random statement = variance between studies, second random statement = also variance between studies, and residual/scale. This would also explain why you get G matrix not positive definite, as the matrix contains a redundant column when both random statements are included.

What about:

proc glimmix data=count_data;

class study;

model tox =  / dist=nb link=log offset=logPA2 solution;

random intercept/subject=study group=study type=chol(1) solution;

parms  

           (0.125) (0.333) (0.031) (0.2) (0.0625)                   

           (0.0169) (0.005) (0.007) (0.055) (0.0476)

           (0.083) (0.033) (0.1)  (0.0625)  (0.0212)

          (1) /  hold=(1to 15); 

run;

Going to a single random statement and using the Cholesky root.  I offer type=chol, as it ensures that the varainace-covariance matrix is at least positive semidefinite, so that should help, as will removing the redundant random intercept/subject=study line.

Good luck.

Steve Denham

SteveDenham
Jade | Level 19

Or, since it looks like there is some variability accounted for by intercept study (0.000429), you might try:

proc glimmix data=count_data;

class study;

model tox =  / dist=nb link=log offset=logPA2 solution;

random intercept/subject=study:

random intercept/subject=study group=study type=chol(1) solution;

parms   (0)

           (0.125) (0.333) (0.031) (0.2) (0.0625)                   

           (0.0169) (0.005) (0.007) (0.055) (0.0476)

           (0.083) (0.033) (0.1)  (0.0625)  (0.0212)

          (1) /  hold=(2 to 16); 

run;

This will estimate the first and last parameter, and hold the parameters for the fifteen studies constant.  I would still stay with the Cholesky parameterization of the G matrix.

Steve Denham

G_Le_Teuff
Calcite | Level 5

Steve,

I ran your 2 proposed models and compare with a Negative Binomial random model using weight statement(=inverse variance) and a NB random model with type=simple.

Note: I updated the number of fixed variance parameters.

1 - Model with weight (inverse variance)

proc glimmix data=count_data /*empirical*/;

                        class study;

                        model tox =  / dist=nb link=log offset=logPA2 solution;

                        random intercept   / subject=study ;

                        weight inv_varln;

                        where groupe=1;

run;                                      

                               -2 Res Log Pseudo-Likelihood       29.74

                               Generalized Chi-Square             16.87

                               Gener. Chi-Square / DF              0.99

                                    Covariance Parameter Estimates

                                                                 Standard

                             Cov Parm     Subject    Estimate       Error

                             Intercept    study        0.2245      0.1309

                             Scale                     0.8371      1.8751

                                     Solutions for Fixed Effects

                                           Standard

                  Effect       Estimate       Error       DF    t Value    Pr > |t|

                  Intercept     -2.6600      0.1244       17     -21.38      <.0001

2 - random Model fixing variance covariance  (R)

proc glimmix data=count_data;

                        class study;

                        model tox =  / dist=nb link=log offset=logPA2 solution;

                        random intercept   / subject=study;

                        random _residual_  / subject=study group=stud y type=simple;

                  parms  (0)

           (0.125) (0.333) (0.031) (0.2) (0.0625)

                                                  (0.0169) (0.005) (0.007) (0.055) (0.0476)

                                                  (0.083) (0.033) (0.1)  (0.0625)  (0.0212)

                                                  (0.025) (0.0625) (0.0169) (0.0060)             / hold=(2 to 19);

                        where groupe=1;

run;

                               -2 Res Log Pseudo-Likelihood       29.74

                               Generalized Chi-Square             16.87

                               Gener. Chi-Square / DF              0.99

                                    Covariance Parameter Estimates

                                                                         Standard

                     Cov Parm         Subject    Group       Estimate       Error

                     Intercept        study                    0.2244      0.1308

                     Residual (VC)    study      study 1       0.1250           .

                     Residual (VC)    study      study 2       0.3330           .

                     Residual (VC)    study      study 3      0.03100           .

                     Residual (VC)    study      study 4       0.2000           .

                     Residual (VC)    study      study 5      0.06250           .

                     Residual (VC)    study      study 6      0.01690           .

                     Residual (VC)    study      study 7     0.005000           .

                     Residual (VC)    study      study 8     0.007000           .

                     Residual (VC)    study      study 9      0.05500           .

                     Residual (VC)    study      study 10     0.04760           .

                     Residual (VC)    study      study 11     0.08300           .

                     Residual (VC)    study      study 12     0.03300           .

                     Residual (VC)    study      study 13      0.1000           .

                     Residual (VC)    study      study 14     0.06250           .

                     Residual (VC)    study      study 15     0.02120           .

                     Residual (VC)    study      study 16     0.02500           .

                     Residual (VC)    study      study 17     0.06250           .

                     Residual (VC)    study      study 21     0.01690           .

                     Scale                                     0.8399      1.8777

                                     Solutions for Fixed Effects

                                           Standard

                  Effect       Estimate       Error       DF    t Value    Pr > |t|

                  Intercept     -2.6600      0.1244       17     -21.38      <.0001

3 - Model with no random effect and type=chol(1) for matrix R

proc glimmix data=count_data;

                        class study;

                        model tox =  / dist=nb link=log offset=logPA2 solution;

                        random intercept/subject=study group=study type=chol(1);

                        parms 

           (0.125) (0.333) (0.031) (0.2) (0.0625)                   

           (0.0169) (0.005) (0.007) (0.055) (0.0476)

           (0.083) (0.033) (0.1)  (0.0625)  (0.0212)

                                                  (0.025) (0.0625) (0.0169) (0.0060)            /  hold=(1 to 19); 

                        where groupe=1;

run;

                               -2 Res Log Pseudo-Likelihood       99.51

                               Generalized Chi-Square            113.25

                               Gener. Chi-Square / DF              6.66

                                    Covariance Parameter Estimates

                                                                       Standard

                       Cov Parm     Subject    Group       Estimate       Error

                       CHOL(1,1)    study      study 1       0.1250           .

                       CHOL(1,1)    study      study 2       0.3330           .

                       CHOL(1,1)    study      study 3      0.03100           .

                       CHOL(1,1)    study      study 4       0.2000           .

                       CHOL(1,1)    study      study 5      0.06250           .

                       CHOL(1,1)    study      study 6      0.01690           .

                       CHOL(1,1)    study      study 7     0.005000           .

                      CHOL(1,1)    study      study 8     0.007000           .

                       CHOL(1,1)    study      study 9      0.05500           .

                       CHOL(1,1)    study      study 10     0.04760           .

                       CHOL(1,1)    study      study 11     0.08300           .

                       CHOL(1,1)    study      study 12     0.03300           .

                       CHOL(1,1)    study      study 13      0.1000           .

                       CHOL(1,1)    study      study 14     0.06250           .

                       CHOL(1,1)    study      study 15     0.02120           .

                       CHOL(1,1)    study      study 16     0.02500           .

                       CHOL(1,1)    study      study 17     0.06250           .

                       CHOL(1,1)    study      study 21     0.01690           .

                       Scale                               0.006000           .

                                     Solutions for Fixed Effects

                                           Standard

                  Effect       Estimate       Error       DF    t Value    Pr > |t|

                  Intercept     -2.7003     0.04557       17     -59.26      <.0001

4 - random Model and type=chol(1) for R matrix

proc glimmix data=count_data;

                        class study;

                        model tox =  / dist=nb link=log offset=logPA2 solution;

                        random intercept/subject=study;

                        random intercept/subject=study group=study type=chol(1);

                        parms  (0)

           (0.125) (0.333) (0.031) (0.2) (0.0625)                   

           (0.0169) (0.005) (0.007) (0.055) (0.0476)

           (0.083) (0.033) (0.1)  (0.0625)  (0.0212)

                                                  (0.025) (0.0625) (0.0169) (0.0060)            /  hold=(2 to 19); 

                        where groupe=1;

run;

                               -2 Res Log Pseudo-Likelihood       29.73

                               Generalized Chi-Square             16.89

                               Gener. Chi-Square / DF              0.99

                                    Covariance Parameter Estimates

                                                                       Standard

                       Cov Parm     Subject    Group       Estimate       Error

                       Intercept    study                    0.1043           .

                       CHOL(1,1)    study      study 1       0.1250           .

                       CHOL(1,1)    study      study 2       0.3330           .

                       CHOL(1,1)    study      study 3      0.03100           .

                       CHOL(1,1)    study      study 4       0.2000           .

                       CHOL(1,1)    study      study 5      0.06250           .

                       CHOL(1,1)    study      study 6      0.01690           .

                       CHOL(1,1)    study      study 7     0.005000           .

                       CHOL(1,1)    study      study 8     0.007000           .

                       CHOL(1,1)    study      study 9      0.05500           .

                       CHOL(1,1)    study      study 10     0.04760           .

                       CHOL(1,1)    study      study 11     0.08300           .

                       CHOL(1,1)    study      study 12     0.03300           .

                       CHOL(1,1)    study      study 13      0.1000           .

                       CHOL(1,1)    study      study 14     0.06250           .

                       CHOL(1,1)    study      study 15     0.02120           .

                       CHOL(1,1)    study      study 16     0.02500           .

                       CHOL(1,1)    study      study 17     0.06250           .

                       CHOL(1,1)    study      study 21     0.01690           .

                       Scale                                 0.1103     0.09373

                                     Solutions for Fixed Effects

                                           Standard

                  Effect       Estimate       Error       DF    t Value    Pr > |t|

                  Intercept     -2.6504      0.1239        0     -21.39       .

The random model with weight statement (model 1) and the random model with type=simple (model 2) give similar results. Note: When I use a poisson model the scale parameter is not defined. I need to insert a random _residual_  while scale parameter estimation is by default with Negative binomial. The last 3 models give a Pearson/df close to 1 (control of overdispersion).

The model 3 indicates the need to adjust for overdispersion chiSquare/df=6.66 (>>1).But  I realize I  make a mistake because I wrongly fixed the parm hold=(1 to 19)  leading to fix the scale parameter.

The model  4 (random effect type=chol(1)) give no SE estimation for random effect  and DF=0 for fixed effect. How explain  it ?  Very strange. Thank per advance for you help.

Gwénaël

lvm
Rhodochrosite | Level 12 lvm
Rhodochrosite | Level 12

There are several things to keep in mind with meta-analysis. First of all, with fixed or random effects meta-analysis, you need to fix the within-study variances. With normal distributions, this is straight-forward  by first fixing the residual at 1 and then using weights. (This is easily accomplished, using either mixed or glimmix (see my other posts on a similar topic)). Or, one specifies a separate within-study (i.e., "residual")  variance for each study and holds all of these fixed. You are trying this approach here, but there are some complications, so I don't think you are getting what you want. With a discrete distribution (Poisson or negative binomial (NB)), the within-study ("residual") variance is defined based on the distribution (=mean for Poisson, =(mean+(scale)*mean^2 for NB). Here, "scale" means the within-study scale parameter for the NB (equivalent to 1/k in other parameterizations). This residual scale term is on the scale of the raw data, not the link (but the other random statements refer to the scale of the link). You have the pre-specified within-study variances, but I don't believe they are being substituted for the above-described within-study variances, using your coding. Rather, I think with the first two approaches (your latest post) you are getting the internally-defined residual variance multiplied by the listed variance parameters (or the equivalent with the weights). This gets more complicated because, by using the NB distribution, the program is also estimating a NB scale parameter based on data across the studies (that is the scale parameter being displayed). Thus, it appears that none of your choices is really fixing your within-study variances at the values you want.

I have done a lot with meta-analysis, but either with normal data, or with non-normal data but without fixing the within-study variances (using inidiviudal data within the studies). I have definitely not tried this using the NB distribution (with its complication of estimating a separate scale across the studies).  I don't have an exact suggestion right now, but will think about it more. Here is another complication. If you substitute a variance for "residual", then you no longer have a true Poisson or NB, since these are defined, in part, by the variance:mean relation.

Here is one approach to consider. Choose Poisson, and determine the "residual" variance based on the mean. Then re-scale your listed within-study variances, so that the product gets you back to the desired within-study variances. Then use your method 2 (last post). I need to think about this some more. By the way, your approaches 1 and 2 are closest to what you want (with the qualifiers that I list here).

G_Le_Teuff
Calcite | Level 5

Thank. but I am not sure how to determine the "residual" variance based on the mean. Do I have to take the total number of toxicities (tox) or the incidence rate i.e total number of toxicities divided by person-years. Here A=mu for Poisson model.

This question because the weigth is based on the inverse-variance and an estimation of the variance is c/person-years**2 for incidence rate and 1/c for log(IR).

In the case of binomial data, I have to use the same approach ? with A=pi(1-pi).

Thank par advance

G_Le_Teuff
Calcite | Level 5

To be sure to well understand your proposal and following my last post:

In a data set, I add 2 supplementary lines

mu=tox;

varlnbis=varln/mu;w_varlnbis=w_varln/mu;

When I used mu=log(tox) I obtain the following results

* weighted approach          -2.6739(0.1286) random=0.2824(0.1030)

* fixed the new variance     -2.6778(0.1302) random=0.2980(0.1050)  

and with mu=tox the following results

* weighted approach          -2.6765(0.1275) random=0.2352(0.099)

* fixed the new variance     -2.6778(0.1307) random=0.3047(0.1056)  

lvm
Rhodochrosite | Level 12 lvm
Rhodochrosite | Level 12

You are on the right track based on my previous comments. The difference is still probably due to the estimated scale for the chosen distribution. But the more I think about it, I don't think that Poisson or NB are appropriate for your analysis. Even if the original data in each study are discrete, either Poisson or NB, you are doing the meta-analysis on an estimated parameter from each study. This parameter estimate would be continuous, and probably approximated described by a normal distribution. Your within-study variance (or its inverse, the weight) is based on large-sample (normal) theory. Thus, I think you could avoid a lot of confusion by directly analyzing the parameter estimate for the studies using normal-based approaches.

You can continue with use of GLIMMIX, but base the analysis on LOGtox. The weight should be the inverse of the within-study variance for the logtox for each study. You still need a parms statement, so that the residual is held fixed at 1. The combination of the weight and the parms statements gives you fixed within-study variances and an estimated between-study variance. The output says that the residual is 1, but because of the weights, the residual is actually different for each study (=1/weight).

proc glimmix data=count_data;

class study;

model LOGtox =  / offset=logPA2 solution;  

random intercept   / subject=study ; * to test heterogeneity between study;

weight inv_varln; * weight = inverse variance;

parms (1) (1) / hold=2;

run;

You can read more at:

http://www.lexjansen.com/pharmasug/2000/stats/st09.pdf

(this was written before GLIMMIX was available, but it is very useful).  There are also other ways of using a long parms statement and no weight.

G_Le_Teuff
Calcite | Level 5

Thank you for this information but I have some difficulties to choose what variance used to compute the weight. I am only able to calculate the variance of incidence rate or log(incidence rate) but I do not know which used.

For a poisson regression, the within-study variance is defined as mean but on which scale : incidence rate or log(incidence rate) ? Could you indicate me on which scale the variance of a poisson model is calculated. The model is log(tox/PA^2) = intercept + random effect. Variance=mean but mean is what ? log(tox/PA^2) or tox/PA^2 or tox or log(tox). Sorry if my questions are stupid.

For a normal regression (y=log(tox)), do I calculate the variance of log(tox/PA2) ?

After that, I will be able to compare the 2 approches (poisson and normal).

Thank you per advance

G_Le_Teuff
Calcite | Level 5

Please below the results of different approaches

                                                                                           Fixed                    Random         -2ResLL

Poisson using weight = inverse-variance of incidence rate      -2.6779 (0.1309)       0.3078(0.1058)  31.13

Poisson using weight = inverse-variance of log incidence rate -2.6739(0.1286)       0.2824(0.1030)   30.78

Normal using weight=inverse-variance of log incidence rate*     -2.7431(0.1395)      0.3131(0.1197)   36.05

* because the response is y=log(tox)

The -2ResLL is different between the first 2 models and normal model. Poisson model seem better fit than Normal model. Based on these results, could we select the poisson model ?

Additionnaly, can you confirm that the random effect as estimated in log scale. THANK

lvm
Rhodochrosite | Level 12 lvm
Rhodochrosite | Level 12

It looks like you are on the right track. The results are not expected to be the same. You can't compare log-likelihoods because pseudo-likelihood is used for the Poisson distributions but true restricted likelihoods are used for the normal distribution.

G_Le_Teuff
Calcite | Level 5

But the results can be more different : I applied these 3 approaches on other data (specific toxicities and not total toxicity). So, no simple solution to this problem. Yes, it was what I am thinking about the comparison of Poisson and Normal distribution.

I am wondering whether you will be interest to collaborate on this topic ?

G_Le_Teuff
Calcite | Level 5

For information, please find enclosed a reference

Mixed-effects Poisson regression models for meta-analysis of follow-up studies with constant or varying durations. The international journal of Biostatistics. Vol 5, Issue 1, 2009. P.G .Bagos, G.K. Nikolopoulos. Examples are given but using Strata.

SAS Innovate 2025: Save the Date

 SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!

Save the date!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 18 replies
  • 5601 views
  • 0 likes
  • 3 in conversation