There are numerous issues involved in Steve's original message. Some of this is philosophical and some is technical.

If you are modeling R-side (co-)variation with a GLMM, you may be performing a strictly quasi-likelihood analysis, whether you realize it or not. If the conditional distribution (i.e., conditional on the G-side random effects) does not have a free scale parameter (binomial and Poisson, for instance), then any R-side modeling incorporates a multiplicative scale parameter that would not be there for these distributions. As stated on page 128 of Stroup (2012), "No actual probability distribution exists with this structure, but in many cases it adequately models the distribution of the count [or proportion] data, and the quasi-likelihood is perfectly well defined for ... estimation and inference purposes." So this is one way of getting at the usually intractable marginal distribution. If you have no G-side random effects, this situation gives you a GEE analysis, which has been used very successfully for GLMs. (A minimal GLIMMIX sketch of this setup appears below.)

We "see" marginal distributions; that is, observations are from marginal distributions. But it can be argued that observations are generated from conditional distributions, so that a conditional model comes closer to capturing the data-generating mechanism. This is certainly a take-home message of Stroup's book, and the theme runs throughout it (although I am sure I am greatly oversimplifying a much bigger topic; sorry). In the marginal-vs.-conditional debate, it is often overlooked that the two kinds of models target different parameters; Stroup makes a compelling argument that the typical investigator is more interested in the parameter targeted by the conditional model (such as the conditional binomial probability). I basically agree with this, but I am sure it can be debated. The more I learn about GLMMs, the more I lean toward the conditional-model approach to analysis. However, there can be important uses for marginal models, so I am not going to get into any major online debates about this.

In terms of repeated measures, though, I have a difficult time conceptualizing what an autoregressive (or other) structure means for the multiplicative scale parameter (say, with overdispersion for a "binomial" distribution). I can conceptualize this with a random effect in a conditional model.

For exponential-family distributions with a free scale parameter (e.g., gamma, negative binomial, and other two-parameter conditional distributions), R-side analysis (with RANDOM _RESIDUAL_ / …) makes sense as a true likelihood analysis (not quasi-likelihood). But one must be careful in fitting the model; this is a technical issue with the analysis. For instance, a bare RANDOM _RESIDUAL_; statement here would create another multiplicative scale parameter, so that the overall scaling would be the product of two constants; there would be no unique estimates for the two scale terms (a form of overparameterization). However, a statement like RANDOM _RESIDUAL_ / group=TRT; would be useful to indicate that there is a separate scale parameter for each treatment (see the second sketch below).

When you get into repeated measures analysis for the gamma and negative binomial, things can get very messy. If you specify, for instance, an AR(1) structure for R-side analysis, you are defining a working correlation matrix (third sketch below).
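To make the quasi-likelihood point concrete, here is a minimal GLIMMIX sketch, assuming a hypothetical data set trial with binomial counts y out of n trials, a treatment factor trt, and a blocking factor block. The RANDOM _RESIDUAL_; statement adds a multiplicative scale parameter that the binomial does not actually possess; that added scale is exactly what makes the fit quasi-likelihood rather than true likelihood.

   proc glimmix data=trial;
      class trt block;
      model y/n = trt / dist=binomial link=logit;
      random intercept / subject=block;   /* G-side random block effect */
      random _residual_;                  /* R-side multiplicative scale: quasi-likelihood */
   run;

Drop the G-side RANDOM statement and you are in the GEE-type situation mentioned above.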
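For a two-parameter distribution such as the gamma, the R-side scale is part of a true likelihood, and GROUP= is how you would request a separate scale parameter per treatment. A sketch under the same hypothetical variable names:

   proc glimmix data=trial;
      class trt block;
      model y = trt / dist=gamma link=log;
      random intercept / subject=block;
      random _residual_ / group=trt;   /* separate scale parameter for each TRT level */
   run;

A bare RANDOM _RESIDUAL_; here would stack a second multiplicative scale on top of the gamma's intrinsic one, and the two could not be uniquely estimated.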
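And here is what the messy repeated-measures case looks like in code: a sketch, assuming a hypothetical count response measured repeatedly over a time factor within subjects indexed by id. The TYPE=AR(1) parameters define a working correlation matrix on the R side, alongside the negative binomial's own intrinsic scale parameter.

   proc glimmix data=trial;
      class trt time id;
      model count = trt|time / dist=negbin link=log;
      random time / subject=id type=ar(1) residual;   /* R-side AR(1) working correlation */
   run;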
As stated by Stroup (page 435), "it is not clear how the working correlation parameters co-exist with the scale parameters intrinsic to the [conditional] distribution… The area in need of further development is clearly the two-parameter [non-normal] exponential family." My view is that a lot is unknown about R-side analysis for two-parameter non-normal distributions; there are good research opportunities here for statisticians.