About lvm

lvm · ‎06-22-2015

I think you should switch to PROC MIXED (or GLIMMIX). Much better functionality, and you get the t values and df (using many different possible df methods).

lvm · ‎06-19-2015

This is what is supposed to happen. GLIMMIX (like MIXED, GLM, etc.) uses an overparameterized model. This is needed to properly test for factor effects and get expected values. With three factor levels, you need a model with three parameters, but the overparameterized model has four parameters (intercept, beta_A, beta_B, and beta_C). The last one ends up as 0 in the estimation. For one factor, this means that the intercept is the expected value for the last level (C), and the other parameters are differences from C. For instance, the expected value for A is intercept + beta_A. This is described in the User's Guide.
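To make the arithmetic concrete, here is a minimal Python sketch (not SAS output). The three cell means are made-up numbers, and it assumes a one-factor model where the fitted values equal the cell means. It only illustrates how the four reported parameters map back to three expected values.

```python
# Hypothetical cell means for one factor with levels A, B, C (made-up numbers).
cell_means = {"A": 10.0, "B": 12.5, "C": 9.0}


def glm_solution(means):
    """Overparameterized solution with the last level's parameter set to 0."""
    levels = list(means)
    last = levels[-1]
    solution = {"intercept": means[last]}
    for level in levels:
        # Each beta is a difference from the last level; the last one is 0.
        solution["beta_" + level] = means[level] - means[last]
    return solution


def expected_value(solution, level):
    """Expected value for a level is intercept + beta for that level."""
    return solution["intercept"] + solution["beta_" + level]


solution = glm_solution(cell_means)
for level in cell_means:
    print(level, solution["beta_" + level], expected_value(solution, level))
```

Four parameters are listed, but beta_C carries no information, so only three are really estimated.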

lvm · ‎06-18-2015

Agree with Steve, overall. For the variance components model, it shouldn't make a difference (for most purposes) if you take out or leave in the main effect random term (assuming you don't use the NOBOUND option). As the model gets more complicated, the nonpositive definite G property can be quite problematic. Then you can take out the main effect for the random term. Model fitting can be quite a bit easier when 0 variance terms are not included. If you are uncertain about denominator degrees of freedom for the different models, use the ddfm=KR option on the model statement. This will typically make everything work out. With NOBOUND, things are different. The 0 variance for the main effect may end up as a negative "variance". So, all the random terms are needed. This negative variance is nonsensical for a conditional interpretation of a model, but works fine for the marginal interpretation (i.e., as long as the TOTAL variance is positive).
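A rough illustration of how a negative "variance" can arise while the total stays positive. This is a Python sketch using the method-of-moments (ANOVA) estimator for a balanced one-way layout as a stand-in; it is not the REML fit from MIXED or GLIMMIX, and the data are made up.

```python
from statistics import mean

# Hypothetical balanced data: 3 groups, 2 observations each (made-up numbers).
groups = [[1.0, 5.0], [2.0, 5.0], [1.5, 4.5]]
k = len(groups)       # number of groups
n = len(groups[0])    # observations per group

grand_mean = mean(x for g in groups for x in g)
group_means = [mean(g) for g in groups]

# Between-group and within-group mean squares.
ms_between = n * sum((m - grand_mean) ** 2 for m in group_means) / (k - 1)
ms_within = sum(
    (x - m) ** 2 for g, m in zip(groups, group_means) for x in g
) / (k * (n - 1))

# ANOVA estimators: unbounded, so the group component can go negative.
var_group = (ms_between - ms_within) / n
var_resid = ms_within
var_total = var_group + var_resid

print("group component:", var_group)
print("residual:", var_resid)
print("total (marginal) variance:", var_total)
```

Here the group means are closer together than the within-group spread would predict, so the group component is negative, yet the marginal variance of an observation is still positive.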

lvm · ‎06-17-2015

Good point about statistical software. This can lead to a lot of confusion in comparing output from different programs with the same data and same model. In MIXED and GLIMMIX, however, one does get the complete likelihood (or restricted likelihood) in the information-criterion statistics, including the term(s) that do not depend on the parameters. For REML estimation, only variance-covariance parameters are used in the calculation because the fixed effects are eliminated at each ML step. I hope to see the complete output from the OP to help figure out this issue.

lvm · ‎06-17-2015

By the way, the formula for BIC is -2LL + p*ln(n). You gave the formula for AIC. Also note that p is the number of variance-covariance parameters with REML estimation (what you are doing). The fixed effect parameters have nothing to do with it (you may already know this, but other readers may not). Are all your variance-covariance parameters 0? Steve also has a good point. It would be good to see all your output.
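For readers who want to check the arithmetic, a small Python sketch of the two formulas follows. The -2 Res Log Likelihood value, p, and n are made-up inputs. Which n a given procedure plugs in is not settled here; the caller supplies it.

```python
import math


def aic(neg2_loglik, p):
    """AIC = -2LL + 2p."""
    return neg2_loglik + 2 * p


def bic(neg2_loglik, p, n):
    """BIC = -2LL + p*ln(n)."""
    return neg2_loglik + p * math.log(n)


neg2_res_loglik = 250.0  # hypothetical -2 Res Log Likelihood

# Two counted covariance parameters: the penalty is visible in both criteria.
print(aic(neg2_res_loglik, p=2), bic(neg2_res_loglik, p=2, n=40))

# No counted covariance parameters: both criteria equal -2LL.
print(aic(neg2_res_loglik, p=0), bic(neg2_res_loglik, p=0, n=40))
```

With p = 0 the penalty term vanishes, so BIC and -2 Res Log Likelihood coincide. That is one way the two numbers could match, which is why the question about zero variance-covariance parameters matters.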

lvm · ‎06-14-2015

I can see why you would have lots of questions about the two kinds of parameterizations. Justification for the GLM parameterization would take lengthy explanations, not suited to a discussion answer (at least I can't think of easy and short explanations -- maybe others can). This all goes back to the 1970s when Jim Goodnight (founder and still CEO of SAS) was deciding on the parameterization for PROC GLM (and others). Basically, they needed to come up with a general approach that would work for any factorial, including nesting, and so on. He and colleagues made it quite clear why the GLM (over-)parameterization was needed, in general, for factorials. There are some old papers and reports that I don't have available right now. It has to do with estimable functions, a concept that is often difficult to understand.

For the over-parameterization, it looks like those zero parameters are not there, but they really are. SAS does not just find the last levels (or whatever you make the reference) and make them 0. They are present in the model fitting. But the generalized inverse of the matrix in the model-fit step ends up with 0s for those terms (the constraint used to get the inverse). But they are still there. This may seem to be the same thing, but it's not. One cannot define expected values of main effects (and maybe other effects) unless those zero parameters are there. Likewise, type3 (or other) tests of hypotheses (especially for main effects) can only be interpreted in terms of expected values (means) with the GLM parameterization.

If you are only interested in the highest-level interaction, you don't need to be concerned about this (other than in defining the contrast of interest). Or if you have only one factor. Also, if all the variables in your model are factors, then all of these parameterizations are giving you the same model fit (just with different coefficients). But some things cannot be done with the reference parameterization.
By the way, you can control the reference level with the GLM parameterization (I show this below). I can get LSMEANS, etc., with this parameterization. (Be careful: SAS rearranges the parameters. See in the Solution table that T1 goes last. This changes the order of things in the estimate statement.) This looks very much like your output with reference parameterization, but it has the subtle differences I describe above. There are also not-so-subtle differences (the type3 tests are totally different for the main effects of G and T with reference and GLM parameterization). Finally, only GLM parameterization is possible directly with PROC GLM, MIXED, and GLIMMIX, the flagship procedures in SAS for factorials. This is not an accident. This is the parameterization that works best, and makes the most sense, as a general framework for factorials. For your application, it doesn't matter.

proc genmod data=test;
  class G (ref='3') T (ref='1') / param=glm;
  model y = G|T / dist=bin type3;
  lsmeans G*T;
  estimate 'G1 vs rest at T1, pos.' G 1 -0.5 -0.5
           G*T 0 0 0 1  0 0 0 -0.5  0 0 0 -0.5;
  lsmestimate G*T 'G1 vs rest at T1, nonpos.' [1,1 4] [-0.5,2 4] [-0.5,3 4] / ilink;
run;

lvm · ‎06-12-2015

Actually, the estimate is -1.24 on a logit scale using both approaches. These must agree. You can calculate this by hand by looking at the interaction means with the GLM parameterization. Be careful, my estimate syntax assumes that the class statement is G T (not T G). The type3 test statistics are very different for the main effects.

proc genmod data=test;
  class G T / param=glm;
  model y = G|T / dist=bin type3;
  lsmeans G*T;
  estimate 'G1 vs rest at T1, pos.' G 1 -0.5 -0.5
           G*T 1 0 0 0  -0.5 0 0 0  -0.5 0 0 0;
  lsmestimate G*T 'G1 vs rest at T1, nonpos.' [1,1 1] [-0.5,2 1] [-0.5,3 1] / ilink;
run;

proc genmod data=test;
  class G (ref='3') T (ref='1') / param=ref;
  model y = G|T / dist=bin type3; *<--sometimes it is useful to look at estimated EFFECTS (alpha, etc.);
  estimate 'G1 vs rest at T1' G 1 -0.5;
run;

lvm · ‎06-11-2015

Using your reference parameterization and your contrast statement, one gets the same result as using my GLM parameterization and my contrast statement. This needs to occur because the contrast here is basically a so-called simple effect of interaction means. For these kinds of questions, either parameterization (with corresponding correct contrasts) will give valid results. My concerns about reference parameterization with factorials have to do with the meaning of test statistics for main effects, expected values, and so on.

lvm · ‎06-11-2015

You need an INPUT statement before time in your data step (in addition to the other INPUT statement). When I copied and pasted your direct code in SAS, it generated an error. I missed that you were using reference parameterization. Just overlooked that. Sorry. Your coding for an estimate statement appears correct for this parameterization. With that said, I personally have a strong dislike of the reference parameterization with factorials, but it may be fine for your purposes. The meaning of tests of main effects (and interactions?) is strained, at best. There have been several discussions of this in the past at this site. Type3 tests are not testing the equality of main effect means to each other. In fact, expected values may not be defined with this parameterization. Try putting in a LSMEANS statement with reference parameterization, and you will get an error message that one needs the GLM parameterization for expected values, including the use of LSMESTIMATE. The GLM parameterization is actually not comparing the groups to the mean, but to the "last" level (or levels) of the factor (this is because GLM is an overparameterization). Contrasts can give all the needed other comparisons. Of course, many find reference parameterization useful, and it may be fine for your needs.

lvm · ‎06-10-2015

You don't have the syntax right. I am assuming that cohort in the class statement should be Group. Your data step won't work because you forgot the INPUT statement. But getting at your problem: To understand your needed syntax for ESTIMATE, write out your model for each group at time 1:

mu11 = int + G1 + T1 + G1T1
mu21 = int + G2 + T1 + G2T1
mu31 = int + G3 + T1 + G3T1

You want mu11 - (mu21 + mu31)/2. So, substitute the full equations for the mu values, and do some algebra (the int and T1 terms cancel). You get:

G1 - 0.5*G2 - 0.5*G3 + G1T1 - 0.5*G2T1 - 0.5*G3T1

I show below how to do this with positional syntax and nonpositional syntax with ESTIMATE (which works on the model parameters), and how to do this with the LSMESTIMATE statement (which works with the means, so you don't need to put all the parameter components). Here I used normal distribution, but this is not relevant. For positional syntax, the order is very important. This code is for the order of terms in my class statement. That is, I write Group followed by Time (Time is then sorted within each Group).

proc glimmix;
  title2 'test case with three groups and 4 times';
  class G T;
  model y = G|T;
  lsmeans G*T;
  estimate 'G1 vs rest at T1, pos.' G 1 -0.5 -0.5
           G*T 1 0 0 0  -0.5 0 0 0  -0.5 0 0 0;
  estimate 'G1 vs rest at T1, nonpos.' G [1,1] [-0.5, 2] [-0.5, 3]
           G*T [1,1 1] [-0.5,2 1] [-0.5,3 1];
  lsmestimate G*T 'G1 vs rest at T1, nonpos.' [1,1 1] [-0.5,2 1] [-0.5,3 1];
run;

All three statements give the same result.
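If you want to double-check the algebra before typing positional coefficients, something like the following Python sketch can help. It is bookkeeping only, not SAS. It assumes three groups and four times, with G*T ordered so that Time varies fastest within Group. The parameter names are labels I made up for the dictionary keys.

```python
from fractions import Fraction

G_LEVELS = [1, 2, 3]
T_LEVELS = [1, 2, 3, 4]


def cell_mean_coefficients(g, t):
    """Coefficients of mu_gt = int + G_g + T_t + GT_gt on every model parameter."""
    coef = {"int": Fraction(1)}
    for gl in G_LEVELS:
        coef[f"G{gl}"] = Fraction(int(gl == g))
    for tl in T_LEVELS:
        coef[f"T{tl}"] = Fraction(int(tl == t))
    for gl in G_LEVELS:
        for tl in T_LEVELS:
            coef[f"G{gl}*T{tl}"] = Fraction(int(gl == g and tl == t))
    return coef


# mu11 - (mu21 + mu31)/2, written as weights on the cell means.
weights = {(1, 1): Fraction(1), (2, 1): Fraction(-1, 2), (3, 1): Fraction(-1, 2)}

contrast = {}
for (g, t), w in weights.items():
    for name, c in cell_mean_coefficients(g, t).items():
        contrast[name] = contrast.get(name, Fraction(0)) + w * c

# Positional coefficient lists in the assumed order (G outer, T inner).
g_part = [float(contrast[f"G{g}"]) for g in G_LEVELS]
t_part = [float(contrast[f"T{t}"]) for t in T_LEVELS]
gt_part = [float(contrast[f"G{g}*T{t}"]) for g in G_LEVELS for t in T_LEVELS]

print("int:", float(contrast["int"]))
print("G:  ", g_part)
print("T:  ", t_part)
print("G*T:", gt_part)
```

The intercept and T coefficients come out as zero, which is why they can be left off the ESTIMATE statement, and the G and G*T lists match the positional coefficients used above.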

lvm · ‎06-10-2015

With such a broad and general question, you must get the book SAS for Mixed Models, second edition (2006) by Littell et al. You have to buy this, but it is absolutely essential! Anyone doing anything with mixed models needs to have a copy of this book.

lvm · ‎06-10-2015

Both the gamma and exponential are defined only for positive real numbers. 0 is not allowed for either distribution. If you don't have random effects you could use PROC FMM for a mixture model, i.e., for a mixture of Prob(Y=0) and Prob(Y>0), the latter being an exponential. With random effects, you could do a mixture with NLMIXED; see examples in Stroup textbook on generalized linear mixed models. However, I am not convinced you want a gamma distribution. It appears that your response variable has an upper limit of 5. The gamma/exponential is unbounded on the right. Multinomial may be appropriate, as suggested by Steve, if you have counts of several individuals for all the treatments or covariates. Response could be rescaled to 0-1 by dividing by the max (5). Then you have a beta distribution. But then you still have the undefined Prob(Y=0). Adding a constant (c) to Y technically allows you to use an exponential distribution. But Y+c will not have the same distributional properties as Y. Consider the exponential. It is defined by a parameter b, the mean; the variance is, by definition, b^2=mean^2. For Y+c, the mean is now b+c, but the variance is unchanged. So, it is impossible for mean^2 to equal the variance. Use of the gamma can handle this.
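The point about adding a constant can be checked with two lines of arithmetic. A minimal Python sketch, with made-up values for b and c:

```python
def exponential_moments(b):
    """Mean and variance of an exponential with mean b."""
    return b, b ** 2


def shifted_moments(b, c):
    """Mean and variance of Y + c: the mean shifts, the variance does not."""
    mean, variance = exponential_moments(b)
    return mean + c, variance


b, c = 2.0, 0.5  # hypothetical mean and added constant

mean_y, var_y = exponential_moments(b)
mean_shifted, var_shifted = shifted_moments(b, c)

print("Y:    mean^2 =", mean_y ** 2, " variance =", var_y)
print("Y+c:  mean^2 =", mean_shifted ** 2, " variance =", var_shifted)
```

For Y the squared mean and the variance agree (4 and 4). For Y+c they do not (6.25 versus 4), so Y+c cannot be exponential for any c > 0.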

lvm · ‎06-04-2015

The gamma distribution has two parameters, mu (which may be a function of covariates and treatments, and associated parameters) and a scale parameter. The scale parameter is like a standard deviation or variance (but not quite); at least it serves that purpose (a measure of variability, similar to sigma with normal data). But with the gamma distribution, the variance is a function of the mean. var(Y) = scale*(mu^2).
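A small Python check of the mean-variance relationship, assuming "scale" here means the dispersion in var(Y) = scale*(mu^2) as written above. Procedures differ in how they report this parameter, so check the documentation for the one you use. The conversion to the textbook shape/theta form is only there to confirm the formula. The numbers are made up.

```python
def gamma_variance(mu, scale):
    """Variance implied by var(Y) = scale * mu^2."""
    return scale * mu ** 2


def to_shape_theta(mu, scale):
    """Textbook gamma parameters: shape = 1/scale, theta = mu * scale."""
    return 1.0 / scale, mu * scale


mu, scale = 4.0, 0.25  # hypothetical mean and dispersion

shape, theta = to_shape_theta(mu, scale)
print("mean from shape*theta:      ", shape * theta)
print("variance from shape*theta^2:", shape * theta ** 2)
print("variance from scale*mu^2:   ", gamma_variance(mu, scale))

# Doubling the mean quadruples the variance at a fixed scale.
print(gamma_variance(2 * mu, scale) / gamma_variance(mu, scale))
```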

lvm · ‎06-04-2015

Both GENMOD and LOGISTIC (and other PROCs) use MLE for a binomial distribution, definitely not OLS. Check out LOGISTIC for great graphics.

lvm · ‎06-03-2015

The variance of the conditional Poisson is the mean, not the inverse of the mean (using standard parameterization). You can use sub=intercept when you want all observations to be correlated, not just within plots (special syntax to show that there is one big subject). You are modeling spatial covariance as a G-side effect, not an R-side one. Thus, the variance of the G side term is not related to the conditional variance of the Poisson. These are different things.
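To see why the G-side variance and the conditional Poisson variance are different things, here is a simplified Python sketch. It assumes a log link with a single independent normal random intercept u ~ N(0, sigma2), not a spatial structure, and the numbers are made up.

```python
import math


def conditional_moments(eta, u):
    """Given the random effect u, Y is Poisson: variance equals the mean."""
    lam = math.exp(eta + u)
    return lam, lam


def marginal_moments(eta, sigma2):
    """Moments after averaging over u ~ N(0, sigma2) on the log scale."""
    mean = math.exp(eta + sigma2 / 2)
    variance = mean + mean ** 2 * (math.exp(sigma2) - 1)
    return mean, variance


eta, sigma2 = 1.0, 0.3  # hypothetical linear predictor and G-side variance

cond_mean, cond_var = conditional_moments(eta, u=0.2)
marg_mean, marg_var = marginal_moments(eta, sigma2)

print("conditional:", cond_mean, cond_var)
print("marginal:   ", marg_mean, marg_var)
```

Conditionally the variance is the mean, whatever sigma2 is. The G-side variance only shows up in the marginal moments, where it inflates the variance above the mean.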


Re: mianalyze of lsmestimate

Re: mianalyze of lsmestimate

Re: TEMPLATE: how to combine the equivalent of LAYOUT LATTICE and LAYO...

TEMPLATE: how to combine the equivalent of LAYOUT LATTICE and LAYOUT D...

Re: SAS code for proc glimmix data - interaction analysis

Re: Mixture of chi square with NLMixed in sas

Re: Stepwise Model Selection for longitudinal binary data using PROc G...

Re: Calculating weight for site effect based on standard error

Re: Estimating treatment effects, 2 Group Pre-Post Matched Analysis

Re: Proc Mix insufficient memory issue

Re: GLIMMIX: order of random variable syntax

Re: mianalyze of lsmestimate

Re: mianalyze of lsmestimate

Re: SAS code for proc glimmix data - interaction analysis

poisson regression goodness of fit stats

Re: Stepwise Model Selection for longitudinal binary data using PROc G...

Re: Reporting T-values in LSMEANS Statement in PROC GLM

Re: proc glimmix fixed effects solution for logistic regression

Re: Lower order terms and interactions involving random effects in pro...

Re: I get the same BIC and -2 Res Log!?

Re: I get the same BIC and -2 Res Log!?

Re: ESTIMATE statement in PROC GENMOD - did I specify things correctly...

Re: ESTIMATE statement in PROC GENMOD - did I specify things correctly...

Re: ESTIMATE statement in PROC GENMOD - did I specify things correctly...

Re: ESTIMATE statement in PROC GENMOD - did I specify things correctly...

Re: ESTIMATE statement in PROC GENMOD - did I specify things correctly...

Re: Proc Glimmix and Proc Mixed output Interpret guidance

Re: PROC GLIMMIX with Gamma Log-Link and 0 counts

Re: question about GLM model SAS code and result interpretation

Re: What is the proper way to estimate a logistic model

Re: Spatial analysis in GLIMMIX