About SteveDenham

SteveDenham · ‎03-11-2021

The code ought to converge, so it becomes a matter of whether or not your questions of interest can be addressed. Unless you have missing cells, you will probably be happier fitting a factor model rather than a means model. Change the '*' to '|' in your model statement. This will give F tests for main effects, two-way interactions and the three-way interaction. Then you can look at block and specific block interactions as random effects in the RANDOM statement, eliminating those that have a zero variance component. Alternatively, you can stay with the means model and, through the use of the LSMESTIMATE statement with the JOINT option, create the tests equivalent to those in the factor model. That is harder to do in most cases. SteveDenham
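A minimal sketch of the factor-model version described above. The dataset name (rcbd), response (y), factors (a, b, c) and blocking variable (block) are hypothetical placeholders, not names from the original thread, and the choice of block*a as the first block interaction to try is only an illustration.

```sas
/* Hypothetical names: rcbd, y, a, b, c, block */
proc glimmix data=rcbd;
  class block a b c;
  /* a|b|c expands to a b a*b c a*c b*c a*b*c, giving F tests for
     main effects, two-way interactions and the three-way interaction */
  model y = a|b|c;
  /* Start with block and one block interaction; drop any term whose
     variance component is estimated as zero, then refit */
  random block block*a;
run;
```

The Covariance Parameter Estimates table is where a zero variance component would show up.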

SteveDenham · ‎03-11-2021

Thanks @STAT_Kathleen. I really appreciate the knowledge about the multinomial in GLIMMIX. I wish that it appeared in the documentation of the DIST= option or in one of the examples. It seems that sometimes an error message is how I learn something critical. This really helps. SteveDenham

SteveDenham · ‎03-10-2021

The only thing that strikes me as unusual is using GROUP=dv. I think this may be confounding things to the point that all the variation is captured in the comparison A vs B. A possibility would be to add the NOINT option to the MODEL statement. This should move the variance currently attributed to the intercept, and thus in residual error, to the comparisons of interest. However, that is just a shot in the dark. SteveDenham

SteveDenham · ‎03-10-2021

Maybe PROC PANEL? Try posting this in the Forecasting and Econometrics forum. It is specifically designed for questions of this sort. SteveDenham

SteveDenham · ‎03-10-2021

Here is a version using PROC MIXED. It yields the same results as the PROC GLIMMIX code.

    proc mixed data=one;
      class trt id site;
      model result=trt site*trt/solution ddfm=bw;
      repeated site/ subject=id(trt) type=cs;
      lsmeans trt/diff;
      lsmeans trt*site;
    run;

SteveDenham

SteveDenham · ‎03-09-2021

The thing to remember is that the following two statements are equivalent as far as MM reporting:

    random intercept/subject=block;
    random block;

It is just that the first statement results in faster calculations, and is required if method=quad. But it still boils down to "Block was fit as a random effect." And random intercept/subject=block(location) is "Location within block was fit as a random effect." So boiling this all down, would this make sense?: "Treatments were randomly assigned within blocks, and multiple locations within each block were measured. Treatment was fit as a fixed effect, while block and location within block were fit as random effects." You could probably look through the ecology or psychology literature to find examples using the phrase 'random intercept', especially where R packages were used to analyze the results. Those might be enough to get you started. SteveDenham
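To see the equivalence of the two RANDOM statements for yourself, something like the following sketch could be run. The dataset (trial), response (y) and treatment factor (trt) are hypothetical placeholders; only block comes from the statements above.

```sas
/* Hypothetical names: trial, y, trt */

/* Form 1: subject-based syntax (processed by subject) */
proc glimmix data=trial;
  class block trt;
  model y = trt;
  random intercept / subject=block;
run;

/* Form 2: block listed directly as a random effect */
proc glimmix data=trial;
  class block trt;
  model y = trt;
  random block;
run;
```

Comparing the Covariance Parameter Estimates tables from the two fits is a quick way to confirm that both describe the same model, "block was fit as a random effect".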

SteveDenham · ‎03-09-2021

If you came to me for an analysis, I would consider this to be a linear mixed model problem, with repeated measures. To analyze this you would need to get your data into a long format, with a single response value for each record, along with the design factors: ID, treatment (compressed/uncompressed) and site. Possible code would look something like this (data are simulated to look something like what you have in Table 1):

    data one;
      call streaminit(12345);
      do i=1 to 2;          /* treatment */
        do j=1 to 5;        /* site within treatment */
          do k=1 to 6;      /* patient */
            if i=1 then do;
              trt='Uncompressed';
              site=j;
              result=7 + rand('normal',0,2);
            end;
            else do;
              trt='Compressed';
              site=j+5;     /* sites renumbered 6 to 10 */
              result=9 + rand('normal',0,1);
            end;
            id=k;
            output;
          end;
        end;
      end;
    run;

    proc glimmix data=one;
      class trt id site;
      model result=trt site*trt/solution ddfm=bw;
      random site/residual subject=id(trt) type=cs;
      lsmeans trt/diff;
      lsmeans trt*site;
    run;

The method you propose will certainly work:

"Firstly, we calculated the average for compressed area (site1~site5) and the average for uncompressed area (site1~site5) for each patient, respectively. So, we compared the average between two groups (compressed area vs. uncompressed area) by paired t tests. However, is it correct way?"

I suspect that the p value for the F test for treatment will be very nearly the same as the p value for the paired t.

One issue is that the sites within each treatment are not truly repeated (that is, site 1 in the compressed area is not site 1 in the uncompressed area). I would recommend renumbering the sites in one of the areas as 6 to 10. Consequently, the model should not contain a main effect for site, in order to prevent non-estimability for the treatment least squares means.

Regarding the missing values question, the mixed model procedures are robust to these so long as the data are missing at random, which is a reasonable assumption.

Here is code in case you want to try a generalized estimating equations (GEE) approach. Note that the standard error is smaller.

    proc genmod data=one;
      class trt id site;
      model result=trt trt*site/type3;
      repeated subject=id*site/type=exch;
      lsmeans trt/diff;
      lsmeans trt*site;
    run;

Treatment means are the same. SteveDenham
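For comparison with the mixed model, the paired t test on per-patient averages described in the question could be sketched roughly as follows, using the simulated dataset one from this post. The intermediate dataset names (avg, wide) and the variable mean_result are made up for illustration.

```sas
/* Average the five sites within each patient and treatment */
proc sort data=one;
  by id trt;
run;

proc means data=one noprint;
  by id trt;
  var result;
  output out=avg mean=mean_result;
run;

/* One row per patient, with columns Compressed and Uncompressed */
proc transpose data=avg out=wide(drop=_name_);
  by id;
  id trt;
  var mean_result;
run;

/* Paired t test on the per-patient averages */
proc ttest data=wide;
  paired Compressed*Uncompressed;
run;
```

The p value from PROC TTEST can then be set beside the F test for trt from the mixed model to see how closely they agree.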

SteveDenham · ‎03-09-2021

There are other reasons to use GLIMMIX aside from data that are non-Gaussian. There is a wider variety of optimizers (TECH=) and methods (MIXED is restricted to REML, ML and MIVQUE0). GLIMMIX has a functional and flexible COVTEST statement. And there are reasons to use MIXED rather than GLIMMIX, primarily related to the Kronecker product variance/covariance structure for doubly repeated measures. In this case, where there is interest in reporting the random effects, the COVTEST statement may be of particular interest. SteveDenham
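As a rough illustration of the COVTEST statement and the TECH= option mentioned above, here is a sketch with hypothetical names (dataset trial, response y, treatment trt, blocking variable block). The particular optimizer is an arbitrary choice for the example, not a recommendation.

```sas
/* Hypothetical names: trial, y, trt, block */
proc glimmix data=trial;
  class block trt;
  model y = trt;
  random intercept / subject=block;
  /* Likelihood-based test that the G-side variance components are zero */
  covtest 'No block variance' zerog;
  /* TECH= picks the optimizer; NRRIDG is one of several available */
  nloptions tech=nrridg;
run;
```

The test result appears in its own table alongside the covariance parameter estimates, which is convenient when the random effects have to be reported.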

SteveDenham · ‎03-08-2021

This is the same question you posed here: "Mean Difference of 4-Level Categorical Variable". Please see the reply by @StatDave regarding use of the %NLmeans macro. And I think we could do with some additional information, such as what you mean by a standardized mean difference. Are you referring to something like a protected least significant difference? If that is the case, you could only derive such a thing on the logit scale, as on the original scale the standard errors will be different depending on the value of the mean. SteveDenham

SteveDenham · ‎03-05-2021

In particular, how long does it take to fit the full model? I suspect that you may have to make some subject matter decisions regarding the independent variables to get this down to a workable size. SteveDenham

SteveDenham · ‎03-04-2021

Look into the LSMEANS statement - that will give you means and SE's. With an ODS OUTPUT statement, you can put those into a dataset. No direct need for a macro. SteveDenham
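A minimal sketch of that idea, assuming PROC GLIMMIX and hypothetical names (dataset trial, response y, factor trt, output dataset trt_lsmeans). The same pattern should carry over to other procedures that support LSMEANS, though the ODS table name is worth checking with ODS TRACE ON for the procedure actually in use.

```sas
/* Hypothetical names: trial, y, trt, trt_lsmeans */
proc glimmix data=trial;
  class trt;
  model y = trt;
  /* Least squares means and standard errors for every level of trt,
     including the reference level */
  lsmeans trt;
  /* Save the LSMeans table as a dataset */
  ods output LSMeans=trt_lsmeans;
run;

proc print data=trt_lsmeans;
run;
```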

SteveDenham · ‎03-02-2021

While I like @StatDave 's response to look at HPGENSELECT, I would suggest a couple of things before you start doing variable selection. Season, treatment, parity and body condition score (BCS) seem to me to be 3 design factors and a continuous covariate, and that covariate (BCS) is well known to have an effect on pregnancy rate in mammals. So in truth you have just four variables, with possible interaction, and no real need to employ variable selection. Try the following MODEL statement:

    model result(event='pregnant')=season*treatment*parity BCS BCS*season BCS*treatment BCS*parity;

This fits a fully saturated model for the design factors, with possible different slopes for the BCS relationship. Work through this to eliminate the interaction terms where the slopes do not differ. Once you have stabilized your selection of appropriate slope terms, you could then fit an effects model, with the relevant covariate/covariate by effect interaction terms in the model. This approach is covered in Milliken and Johnson's Analysis of Messy Data, vol. 3: Analysis of Covariance, or in SAS for Mixed Models (any of the editions 1 to 3) in the chapter on analysis of covariance.

Also, look at the following crosstabulation:

    PROC FREQ data=maanshan320;
      tables parity*season*treatment*result;
    run;

That should give 8 tables that are Nx2, where N is the number of treatments and 2 is the number of levels for result. From those 8 tables, you should readily be able to identify where the separation is occurring, if anywhere, for the design factors.

Also, you may want to look at the results of PROC GLM, with BCS as the dependent variable, and the design factors crossed with the result variable as the independent variables. In the LSMEANS statement, see how BCS separates as a result of the factors.

I think the root cause of the separation issue is the inclusion of high-order interactions with BCS. For some combination or combinations of the design factors and the response, there are likely to be full separations of the covariate. Additionally, fourth order interactions do a great job of modeling noise, especially when one is a continuous variable, which brings us back to @StatDave 's comment regarding fitting the data perfectly. So think carefully about the biological question at hand (which looks like it might be related to feeding dairy cows and seeing what the resulting pregnancy rate is) and formulate a model that addresses those questions. SteveDenham
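The PROC GLM check suggested above might look something like this sketch. It reuses the dataset and variable names from this post (maanshan320, BCS, season, treatment, parity, result); writing the crossing as a single four-way cell term is one reading of "the design factors crossed with the result variable", so treat it as an assumption.

```sas
proc glm data=maanshan320;
  class season treatment parity result;
  /* One mean per season x treatment x parity x result cell */
  model BCS = season*treatment*parity*result;
  /* Cell means of BCS: look for cells where the pregnant and
     non-pregnant groups have clearly separated BCS values */
  lsmeans season*treatment*parity*result / stderr;
run;
quit;
```

Cells that are empty will show up as non-estimable least squares means, which is itself a hint about where the separation comes from.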

SteveDenham · ‎03-02-2021

To add to @PaigeMiller 's response, CORRB would enable you to say something about the collinearity between variable X1 and variable X2, but not about the collinearity between X1 and a linear combination of the other X's. The point I would add is that variable selection is a tricky subject, even for a fixed effects model. For a mixed model, it is even more problematic. What you might consider is to use a LASSO based method, treating all factors as fixed during the selection, and then denoting as random those that appropriately define a broader inference space. SteveDenham
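One way the two-stage idea could be sketched is shown below. Everything here is hypothetical: the dataset (study), response (y), candidate covariates (x1 to x5), the subject identifier (subject) and the assumption that x1 and x3 happen to survive selection. The LASSO step ignores the repeated-measures correlation, so it is a screening device only.

```sas
/* Stage 1: LASSO screening with all candidate covariates treated as fixed.
   Hypothetical names: study, y, x1-x5 */
proc glmselect data=study seed=20210302;
  model y = x1 x2 x3 x4 x5
        / selection=lasso(choose=cv stop=none) cvmethod=random(5);
run;

/* Stage 2: refit the retained covariates in a mixed model, adding the
   random effect that defines the inference space.
   Assumes, for illustration only, that x1 and x3 were retained. */
proc glimmix data=study;
  class subject;
  model y = x1 x3 / solution;
  random intercept / subject=subject;
run;
```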

SteveDenham · ‎03-01-2021

Well, I didn't go through all 300+ pages of data, but I did see a lot of what I referred to as "identicalness" in just the first subject. There are levels of subscale and item2 which are identical once you drill down to the levels specified by your BY statement. For instance, for the first subject, all values of RATING are 1, except for subscale n1, item 4. Similar patterns are seen for the second data point for the first subject, where all values of RATING are 1 - no variability. Given this, try METHOD=TYPE1 which should fit the terms sequentially, or loosen your restriction on the BY variables, so that some variability can be seen at the level you are examining. SteveDenham

SteveDenham · ‎03-01-2021

One way to possibly identify the source of the group level without variability would be to look at a cross tab using PROC FREQ (with the data sorted by the BY variables) such as: PROC FREQ DATA=stacked; BY ID ID_Day ID_Day_Point; TABLES Item2*Subscale*Rating; RUN; If it appears that there is no systematic "missingness" or "identicalness" then you may want to consider a method for estimating the variance components that allows for negative values (any of the other methods - TYPE1, MIVQUE0 or ML) and see if the same error arises. SteveDenham
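As a complement to the cross tab, a quick sketch for flagging the BY groups in which Rating does not vary at all. It uses the dataset and variable names from this post (stacked, ID, ID_Day, ID_Day_Point, Rating) and assumes Rating is numeric; the output dataset name (rating_var) and its variables (n_obs, var_rating) are made up for the example.

```sas
/* Variance of Rating within each ID x ID_Day x ID_Day_Point group */
proc means data=stacked noprint nway;
  class ID ID_Day ID_Day_Point;
  var Rating;
  output out=rating_var n=n_obs var=var_rating;
run;

/* Groups with zero variance, or with a single observation (variance missing) */
proc print data=rating_var;
  where var_rating = 0 or var_rating = .;
run;
```

If most groups show up in this listing, loosening the BY variables is probably the more productive route.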

Date Last Visited	‎03-19-2026 03:00 PM

Re: Assessing Variable Redundancy for Mixed Effects Modeling

Re: Assessing Variable Redundancy for Mixed Effects Modeling

Re: Randomized block design and meaning of LSMEAN/STDERR

Re: Help with Restricted Cubic Splines : Code Optimization and Graphic...

Re: PROC POWER for Cox regression

Re: Repeated measures model executes in MIXED but not in GLIMMIX

Re: Model heteroscedasticity directly or use log transformation

Re: Model heteroscedasticity directly or use log transformation

Re: Repeated measures model executes in MIXED but not in GLIMMIX

Re: Repeated measures model executes in MIXED but not in GLIMMIX

Re: Passing TESTVALUE option in LSMESTIMATE statement in glimmix

Re: What is the "estimate" in the SolutionR output of the proc mixed. ...

Re: Assessing Variable Redundancy for Mixed Effects Modeling

Re: SAS OnDemand Outpm option in Proc Mixed

Re: question about lsmean pdiff=;option in proc glm step, st102d03, SA...

Re: Proc mixed, defining data structure for desired comparison (Random...

Re: Assessing Variable Redundancy for Mixed Effects Modeling

Re: Assessing Variable Redundancy for Mixed Effects Modeling

Re: Help with Restricted Cubic Splines : Code Optimization and Graphic...

Re: Repeated measures model executes in MIXED but not in GLIMMIX

Re: How to analyze RCBD with 3 factors

Re: GLIMMIX Multinomial GLMM reports all random effects as zero for on...

Re: GLIMMIX Multinomial GLMM reports all random effects as zero for on...

Re: Simultaneous estimation of yield curves

Re: How to analyze the data for measuring multiply site from same pati...

Re: Reporting GLIMMIX random effects in a research journal article

Re: How to analyze the data for measuring multiply site from same pati...

Re: Reporting GLIMMIX random effects in a research journal article

Re: PROC GLIMMIX Categorical Mean Difference

Re: proc PHREG takes too long to assess proportional hazards assumptio...

Re: How to estimate standard error for the reference category in proc ...

Re: A question about quasi-complete separation

Re: covariates selection and multicollinearity in repeated measures ap...

Re: Floating Point Zero Divide Error in VARCOMP

Re: Floating Point Zero Divide Error in PROC VARCOMP

SAS Analytics Explorers