About SteveDenham

SteveDenham · ‎04-19-2022

Could you share the first part of the output for both attempts? The part I would like to see is the material up to and including the iteration history. Thanks, SteveDenham

SteveDenham · ‎04-19-2022

Great answer @StatsMan . I had forgotten all about PROC SIMNORMAL. SteveDenham

SteveDenham · ‎04-19-2022

That code will not run, as method=quad does not support a RANDOM residual structure. Two choices: remove the residual option from the new random statement, or leave it in and drop back to the default pseudo-likelihood method. The first will coincide with more of the published literature, while the second may be more appropriate in controlling Type I error (see Stroup & Claassen, https://econpapers.repec.org/RePEc:spr:jagbes:v:25:y:2020:i:4:d:10.1007_s13253-020-00402-6 ). The paper is paywalled; I have a copy that I seem to have been able to access through my ASA membership. SteveDenham

SteveDenham · ‎04-18-2022

Sure. Make sure you have specified the G option in your RANDOM statement. Then ODS output can be used for this. You have:

ods output CovParms=CovParms SolutionF=SolutionF;

Change this to:

ods output CovParms=CovParms SolutionF=SolutionF G=Gmatrix;

and you will have a dataset called Gmatrix. NOTE: This may look unusual (lower triangular in long form or some such). If it looks right, you can use it directly in code for simulating MVN data (I hope that is correct, @Rick_SAS). The same applies to the V matrix - just be sure you have the V option specified for the RANDOM statement. SteveDenham
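For anyone following along, here is a minimal sketch of how the pieces might fit together in PROC MIXED. The data set and variable names (mydata, y, trt, time, id) are hypothetical stand-ins, not taken from the original question, and the random-effects structure is only an assumption for illustration.

```sas
/* Hypothetical names: mydata, y, trt, time, id */
proc mixed data=mydata;
   class id trt;
   model y = trt time / solution;
   /* G and V request the estimated G matrix and a block of the V matrix */
   random intercept time / subject=id type=un g v;
   /* Capture the displayed tables as data sets */
   ods output CovParms=CovParms SolutionF=SolutionF G=Gmatrix V=Vmatrix;
run;

/* Inspect the layout before using the matrix in a simulation */
proc print data=Gmatrix;
run;
```

By default the V option shows only the block for the first subject, so check the printed layout before feeding either data set into simulation code.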

SteveDenham · ‎04-18-2022

The first question in my mind is about the macro code RCS_Reg.sas. What is going on with it? Without access to that code, I don't see how we can help you accomplish what you want. Perhaps we could use PROC TPSPLINE, or the EFFECT and EFFECTPLOT statements in one of the PROCs that support them, such as GENMOD. The second example in the EFFECTPLOT documentation might serve as a model. The reason I am jumping to that example is that you get a plot of the dependent variable as a function of the other variables, rather than a plot of the difference as a function of one of the variables. The issue with that particular approach is how to deal with the other variables in the model. That brings me to PROC SGPLOT with the PBSPLINE statement. Try something like:

proc sgplot data=sashelp.gas;
   pbspline y=log_valu x=meat / nknots=3;
run;

SteveDenham

SteveDenham · ‎04-18-2022

You might try a google search on "Bayesian methods for diagnostic tests" to see what other folks in this area may have tried. That 10% incidence of the Xray variable may make this very difficult to establish a relationship, especially in terms of how many subjects might be needed to get reasonable credible intervals on the probability. SteveDenham

SteveDenham · ‎04-18-2022

Let's go through these by number:

1. Does my action plan for addressing this exploratory question seem correct?
Not really sure why you collapsed a multinomial response down to a binomial, but I think your approach is defensible. It seems like a good start, in any case.

2. Am I correct in keeping only those participants that have both a baseline visit, as well as visit 5 assessment for the final analysis dataset, and excluding the rest? Or is this problematic? If yes, what are some correct alternatives?
This is certainly a case where, if the data are MNAR (missing not at random), you might want to restrict the analysis to the complete records. However, if your data are MAR or MCAR, the maximum likelihood analysis can handle the missing values.

3. Are there any pre-modeling visualization techniques that I can/should use to further explore my data? Is it ok to use boxplots to look at the distribution of my continuous variable at each level of the binary outcome? Should I maybe use point-biserial correlation first to see if there’s any evidence of a relationship at all between my predictor and dependent variable before fitting the model? If yes, is there such a thing as point-biserial correlation for repeated measures data, or should I just use the baseline values of the variables?
What do you expect to learn from the boxplots? The point-biserial issue can be addressed by a cluster approach: plot time vs. the independent variable with the binary outcome as two different colors. See the second example in the PROC FASTCLUS documentation as an approach.

4. Is my model setup correct/complete?
Your model assumes that the values at the various visits are not correlated. You may wish to impose some sort of covariance structure on visit.

5. How can I check to see if my model fits the data well? I know that for regular linear models, there’s residual plots, QQ plots, check for outliers and influential points etc. But not sure what kind of model diagnostics are best for GLMMs?
This is undoubtedly one of the hardest questions in GLMMs. You have two outcomes, so you may want to look at cutpoints in your model for classification of false positives and false negatives (ROC curve). GLIMMIX doesn't do this automatically like LOGISTIC, but I am sure there are examples on the web of how to do this with DATA steps and PROC SGPLOT.

6. Any other suggestions/recommendations?
Not at this stage, but eventually you will probably want to work with the multinomial response. At that point, things get much fuzzier (in my opinion). SteveDenham
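As one possible route to the ROC curve mentioned above, the sketch below saves predicted probabilities from GLIMMIX and hands them to PROC LOGISTIC with the NOFIT option and ROC statements. This is a different technique from the DATA step and SGPLOT approach named in the post, and every name in it (mydata, y, x, id, p_cond, p_marg) is a hypothetical placeholder, as is the random-intercept model.

```sas
/* Hypothetical names: mydata, y (0/1 response), x, id */
ods graphics on;

proc glimmix data=mydata;
   class id;
   model y(event='1') = x / dist=binary link=logit solution;
   random intercept / subject=id;
   /* Predicted probabilities with and without the random-effect predictions */
   output out=gmxout pred(ilink)=p_cond pred(noblup ilink)=p_marg;
run;

/* NOFIT skips model fitting; each ROC statement scores a supplied predictor */
proc logistic data=gmxout;
   model y(event='1') = / nofit;
   roc 'GLIMMIX conditional' pred=p_cond;
   roc 'GLIMMIX marginal'    pred=p_marg;
run;
```

The conditional curve uses the subject-level predictions and will usually look more optimistic than the marginal one, so it is worth reporting which of the two is being shown.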

SteveDenham · ‎04-18-2022

It will be easier to explain how to get a G matrix from the solution if you include the G option after the slash in the RANDOM statement. It may even be obvious where each estimate fits, at least for UN and CS. It is definitely more difficult for more complex variance-covariance structures. SteveDenham

SteveDenham · ‎04-14-2022

Recall that for a Poisson distribution the mean equals the variance. By specifying RANDOM residual without indicating a subject you have confounded the R side variance with the residual variance. With 3 visits per subject, perhaps this code will help (no guarantee, though):

proc glimmix data=mydata;
   class id Gender visit_no;
   nloptions maxiter=2000 tech=nrridg;
   model Length_of_ICU_stay = gender Age varA visit_no / dist=poi link=log solution;
   random visit_no / type=cs residual subject=id;
   output out=gmxout pearson=pearson;
run;

/* Or this, for a G side approach */
proc glimmix data=mydata method=laplace;
   class id Gender visit_no;
   nloptions maxiter=2000 tech=nrridg;
   model Length_of_ICU_stay = gender Age varA / dist=poi link=log solution;
   random visit_no / type=cs subject=id;
   output out=gmxout pearson=pearson;
run;

I chose a compound symmetry variance structure as I doubt that the visits are equally spaced in time, even if indexed as 1, 2 and 3. A heterogeneous compound symmetry (csh) may give a superior fit, but simpler is usually better and has a higher chance of converging. I also added an NLOPTIONS statement that could result in convergence, if you are hitting up against the default 20 iteration limit. SteveDenham.

SteveDenham · ‎04-08-2022

The only way I can think of is to use PROC GLMMOD to create a dataset with dummy variables for the main effects and interactions, and then use that data set as input to PROC CALIS. SteveDenham
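A rough sketch of the first half of that idea is below. The data set, response, and factor names (mydata, y, a, b) are hypothetical, and the PROC CALIS step is left out because its specification depends entirely on the structural model being fit.

```sas
/* Hypothetical names: mydata, y, a, b */
proc glmmod data=mydata outdesign=design outparm=parms noprint;
   class a b;
   model y = a b a*b;
run;

/* parms maps each design column (Col1, Col2, ...) to its effect and levels */
proc print data=parms;
run;

/* design holds the response plus the dummy-coded columns for later use */
proc print data=design(obs=10);
run;
```

The design matrix is not full rank, so some columns would probably need to be dropped before they are used as variables in CALIS.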

SteveDenham · ‎04-06-2022

Check the GENMOD documentation. In the Details section, it gives the methods for calculating confidence intervals by the two available methods (likelihood ratio and Wald). Neither of those uses the closed formulas you present. SteveDenham

SteveDenham · ‎04-06-2022

Specifying TYPE=HF in the REPEATED statement can accomplish this. However, I suspect you missed the point I was trying to make. Consider other covariance structures: I really doubt that HF will lead to the smallest corrected AIC, so it will probably not be the best fit for your model. Choosing the structure this way avoids making a sphericity assumption, which is really not necessary (see the chapter on Repeated Measures in any edition of SAS for Mixed Models for a discussion). SteveDenham
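One way to compare structures on corrected AIC is sketched below. The macro name fitcov, the data set, and the variables (mydata, y, trt, time, id) are all hypothetical, and the four structures listed are only examples of candidates one might try.

```sas
/* Hypothetical names: fitcov, mydata, y, trt, time, id */
%macro fitcov(type=, out=);
   proc mixed data=mydata;
      class id trt time;
      model y = trt time trt*time;
      repeated time / subject=id type=&type;
      ods output FitStatistics=&out;
   run;
%mend fitcov;

%fitcov(type=hf,    out=fit_hf)
%fitcov(type=cs,    out=fit_cs)
%fitcov(type=ar(1), out=fit_ar1)
%fitcov(type=un,    out=fit_un)

/* Stack the fit statistics and keep only the AICC rows */
data fit_all;
   length structure $8;
   set fit_hf(in=a) fit_cs(in=b) fit_ar1(in=c) fit_un(in=d);
   if a then structure = 'HF';
   else if b then structure = 'CS';
   else if c then structure = 'AR(1)';
   else if d then structure = 'UN';
   if index(Descr, 'AICC') > 0;
run;

proc sort data=fit_all;
   by Value;
run;

proc print data=fit_all;
   var structure Descr Value;
run;
```

Because the fixed effects are identical across fits, the REML-based AICC values are comparable, with the smallest value listed first.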

SteveDenham · ‎04-05-2022

I can answer the second question. In the ESTIMATE statement, the value entered is a multiplier of the corresponding parameter estimate. Think of a matrix product where L'(beta) is the estimate you want to calculate. The entries in the L matrix are what you enter into the ESTIMATE statement, so intercept 1 gives you 1*(beta hat for intercept). SteveDenham
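A small illustration, using the SASHELP.CLASS sample data rather than the data from the question (the model itself is only an assumed example):

```sas
proc genmod data=sashelp.class;
   model weight = height / dist=normal;
   /* L = (1, 0): returns 1*(beta hat for intercept) */
   estimate 'Intercept only' intercept 1;
   /* L = (1, 60): returns 1*(intercept) + 60*(slope), the fitted mean at height 60 */
   estimate 'Mean at height 60' intercept 1 height 60;
run;
```

Each coefficient after an effect name is one entry of L, and any effect left out of the statement gets a coefficient of zero.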

SteveDenham · ‎04-05-2022

One of the reasons for using PROC MIXED is that the F tests are valid whether or not the sphericity assumption is met, provided a valid covariance structure is fit and that there are no issues with the convergence. If you are truly concerned about sphericity, then you should stick with PROC GLM, and not use ML/REML methods. SteveDenham

SteveDenham · ‎03-30-2022

This approach is great and @SAS-questioner should consider it. There is a lot of concern about assumptions that have been brought up that really will not affect what conclusions can be drawn. One change to consider is to only include main effects and two-way interactions in the MODEL statement. Including the three-way is the cause of many of the WARNINGs seen so far. One thing I want to emphasize to @SAS-questioner : DO NOT USE PROC GLM TO TEST HYPOTHESES IN THE SPLIT-PLOT (REPEATED MEASURES) DESIGN, WITHOUT BEING PREPARED TO DO POST PROCESSING. The wrong denominator with wrong degrees of freedom is used in the tests for the whole plot effects. See any of the editions of SAS for Mixed Models for coverage of this. This design is very much in the wheelhouse of generalized estimating equations (GENMOD and GEE), and I believe your best marginal analysis will be found using that approach, while your best conditional analysis is through one of the mixed model procedures. SteveDenham
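To make the marginal versus conditional distinction concrete, here is a hedged sketch of both analyses. The names (mydata, y, group, cond, time, id), the normal distribution, and the working correlation and random-effects choices are all assumptions for illustration, not details from the thread.

```sas
/* Hypothetical names: mydata, y, group, cond, time, id */

/* Marginal (population-averaged) analysis through GEE.
   The @2 keeps main effects and two-way interactions only. */
proc genmod data=mydata;
   class id group cond time;
   model y = group|cond|time@2 / dist=normal type3;
   repeated subject=id / type=exch corrw;
run;

/* Conditional (subject-specific) analysis through a mixed model */
proc mixed data=mydata;
   class id group cond time;
   model y = group|cond|time@2 / ddfm=kr;
   random intercept / subject=id;
run;
```

The exchangeable working correlation and the random intercept are the simplest choices in each framework; richer structures may be warranted once the simple versions run cleanly.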


Re: Assessing Variable Redundancy for Mixed Effects Modeling

Re: Assessing Variable Redundancy for Mixed Effects Modeling

Re: Randomized block design and meaning of LSMEAN/STDERR

Re: Help with Restricted Cubic Splines : Code Optimization and Graphic...

Re: PROC POWER for Cox regression

Re: Repeated measures model executes in MIXED but not in GLIMMIX

Re: Model heteroscedasticity directly or use log transformation

Re: Model heteroscedasticity directly or use log transformation

Re: Repeated measures model executes in MIXED but not in GLIMMIX

Re: Repeated measures model executes in MIXED but not in GLIMMIX

Re: Passing TESTVALUE option in LSMESTIMATE statement in glimmix

Re: What is the "estimate" in the SolutionR output of the proc mixed. ...

Re: Assessing Variable Redundancy for Mixed Effects Modeling

Re: SAS OnDemand Outpm option in Proc Mixed

Re: question about lsmean pdiff=;option in proc glm step, st102d03, SA...

Re: Proc mixed, defining data structure for desired comparison (Random...

Re: Assessing Variable Redundancy for Mixed Effects Modeling

Re: Assessing Variable Redundancy for Mixed Effects Modeling

Re: Help with Restricted Cubic Splines : Code Optimization and Graphic...

Re: Repeated measures model executes in MIXED but not in GLIMMIX

Re: Proc mixed, Convergence criteria met but final Hessian is not posi...

Re: Construct G matrix for a mixed model with random effects

Re: correlation between binary and continuous variable with PROC GLIMM...

Re: Construct G matrix for a mixed model with random effects

Re: Restricted cubic splines in Multivarate Linear Regression

Re: Is there any Statistic method which could reflect the diagnose val...

Re: correlation between binary and continuous variable with PROC GLIMM...

Re: Construct G matrix for a mixed model with random effects

Re: Deciphering a Note in the SAS Log When Running PROC GLIMMIX with p...

Re: Proc Calis: SEM with interaction term for binary latent variables

Re: Calculation of confidence intervals for estimate intercept

Re: SAS Proc Mixed for Repeated Measures Design with G-G and H-F adjus...

Re: Calculation of confidence intervals for estimate intercept

Re: SAS Proc Mixed for Repeated Measures Design with G-G and H-F adjus...

Re: If I have multiple group within person, how could I conduct repeat...

SAS Analytics Explorers