About SteveDenham

SteveDenham · ‎04-15-2021

Stroup and Claassen recently published a paper comparing pseudolikelihood methods to quadrature methods. In particular, they spend a lot of time on binomial response models. Their findings indicate that in most cases pseudolikelihood (linearization) performs at least as well as, if not better than, quadrature. Here is the reference: Journal of Agricultural, Biological and Environmental Statistics, volume 25, pages 639–656. The paper is freely available if you have an ASA membership, but is otherwise behind a paywall. SteveDenham

SteveDenham · ‎04-06-2021

Thanks @PGStats . That is a great job. I'll go back and edit my post to note that the method does not do what I thought it might. SteveDenham

SteveDenham · ‎04-05-2021

Well, here is something that does, well, something. EDIT: It turns out that the "something" is NOT what I said it might be. See the post by @PGStats for an excellent proof of this.

I create a new weight as the product of a uniform random variate and the weight variable in the dataset divided by the sum of all the weights, sort the dataset in descending order by the new variable, and then select the first 12 id numbers in the sorted dataset.

    data one;
    input Unit_ID weight @@;
    datalines;
    1 237.18 2 567.89 3 118.50 4 74.38 5 1287.23 6 258.10 7 325.36
    8 218.38 9 1670.80 10 134.71 11 2020.70 12 47.80 13 1183.45 14 330.54
    15 780.10 16 895.80 17 620.10 18 420.18 19 979.66 20 810.25 21 670.85
    22 314.58 23 87.50 24 1893.40 25 753.30 26 540.65 27 2580.35 28 230.56
    29 185.60 30 688.43 31 505.14 32 205.48 33 650.42 34 1348.34 35 30.50
    36 2214.80 37 940.35 38 217.85 39 142.90 40 806.90 41 560.72
    ;

    data two;
    set one;
    call streaminit(452021);
    ranno1=rand('uniform');
    ranno2=ranno1*weight;
    run;

    proc means data=one noprint;
    var weight;
    output out=totsamp sum=sum;
    run;

    data combined;
    if _n_=1 then set totsamp;
    set two;
    drop _type_ _freq_;
    relsize=ranno2/sum;
    run;

    proc sort data=combined out=three;
    by descending relsize;
    run;

    data four;
    set three;
    if _n_<=12;
    run;

I am not sure about the optimality of this method at all. Relsize is a product of the (assumed) probability of selection (=ranno1) and the proportion of the total weight each ID contributes (=weight/sum). The first 12 are then the most likely IDs to be selected, and the procedure is such that once an ID is selected, it cannot be selected again.

I suppose iteratively reweighting would be better, which would loop through, selecting the ID with the largest relsize, removing it from dataset one, recalculating the total weight, and the proportion of the total weight, multiplying this by the random number, resorting, selecting the ID with the largest relsize under this condition, removing it, and going through this until 12 IDs had been selected. SteveDenham
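For anyone who wants to try a loop of that general kind, here is a minimal sketch. Note that it swaps in a different selection rule from the one described above: rather than taking the largest relsize, each pass draws one ID with probability proportional to its weight among the IDs still remaining, then removes it. The dataset one and the seed come from the post; the output dataset name sample, the array names, and the hard-coded size of 41 are assumptions for illustration. Draws made this way are proportional to weight at each step, but the overall inclusion probabilities are not exactly proportional to weight.

```sas
/* Sketch only: successive weighted draws without replacement.        */
/* Assumes dataset ONE (Unit_ID, weight) has exactly 41 rows.         */
data sample(keep=draw Unit_ID weight);
   array ids[41] _temporary_;   /* unit IDs                            */
   array w[41]   _temporary_;   /* remaining weights; 0 once drawn     */
   call streaminit(452021);

   /* Load dataset ONE into the arrays */
   do i = 1 to n;
      set one nobs=n point=i;
      ids[i] = Unit_ID;
      w[i]   = weight;
   end;

   do draw = 1 to 12;
      total = sum(of w[*]);            /* total weight still available  */
      u = rand('uniform') * total;     /* random point on (0, total)    */
      cum = 0; k = 0; last = 0;
      do i = 1 to n;
         if w[i] > 0 then do;
            last = i;
            cum = cum + w[i];
            if k = 0 and cum >= u then k = i;   /* first ID covering u  */
         end;
      end;
      if k = 0 then k = last;          /* guard against round-off       */
      Unit_ID = ids[k];
      weight  = w[k];
      output;
      w[k] = 0;                        /* remove the ID from later draws */
   end;
   stop;
run;
```

The arrays keep everything in one DATA step, so there is no need to re-sort or rebuild the dataset after each selection.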

SteveDenham · ‎04-01-2021

Learned something big there - that the summarizing options from PROC MEANS are available in PROC SQL. SteveDenham
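As a small illustration of the point, here is a sketch using the SASHELP.CLASS sample dataset (my choice for the example, not the data from the thread). The summary functions are computed per group with GROUP BY, much like a CLASS statement in PROC MEANS.

```sas
/* Sketch: PROC MEANS-style summaries computed in PROC SQL */
proc sql;
   select sex,
          count(*)     as n,
          mean(height) as mean_height,
          std(height)  as sd_height,
          min(height)  as min_height,
          max(height)  as max_height
   from sashelp.class
   group by sex;
quit;
```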

SteveDenham · ‎03-29-2021

Spoiler alert: This method has not been tried on real data, and might not quite be what you are looking for. I think you can do this using PROC NLMIXED. Check the example in the documentation here: https://documentation.sas.com/?cdcId=pgmsascdc&cdcVersion=9.4_3.4&docsetId=statug&docsetTarget=statug_nlmixed_examples05.htm&locale=en for fitting a failure time/frailty model. What you would have to do is incorporate a segmented model using nutrient level as a continuous variable. Code to get started is in the NLIN example here: https://documentation.sas.com/?cdcId=pgmsascdc&cdcVersion=9.4_3.4&docsetId=statug&docsetTarget=statug_nlin_examples01.htm&locale=en In any case, Mr. Google is your friend. I got several results on a search for "failure time" and "nlmixed", so there should be something out there. SteveDenham

SteveDenham · ‎03-29-2021

@Naviava1973 , I missed that your response was already log transformed. That means that for the first method I mentioned, the lsmeans need to be preprocessed by exponentiating. The same would apply to any confidence bounds. The second method could be applied directly to the lsmeans you currently obtain as well as any confidence bounds. As @jiltao said, you would have to use NLMIXED to get valid standard errors for the means on the original scale. SteveDenham

SteveDenham · ‎03-26-2021

I can think of two ways, but neither is done entirely within PROC MIXED. I'll list them in order of probable appropriateness.

Method 1: Get the lsmeans and differences into datasets, merge them, and post-process: divide each post-treatment lsmean's difference from the pretreatment lsmean by the pretreatment lsmean, and multiply by 100.

Method 2: Shift to PROC GLIMMIX and use a log link. The differences in lsmeans are then on the log scale, so exponentiating them gives ratios, which are "fold" changes. From there, convert to percentage changes by calculating pct_change = 100*(ratio - 1).

What you should NOT do is divide the values by each pretreatment value and analyze the ratios as if they were normal. This ignores all covariance within subject, plus ratios of variables with normal errors generally do not have normal errors. SteveDenham
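The post-processing for Method 2 might look something like the sketch below. It assumes the lsmean differences from a log-link GLIMMIX fit were saved to a dataset, for example with ods output Diffs=log_diffs; and the DIFF and CL options on the LSMEANS statement. The names log_diffs and pct_change are hypothetical. Because the estimates are on the log scale, they are exponentiated before being converted to percentages.

```sas
/* Sketch: convert log-scale lsmean differences to percent changes.   */
/* Assumes LOG_DIFFS has columns Estimate, Lower, Upper (log scale).   */
data pct_change;
   set log_diffs;
   ratio      = exp(Estimate);            /* fold change               */
   pct_change = 100 * (ratio - 1);        /* percent change            */
   pct_lower  = 100 * (exp(Lower) - 1);   /* lower confidence bound    */
   pct_upper  = 100 * (exp(Upper) - 1);   /* upper confidence bound    */
run;
```

Check which level is subtracted from which in each row of the differences table, since that determines the sign of the percent change.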

SteveDenham · ‎03-25-2021

I am going to vote no on being a good fit. You would really like the Chi squared/DF ratio to be close to one, and here it is over 4 million. So either your model or your distribution is inappropriate. SteveDenham

SteveDenham · ‎03-24-2021

This is just a wild-eyed guess, but you might look into using the INSTRUMENTS statement, with 1/sd defined in the dataset as a variable that gets included via the INSTRUMENTS statement. Best bet would be to follow @Ksharp 's advice and post in the Forecasting community. SteveDenham

SteveDenham · ‎03-23-2021

The least squares mean is a point estimate of the population mean for that group. There are no individually calculated values; instead, there is a single solution to matrix equations. The equivalent of a box plot would be a plot of the LS-mean with confidence bounds. If you get very creative, you can overlay the LS-mean estimates onto a box plot of the variables using PROC SGPLOT. You might get an answer for how to do that in the Graphics Programming community. SteveDenham
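A plot of LS-means with confidence bounds could be sketched as follows. This assumes the LS-means were saved to a dataset, for example with ods output LSMeans=lsm; and the CL option on the LSMEANS statement, and that the grouping variable is called trt. Both names are hypothetical.

```sas
/* Sketch: LS-means with confidence limits, one marker per group.     */
/* Assumes LSM has columns trt, Estimate, Lower, Upper.                */
proc sgplot data=lsm;
   scatter x=trt y=Estimate /
           yerrorlower=Lower yerrorupper=Upper
           markerattrs=(symbol=circlefilled);
   xaxis label="Group";
   yaxis label="LS-mean with confidence limits";
run;
```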

SteveDenham · ‎03-23-2021

If you think of this as a non-inferiority test, I think the following may be of interest:

    proc ttest data=weight2 h0=-7 sides=L;
    class species;
    var weight;
    run;

This uses your 'long' dataset. The output shows that the difference is significantly less than -7. SteveDenham

SteveDenham · ‎03-23-2021

A factor model is of the type:

    model Y = A B A*B C A*C B*C A*B*C

which can also be written as

    model Y = A|B|C

A means model is a one-way analysis, where

    model Y = A*B*C

Comparison tests and lower order effects are then obtained from either CONTRAST or LSMESTIMATE statements. So using a factor model, you would have something like this:

    PROC GLIMMIX data=data;
    class site origin block cut tree;
    model recovery = site|origin|cut / ddfm=kr;
    random block block*site block*origin block*cut
           block*site*origin block*site*cut block*origin*cut
           block*origin*site*cut;
    run;

There are 8 random effects here, and unless you have about 10^8 data points, at least one will probably be estimated to be zero. You could test them by using the COVTEST statement with the TESTDATA option. Something to consider is that three-way and higher interactions for random effects are generally indistinguishable from residual error, and fitting them is probably not worthwhile without that really big data set. I hope this helps some. SteveDenham

SteveDenham · ‎03-12-2021

The model presented from 10 years ago is a fairly standard parameterization of the 3 parameter logistic growth model, with asymptotes at 0 (min) and k (max). A common use is plant height as a function of about anything, actually - water availability, fertilizer applied, etc. SteveDenham

SteveDenham · ‎03-11-2021

Well, first off, the MODEL statement you have here isn't linear in the parameters (although it could be if you took the log on both sides), so the standard R-squared is probably not valid. CrossValidated has a bunch of replies about these, but the one that makes the most sense to me is McFadden's likelihood-based pseudo R-squared. To get that, try fitting your model using PROC NLMIXED. The null model that you would compare to would be model cumgerm = ; I chose this as the model because setting b0 to zero makes the right-hand side identically zero for all values of gdd, and this achieves the same thing, I hope. If not, then move the param values to direct code values like the P=0.85, with all set to 0. Once you have both log likelihoods, you can use 1 - LL(fit model)/LL(null model) to calculate the pseudo R-squared. Having both the response and explanatory variables as continuous actually makes this a bit easier. SteveDenham
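The final calculation could be sketched like this. It assumes the fit statistics from the two NLMIXED runs were saved with ods output FitStatistics=fit_stats; and ods output FitStatistics=null_stats; (both dataset names are hypothetical), and that the row label reads '-2 Log Likelihood'. Check the Descr text in your own output. Because both values carry the same factor of -2, their ratio equals LL(fit model)/LL(null model).

```sas
/* Sketch: McFadden pseudo R-squared from two saved fit-statistics tables. */
/* Assumes FIT_STATS and NULL_STATS each have columns Descr and Value.     */
data pseudo_r2;
   merge fit_stats (where=(Descr='-2 Log Likelihood')
                    rename=(Value=m2ll_fit))
         null_stats(where=(Descr='-2 Log Likelihood')
                    rename=(Value=m2ll_null));
   /* the factor of -2 cancels in the ratio */
   mcfadden_r2 = 1 - m2ll_fit / m2ll_null;
   keep m2ll_fit m2ll_null mcfadden_r2;
run;

proc print data=pseudo_r2 noobs;
run;
```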

SteveDenham · ‎03-11-2021

Hi @LaurenMeta, Having your code in text format makes it very difficult to follow or even read in some cases. Could you repost, with your code in the "Insert SAS code" box? The "Insert SAS code" icon is what will trigger that. The </> icon will also work. If you have issues with PROC GLIMMIX, please also show your complete log file, so that we can trace any errors back to the source. SteveDenham


Re: Assessing Variable Redundancy for Mixed Effects Modeling

Re: Randomized block design and meaning of LSMEAN/STDERR

Re: Help with Restricted Cubic Splines : Code Optimization and Graphic...

Re: PROC POWER for Cox regression

Re: Repeated measures model executes in MIXED but not in GLIMMIX

Re: Model heteroscedasticity directly or use log transformation

Re: Repeated measures model executes in MIXED but not in GLIMMIX

Re: Passing TESTVALUE option in LSMESTIMATE statement in glimmix

Re: What is the "estimate" in the SolutionR output of the proc mixed. ...

Re: Assessing Variable Redundancy for Mixed Effects Modeling

Re: SAS OnDemand Outpm option in Proc Mixed

Re: question about lsmean pdiff=;option in proc glm step, st102d03, SA...

Re: Proc mixed, defining data structure for desired comparison (Random...

Re: Assessing Variable Redundancy for Mixed Effects Modeling

Re: Help with Restricted Cubic Splines : Code Optimization and Graphic...

Re: Repeated measures model executes in MIXED but not in GLIMMIX

Re: GLIMMIX 3-level model stops with "insufficient memory" error...2-l...

Re: Selecting a weighted sample without replacement

Re: Help on PROC GENMOD

Re: How to Find the Optimal Level of a Predictor Variable in Proc PHRE...

Re: PROC Mixed treatment effect percentage change

Re: Help on PROC GENMOD

Re: Proc Panel with regression weight

Re: How are LSMeans calculated in SAS?

Re: How to test one-sided non-zero null hypothesis in proc ttest

Re: How to analyze RCBD with 3 factors

Re: nlin&goodness of fit

Re: Pseudo R-square for NLIN

Re: Novice SAS user needs help with PROC GLIMMIX
