About cmo5

cmo5 · ‎01-06-2020

In my previous post, I meant to say that LSMEANS does not work in "multinomial" rather than "multivariate" models.

cmo5 · ‎01-06-2020

Hi Paige, Thanks for the reply. Here's my edited syntax and the error that I received in GLIMMIX when specifying LSMEANS. proc glimmix data=three method=laplace empirical=mbn noitprint noclprint ; where PacketN>2; class Subj Female AscendLimb; model DriveDist (order=data)= Female cBase_AIDattitudes AscendLimb / DIST=MULTINOMIAL LINK=CLOGIT SOLUTION CL ddfm=bw oddsratio(DIFF=LAST LABEL); random intercept / sub=Subj TYPE=VC; lsmeans Female AscendLimb / ilink ; covtest / WALD; run;

cmo5 · ‎01-05-2020

Hi Paige, Thank you for the quick and helpful response. It looks like the LSMEANS statement doesn't work in multivariate models. I plan to use the class statement for the dichotomous independent variables. It seems that the ODDSRATIO statement may give the same results for contrasting males vs. females and ascending limb vs. descending limb. The ESTIMATE statement might be another option, though I'm not certain on the specifics on this yet. The reason I'm interested in the intercepts is that I'm hoping to statistically examine the distribution of responses across levels of the DV (distance participants were willing to drive while intoxicated, which has 4 levels: 0 miles, 1 mile, 3 miles, 10 miles). For example, when blood alcohol level is descending (when limb = 1, descending) how much further are participants willing to drive? That is, after accounting for the IVs in the model, is the "typical participant" willing to drive 3 or more miles on the descending limb, but only 0 or 1 mile on the ascending limb? If I center on the limb I'm interested in and center other variables at their means, can the intercepts be interpreted this way in a GLMM? I realize that the cumulative logit model is calculating the effect across all levels of the DV, but is there a way to distinguish between levels of the DV (i.e., proportion of participants at a certain level of the DV or below is more than the other levels of the DV). This SAS paper (https://support.sas.com/resources/papers/proceedings15/3430-2015.pdf) makes me think that I can interpret the intercepts, but I'm still a still a little uncertain whether I should be doing this, which levels of the DV the intercepts would be distinguishing, if so, and whether the p values associated with the intercepts are informative in drawing conclusions about what the "typical participant" (all IVs centered) did.

cmo5 · ‎01-03-2020

I'm wondering if there are any recommendations on whether to code two-level categorical variables as continuous variables or class variables (dummy coded) in a PROC GLIMMIX model with a multinomial outcome. I would prefer to use effect coding with sample-centered IVs so that the intercepts are easier to interpret, but I've found mixed information on whether it's appropriate to interpret intercepts at all in multinomial models. My data have two levels, drinking observations measured across 6 timepoints (3 on ascending limb of blood alcohol curve, 3 on descending limb) nested within subjects. Driving Dist = how far participants were willing to drive, 4 level DV (0 miles, 1 mile, 3 miles, 10 miles) cSex = sex, centered on grand mean (unequal sample sizes of men and women, so mean is near, but not exactly zero) cMale = sex, centered on males = 0 (females = 1) cBase_Attitudes = baseline attitudes about drinking and driving, continuous IV Limb = limb of blood alcohol curve, centered on ascending limb = 0 (descending = 1) Here's two versions of my syntax: *Effects coding.; proc glimmix data=three method=laplace empirical=mbn noitprint noclprint ; where PacketN>2; class Subj ; model DriveDist (order=data)= cSex cBase_AIDattitudes Limb / DIST=MULTINOMIAL LINK=CLOGIT SOLUTION CL ddfm=bw oddsratio(DIFF=LAST LABEL); random intercept / sub=Subj TYPE=VC; covtest / WALD; run; *Dummy coding.; proc glimmix data=three method=laplace empirical=mbn noitprint noclprint ; where PacketN>2; class Subj cMale Limb; model DriveDist (order=data)= cMale cBase_AIDattitudes Limb / DIST=MULTINOMIAL LINK=CLOGIT SOLUTION CL ddfm=bw oddsratio(DIFF=LAST LABEL); random intercept / sub=Subj TYPE=VC; covtest / WALD; run; My questions are: 1) Is it appropriate to treat sex as a continuous IV centered on the sample, so that the effects are interpreted as when accounting for sex (and the unequal weighting in the sample), rather than dummy-coding (which results in interpreting other effects as for only men or women)? 2) Is it appropriate to treat limb of the blood alcohol curve as a continuous IV, centered on whichever limb I'm interested in? Most likely, I would be interested in interpreting the intercepts (how far participants were willing to drive) on both limbs, and would run the model twice, once when centered on each of the ascending and descending limbs. ***For questions 1 and 2, the -2LL and slopes do not seem to change with either type of coding unless I include a random slope in the models. 3) Is it appropriate to interpret intercepts in multinomial regression similar to regular multivariate regressions (i.e., likelihood of driving x distance when all other predictors are zero). 4) With cumulative logit models, are the intercepts always distinguishing between the highest ordered value and the lower categories? Or are they distinguishing between anything falling above or below a particular cutoff? For example, if 0 miles is the highest ordered category and 10 miles is the lowest, the intercept for "3 miles" the difference between the likelihood of driving: a) 0 miles versus 3 miles or more, OR, b) 0-1 miles versus 3 miles or more? Any recommendations on any of these questions are much appreciated!

cmo5 · ‎12-17-2019

I realize this post is 3 years-old (and the original poster had already resolved the issue him/herself), but I have found myself in a similar situation, stumbling across a SAS community question that perfectly replicates the question I'm trying to sort out, with no final resolution. Just in case others are struggling to find syntax to create a count variable, here is some code to help: First, sort data by your grouping variable and the variable you want counted. ID = grouping variable; Date = ordering variable proc sort data=one; by ID Date; run; Next, create a count variable in a new data set. In this case, each first ID starts at a count of 1, and continues to count up until the next ID. data two; set one; *Create count variable.; Count + 1; by ID; if first.ID then Count = 1; run; If you wanted to create a running count within more than one category, for example, number of observations within each participant and date, you can simply add the additional variable to your count syntax after the "by" statement (e.g., "by ID Date"). If you need something slightly more complicated, for example, counting only certain types of observations, some additional syntax is listed below. data two; set one; *Create drinking observation count.; if Drinking=1 then Count + 1; by ID; if first.ID then Count = 1; run; Here, I created a count of drinking observations (only when drinking = 1, not drinking = 0) within each participant ID.

Online Status	Offline
Date Last Visited	‎01-21-2020 03:15 PM

Re: Effects coding versus dummy coding in with two-level categorical v...

Re: Effects coding versus dummy coding in with two-level categorical v...

Re: Effects coding versus dummy coding in with two-level categorical v...

Effects coding versus dummy coding in with two-level categorical varia...

Re: HELP NEEDED ASAP: Counting consecutive variable values

Re: Effects coding versus dummy coding in with two-level categorical v...

Re: Effects coding versus dummy coding in with two-level categorical v...

Re: Effects coding versus dummy coding in with two-level categorical v...

Effects coding versus dummy coding in with two-level categorical varia...

Re: HELP NEEDED ASAP: Counting consecutive variable values