Re: Differences in lsmean

ZC · Posted 05-22-2013 11:58 AM

Hello,

I have a stacked data set mydata with the structure below:

1. some subjects are measured repeatedly and others are not;

2. sbp_change = sbp_v2 - sbp_v1;

id  group  sbp_v1  sbp_v2  sbp_change
1   1      113     116      3
1   1      110     119      9
2   1      124     115     -9
3   2      125     126      1
3   2      126     134      8
...

%macro mean (a, b, c, d ,e, f, g, h);

PROC MIXED DATA = mydata METHOD=ML ABSOLUTE CONVF = 0.00001 NOCLPRINT;

CLASS ID group;

MODEL &a = group/NOINT;

LSMEANS group /CL;

REPEATED INT / TYPE = CS SUBJECT = ID;

RUN;

%mend;

%mean (sbp_v1)

%mean (sbp_v2)

%mean (sbp_change)

Theoretically, the lsmean of sbp_change should equal the difference between the lsmeans of sbp_v2 and sbp_v1. But my analysis results show that they are not equal. What is the reason?

Thanks,

ballardw · Posted 05-22-2013 01:39 PM

Any chance that sbp_v1 and sbp_v2 have missing values? If so, is only one of them missing for some records?

Your sbp_change would be missing when either of the other two is missing. So there could be different numbers of records used in each model.
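
One quick way to check this is sketched below. It assumes the data set and variable names from the original post (mydata, sbp_v1, sbp_v2, sbp_change) and nothing else about the data.

```sas
/* Count non-missing (N) and missing (NMISS) values for each response.
   Data set and variable names are taken from the original post. */
proc means data=mydata n nmiss;
  var sbp_v1 sbp_v2 sbp_change;
run;

/* List records where one or two of the three values are missing,
   i.e. records that would be used by some models but not others. */
proc print data=mydata;
  where nmiss(sbp_v1, sbp_v2, sbp_change) in (1, 2);
run;
```

If the three N values agree and the PROC PRINT step lists no records, the three models are using the same records.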

ZC · Posted 05-22-2013 01:46 PM

Thanks for your response. In my data, all three variables (sbp_v1, sbp_v2 and sbp_change) are either missing or non-missing at the same time.

SteveDenham · Posted 05-22-2013 01:50 PM

Well, that shoots down my major idea as to what was causing the non-equality.

Can you share the lsmeans (and standard errors) for each of the three variables?

ZC · Posted 05-22-2013 02:23 PM

Results from the current SAS syntax:

mean (SE)

sbp_v1: 110.18 (0.42)

sbp_v2: 111.68 (0.58)

sbp_change: 0.67 (0.47)

If I remove "subject = id" from the current SAS syntax, the lsmean of sbp_change equals the difference between the lsmeans of sbp_v2 and sbp_v1:

mean (SE)

sbp_v1: 109.43 (0.48)

sbp_v2: 110.43 (0.63)

sbp_change: 1.00 (0.51)

SteveDenham · Posted 05-23-2013 08:47 AM

Then the reason must be in the use of the REPEATED statement, and the implied ordering within subject. The off-diagonal correlations aren't the same, but I was under the impression that this should only affect the standard errors. Obviously, it means that the marginal and conditional estimates are not the same. Without subject=id, the estimates are marginal; with subject=id, they are conditional on the ordering. (I THINK.) If you wanted to check on this, you could reorder the observations within subject and see if it had any effect.

What happens if you replace the REPEATED with RANDOM? The code would look like:

PROC MIXED DATA = mydata METHOD=ML ABSOLUTE CONVF = 0.00001 NOCLPRINT;

CLASS ID group;

MODEL &a = group/NOINT;

LSMEANS group /CL;

RANDOM INT / TYPE = CS SUBJECT = ID;

RUN;

Message was edited by: Steve Denham

deb193 · Posted 05-23-2013 09:58 AM

If the net result of missing data is that not all subjects have the same number of observations, I think this alone can produce different LSMEANS.
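
A rough sketch of why unequal numbers of observations per subject would matter, assuming the REPEATED statement fits a compound-symmetry structure with a common within-subject correlation $\rho$. The notation is introduced here for illustration and is not from the posts above: $n_i$ is the number of observations for subject $i$ and $\bar{y}_i$ is that subject's mean. Under those assumptions, the generalized least squares estimate of a group mean is a weighted average of the subject means in that group:

$$
\hat{\mu} = \frac{\sum_i w_i \, \bar{y}_i}{\sum_i w_i},
\qquad
w_i = \frac{n_i}{1 + (n_i - 1)\rho}.
$$

With $\rho = 0$ (no subject=id), $w_i = n_i$ and $\hat{\mu}$ is the plain mean over all observations, so the estimate for sbp_change is exactly the estimate for sbp_v2 minus the estimate for sbp_v1. When every subject has the same $n_i$, the weights are all equal and cancel, so the identity also holds. When the $n_i$ differ, the weights depend on $\rho$, and $\rho$ is estimated separately in each of the three models, so each response gets a different weighting of the subjects and the three estimates no longer have to line up.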
