From Frank Harrell's online version of Regression Modeling Strategies, I pulled this from the annotated bibliography:
Kung-Yee Liang and Scott L. Zeger.“Longitudinal Data Analysis of Continuous and Discrete Responses for Pre-Post Designs”. In: Sankhy¯a 62 (2000). makes an error in assuming the baseline variable will have the same univariate distribution as the response except for a shift; Baseline may have for example a truncated distribution based on a trial’s inclusion criteria. If correlation between baseline and response is zero, ANCOVA will be twice as efficient as simple analysis of change scores;if correlation is one they may be equally efficient, pp. 134–148 (cit. on p. 7-5).
That last sentence lets you know that fitting a change score, with the baseline as a covariate, is never as good as fitting the actual data. Once you get your marginal means for treatment groups, you can calculate change from baseline.
Another way to think about this is to just do some simple algebraic rearrangement. You have this model:
Change = Bo + B1*Y1 + B2*Grp2 +B2*Grp3. Plugging in the definition of Change, you get:
Y2 - Y1 = Bo + B1*Y1 + B2*Grp2 +B2*Grp3. Now add Y1 to both sides, and get:
Y2 = Bo + B1*Y1 + B2*Grp2 +B2*Grp3 + Y1. Rearranging terms, this gives:
Y2 = Bo + (B1 + 1)*Y1 + B2*Grp2 +B2*Grp3. Redefine B1 + 1 as B1' and you get:
Y2 = Bo + B1'*Y1 + B2*Grp2 +B2*Grp3. But this holds if and only if there is no correlation between Y1 and Y2. If there is correlation (and there usually is in most pre/post designs), then the estimates for B2 and B3 will be biased as a function of the amount of correlation in the response variables after removing the true effects of B2 and B3..
SteveDenham
... View more