Statistical Procedures

Programming the statistical procedures from SAS
BookmarkSubscribeRSS Feed
gkmzinsou
Calcite | Level 5

Hello,

 

Is it possible to have to an ANCOVA with a change score as a dependent variable with adjustment for baseline?

 

Y1 = pre-test

Y2 = post-test

Grp 1 = Treatment 1 (ref)

Grp 2 = Treatment 2

Grp 3 = Treatment 3

Change = Y2 - Y1 

 

ANCOVA model :  Change = Bo + B1*Y1 + B2*Grp2 +B2*Grp3 

(Knowing that I have 3 treatment groups)

 

Thanks!

6 REPLIES 6
Ksharp
Super User
No. ANOVA hypothesis is iid (i.e. independent individual ). But your 'change' is correlation or paired .
I think you should use PROC MIXED or PROC GLIMMIX .

or try MANOVA , put both y1 and y2 as dependent variables in model .
SteveDenham
Jade | Level 19

From Frank Harrell's online version of Regression Modeling Strategies, I pulled this from the annotated bibliography:

 

Kung-Yee Liang and Scott L. Zeger.“Longitudinal Data Analysis of Continuous and Discrete Responses for Pre-Post
Designs”. In: Sankhy¯a 62 (2000). makes an error in assuming the baseline variable will have the same univariate
distribution as the response except for a shift; Baseline may have for example a truncated distribution based on a
trial’s inclusion criteria. If correlation between baseline and response is zero, ANCOVA will be twice as efficient as
simple analysis of change scores;if correlation is one they may be equally efficient, pp. 134–148 (cit. on p. 7-5).

 

That last sentence lets you know that fitting a change score, with the baseline as a covariate, is never as good as fitting the actual data.  Once you get your marginal means for treatment groups, you can calculate change from baseline.

 

Another way to think about this is to just do some simple algebraic rearrangement.  You have this model:

 

Change = Bo + B1*Y1 + B2*Grp2 +B2*Grp3.  Plugging in the definition of Change, you get:

Y2 - Y1 = Bo + B1*Y1 + B2*Grp2 +B2*Grp3.  Now add Y1 to both sides, and get:

Y2 = Bo + B1*Y1 + B2*Grp2 +B2*Grp3 + Y1.  Rearranging terms, this gives:

Y2 = Bo + (B1 + 1)*Y1 + B2*Grp2 +B2*Grp3.  Redefine B1 + 1 as B1' and you get:

Y2 = Bo + B1'*Y1 + B2*Grp2 +B2*Grp3.  But this holds if and only if there is no correlation between Y1 and Y2.  If there is correlation (and there usually is in most pre/post designs), then the estimates for B2 and B3 will be biased as a function of the amount of correlation in the response variables after removing the true effects of B2 and B3..

 

SteveDenham

jiltao
SAS Super FREQ

This is interesting. Thanks for sharing! What I am not sure about. is what exactly is the issue if the dependent variable and one of the covariates is correlated? 

SteveDenham
Jade | Level 19

Of course we would like to see that the dependent variable is correlated with the independent covariate. Otherwise, it really doesn't add anything to the analysis. The problem I see in this case is that some of the correlation is artificially induced mathematically.  I believe the problem gets worse when there is an interaction between the covariate and any categorical variables in the model - there is a greater chance of calling regression to the mean a covariate by class level interaction.

 

Shoot - it's just that I was taught someplace along the line to not fit both a change score (equivalent to a regression coefficient of -1) and the pretreatment baseline (with regression coefficient = beta_hat).

 

SteveDenham

Ksharp
Super User
That is lead to Analysis of Covariance ? Change Y1 to the right side ? Great idea !
jiltao
SAS Super FREQ

You can use PROC GLM for this --

proc glm data=yourdata;

   class group (ref="1") ;

   model change = baseline group;   ** for an unequal slope model, add the interaction baseline*group;

run;

quit; 

sas-innovate-white.png

Our biggest data and AI event of the year.

Don’t miss the livestream kicking off May 7. It’s free. It’s easy. And it’s the best seat in the house.

Join us virtually with our complimentary SAS Innovate Digital Pass. Watch live or on-demand in multiple languages, with translations available to help you get the most out of every session.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 6 replies
  • 1694 views
  • 4 likes
  • 4 in conversation