BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
Jimbo
Calcite | Level 5

Apologies if this is duplicate question, but I can't find anything similar using search.

I have a data set containing a continuous variable, isotope GFR (which should remain stable) measured at time 0 and at 4 months.  There are three categorical outcome states assigned at four months (stable, improved, deteriorated).  I'm trying to consider differences in baseline variable between outcome groups.

Using scatter and Galton plots it appears there is a degree of regression to the mean in the measured variable.  My thought on accounting for this is to use ANCOVA as below.  However, my stats background is limited and I'd be very grateful for any comments as to the appropriateness of this method and / or advice on how to better handle this.

Many thanks

Jime

PROC GLM DATA=Date;

  CLASS RESPONSE;

  MODEL GFR_0 =RESPONSE (GFR_4-GFR_0)   ;

  LSMEANS RESPONSE / ADJUST=TUKEY PDIFF ;

RUN;

1 ACCEPTED SOLUTION

Accepted Solutions
SteveDenham
Jade | Level 19

Try changing the model statement to:

MODEL GFR_4=RESPONSE GFR_0;

This will give the four month value as adjusted for the time 0 value.

You should probably have a preliminary step, to check the homogeneity of slopes across the RESPONSE categories (see Milliken and Johnson's Analysis of Messy Data III: Analysis of Covariance).

So first fit:

MODEL GFR_4=RESPONSE GFR_0 GFR_0*RESPONSE;

and check the significance of the interaction term.  If it is non-significant, then the MODEL statement I gave at first is appropriate.  If it is significant, then the differences need to be calculated at a minimum of three time zero values (low, median, high) using multiple LSMEANS statements and the AT= option (check the documentation on how to do this).

Steve Denham

View solution in original post

2 REPLIES 2
SteveDenham
Jade | Level 19

Try changing the model statement to:

MODEL GFR_4=RESPONSE GFR_0;

This will give the four month value as adjusted for the time 0 value.

You should probably have a preliminary step, to check the homogeneity of slopes across the RESPONSE categories (see Milliken and Johnson's Analysis of Messy Data III: Analysis of Covariance).

So first fit:

MODEL GFR_4=RESPONSE GFR_0 GFR_0*RESPONSE;

and check the significance of the interaction term.  If it is non-significant, then the MODEL statement I gave at first is appropriate.  If it is significant, then the differences need to be calculated at a minimum of three time zero values (low, median, high) using multiple LSMEANS statements and the AT= option (check the documentation on how to do this).

Steve Denham

Jimbo
Calcite | Level 5

Many thanks Steve!

sas-innovate-2024.png

Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.

Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 1549 views
  • 0 likes
  • 2 in conversation