BookmarkSubscribeRSS Feed
techplexus
Calcite | Level 5

I have data from patients who underwent tests at different time points. Some underwent 10 times over 1 year and some 100 times over 20 years. I want to analyse whether drop in a variable measured at those times was significant cause of an event like death. 

For example one patient had blood pressure measured at 1, 3,7 weeks. Another at 2, 6,9, 66, 98 weeks. So if drop in blood pressure at time was significant cause of death.

6 REPLIES 6
StatDave
SAS Super FREQ

There are several types of models that can be used to model clusters of correlated observations like you describe. One easy approach is the Generalized Estimating Equations (GEE) model which you can fit using PROC GEE. See the example for modeling a binary response (like your death response) using this model in the Getting Started section of the PROC GEE documentation. This model does not require equal numbers of observations per cluster (subject) nor that observations be at the same times, particularly if you use a correlation structure like TYPE=IND or EXCH as in that example. The predictors can vary over time within subjects. You should, however, have a large number of clusters for validity of the method.

techplexus
Calcite | Level 5

I have about 736 patients' data. Will that be sufficient for this model?

StatDave
SAS Super FREQ
Yes. Hundreds of clusters would generally be considered adequate for validity of the method.
StatDave
SAS Super FREQ
You can easily get a plot showing the effect of a continuous predictor on the event probability using an EFFECTPLOT statement. For example, adding this statement in the Getting Started example in the PROC GEE documentation gives a plot of the fitted model showing the effect of Age with other variables fixed.
effectplot fit(x=age)/ilink;
techplexus
Calcite | Level 5

Thank you for your reply. For example if my independent variables are continuous variables like systolic blood pressure , diastolic blood pressure and age. Patients are then diagnosed to be having heart disease like Myocardial infarction. Final endpoint variables are death or time to death. 

I want to know how much drop in SBP or DBP before or after diagnosis of heart disease would be significant to predict high chances of death so that additional treatment can be given to those having high risk. 

Additionally if variables like gender or education status can predict the death (this is secondary objective).

techplexus
Calcite | Level 5

I am not sure if those kind of predictions can be done with GEE or not as the output tables didn't give me much information.

SAS Innovate 2025: Register Now

Registration is now open for SAS Innovate 2025 , our biggest and most exciting global event of the year! Join us in Orlando, FL, May 6-9.
Sign up by Dec. 31 to get the 2024 rate of just $495.
Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 6 replies
  • 864 views
  • 3 likes
  • 2 in conversation