Hello everyone. I would like to trend the amount of substance used over a period of time for individuals using a particular service to see whether there is an increase in substance use over time or not. Also I would like to know whether these changes are associated with a cardiac problem or not. The amount of substance (y axis) is in ranges, hence it is not continuous, while the Y axis is time. Each subject has measures at several points and these timepoints are not consistent for all the participants. No comparison group.. The cardiac event is a dichotomous variable (Yes/No).
There could be several approaches or models to address your questions and the choice of the most appropriate one for your data and goals has to be yours. That said, one approach is to fit a suitable Generalized Estimating Equations (GEE) model to accommodate the repeated measurements on each subject. I assume that the two cardiac groups consist of two independent sets of subjects. If you assume that your ordinal response is at least roughly normally distributed in each group at each time and that any trend over time is approximately linear, then the model could simply allow for an intercept and a slope for each of the two cardiac groups. The slopes measure the trend in each group.
The basic idea is similar to what is discussed in this note on intervention analysis, but doesn't involve an intervention. The situation described above is like the right-hand (post intervention) side of the data plot in the "Treated and Control Groups, Continuous Response" section of the note. The following generates the same example data but keeps only the observations for the two independent groups over months 4-6.
data a;
sd=3;
keep id exposed month iv expiv y;
do exposed=1,0;
do month=1 to 6;
iv=(month>3);
expiv=cats(exposed,'_',iv);
input mean @;
do ID=1 to 10;
Y=rannor(23425)*sd+mean;
if iv then output;
end;
end; end;
datalines;
26 30 32 45 52 63
27 31 33 29 32 36
;
The PROC GEE step fits and plots a model that allows for separate intercepts (EXPOSED) and slopes or trends (EXPOSED*MONTH) for the two exposure groups (like your cardiac groups). Because the model contains a single, common slope estimate (MONTH), the test for the MONTH*EXPOSED interaction is a test of whether the slopes for the two exposed groups are the same. The highly significant test for the interaction indicates that the group trends differ. Whether the subjects are all observed at that same set of time points doesn't affect this particular analysis.
proc gee data=a;
class id exposed;
model y = exposed month exposed*month;
repeated subject=id;
effectplot slicefit(x=month sliceby=exposed) / extend=data clm;
run;
You could then fit a form of the model that directly shows the estimated intercepts (EXPOSED) and slopes (EXPOSED*MONTH) for the two groups.
proc gee data=a;
class id exposed;
model y = exposed exposed*month / noint;
repeated subject=id;
effectplot slicefit(x=month sliceby=exposed) / extend=data clm;
run;
Hello. Thanks so much for your response. Highly appreciated. I would like to see the trend of amount of substance use over time for individuals and not groups. No comparison of trends between the groups either. The unit of the substance amount is in ranges i.e. categorical. Is it possible to conduct analysis without comparing groups in this case? Can you please clarify whether the type of analysis that you suggested would work for that? Thanks.
Hello everyone. I would like to trend the amount of substance used over a period of time for individuals using a particular service to see whether there is an increase in substance use over time or not. Also I would like to know whether these changes are associated with a cardiac problem or not. The amount of substance (y axis) is in ranges, hence it is not continuous, while the Y axis is time. Each subject has measures at several points and these timepoints are not consistent for all the participants. No comparison group.. The cardiac event is a dichotomous variable (Yes/No).
My questions are:
1) What type of statistical analysis would be appropriate to trend the amount of substance used by individuals over the period of study, considering that the amount of substance use will be in ranges, not a continuous variable.
2) What analysis can I do for the 2nd part to test whether there is an association with cardiac problem?
I have been looking for research papers that may have done analysis of this type but I have not found yet. Any advice? If anyone has a resource link or research paper link that could help, that would be helpful in addition to the responses that I get from this forum. Thanks.
I think this is a duplicate of "another" topic:
Please don't post your question multiple times in several boards simultaneously.
If another board is better suited to receive your question, the topic will be moved by a user with "super-powers".
Thanks,
Koen
Hello. So sorry about that. I am new on this platform. I was not sure whether my question was posted, so I posted again. Thanks for letting me know.
SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!
ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.
Find more tutorials on the SAS Users YouTube channel.