Hello,
I’m in the early stages of developing a model to identify patient, provider, and encounter-level factors significantly associated with issuance of a specific type of prescription. Here’s some background:
It is this last point (the distribution of repeat encounters) that I’m wondering about. That is, I’m wondering if the high proportion of patients with only a single encounter will be an issue in a model designed for repeated observations. I’m also wondering if the skewed distribution in repeat observations might cause problems. To date, I’ve been unable to find any examples/commentary on modelling data with such characteristics.
This may be a non-issue. However, I thought I’d ask before I started building the model in earnest, so I can avoid any pitfalls at the outset. Any thoughts/advice/comments are appreciated.
Thanks,
Stuart
My first thought was to fit the repeated measures covariance structure with a spline, but that may give those single digit encounters on the right tail too much emphasis. Perhaps binning into some reasonable sizes would help, unless it becomes obvious that those patients who are doctor shopping until they get a response are in this category. To me the substantive question isn't whether a prescription is obtained on the 12th or the 40th encounter--those are all the same.
The other thing is to maybe turn this "inside out" and deal with it as a time to event analysis, with those that never receive a prescription viewed as censored after their last encounter. If you have a hierarchical model, though, this may not be so easily done.
Steve Denham
Thanks for your input Steve (and apologies for the tardy reply).
After consulting with colleagues, I ended up just using a pretty straightforward GEE model using PROC GENMOD. The model was not at all sensitive to the repeated measures covariance structure, I'm guessing due to the large number of observations compared to the number of repeated measures.
Thanks again,
Stuart
Join us for SAS Innovate 2025, our biggest and most exciting global event of the year, in Orlando, FL, from May 6-9.
Early bird rate extended! Save $200 when you sign up by March 31.
ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.
Find more tutorials on the SAS Users YouTube channel.