Thank you for your response!
"GLIMMIX allows for a variety of correlated data, including multilevel effects. It also deals with missingness up to missing at random (it does not eliminate records that have missing values for model factors)"
So, I can account for the correlation of subjects within the same cluster by indicating the cluster variable (hospital/school) a random effect. In other words, I can capture the variability among subjects.
Use the first example in the PROC GEE documentation for a good comparison of marginal and random effect models. I have two concerns. The first is that I am not clear on the use of Drug as a response variable when your research question talks about comparing levels of Drug. I would consider Drug as a fixed effect to be included in the model, and the response to be something measured on the patient (cured-not cured, for example).
Yes, I agree! Let's go with your example.
The second is that I cannot tell if there are two levels of clustering here--patient level and hospital level. If each patient is measured one time then there are no patient level clusters - the patient level effect is the "residual" or scale estimate. Since you mention that the design is repeated this would be treated as a patient/student level cluster.
The data is hierarchal. Patients are measured each year on the same variables (e.g., alcohol consumption, depression). So, I have patient data and then patients are recruited from different hospitals. The clustering is at the hospital level. So my repeated statement would be "repeated subject = HospitalID".
However, hospital needs to be specified as either fixed (inference space is then repeated studies at the specified hospitals) or random (inference space is repeated studies at the greater population of hospitals, of which the ones in the data represent a "random" sample). From this, I can see a PROC GEE approach for the narrow inference space of the sample of hospitals, or a PROC GLIMMIX approach for the broad inference space of "all hospitals".
So, I can use either PROC GEE or PROC GLIMMIX depending on whether I decide to state "hospitalID" as a random effect or fixed effect.
I'm not quite grasping what the difference between "fixed - inference space repeated at specified hospitals" and "random - inference space is repeated studies at the greater population of hospitals". It seems like the variability that is caused by patients being recruited from different hospitals will be accounted for in each procedure
Thank you for taking the time to help!
... View more