Statistical Procedures

johnrizk · Posted 06-26-2023 12:51 PM

Hi everyone,

I am currently running county-level data (3130 counties) to study the association between prescription rates (continuous, independent variable) and a county being labeled as "high-risk" (dependent variable, can be 1=high-risk or 0=non-high-risk).

The prescription rates are at the county level, and so is the outcome variable "high-risk". While I'm not looking at state-level data, it's important to bear in mind that counties are clustered within states, so we need to account for clustering of counties within states.

Based on this idea, should the "repeated" statement be for the county or the state? I am sharing 2 SAS codes below and would appreciate advise on what be more appropriate. Thank you so much!

proc genmod data=dataset descending;

CLASS State /param=ref;

model highrisk= Opioid_Rx_Rate /dist=binomial link=logit;

repeated subject = State/type=exch;

estimate 'Opioid_Rx_Rate' Opioid_Rx_Rate 1 / exp;

run;

quit;

OR

proc genmod data=dataset descending;

CLASS County /param=ref;

model highrisk= Opioid_Rx_Rate /dist=binomial link=logit;

repeated subject = County /type=exch;

estimate 'Opioid_Rx_Rate' Opioid_Rx_Rate 1 / exp;

run;

quit;

StatDave · Posted 06-26-2023 01:59 PM

The purpose of the SUBJECT= option in the REPEATED statement of PROC GENMOD is simply to distinguish those observations that are correlated from those that aren't. That is, it defines the clusters of correlated observations. Observations with the same value of the SUBJECT= effect belong to the same cluster and are assumed to be correlated. So, if you feel that there is correlation among all of the observations in the same state, then you should specify SUBJECT=STATE.

View solution in original post

StatDave · Posted 06-26-2023 01:59 PM

The purpose of the SUBJECT= option in the REPEATED statement of PROC GENMOD is simply to distinguish those observations that are correlated from those that aren't. That is, it defines the clusters of correlated observations. Observations with the same value of the SUBJECT= effect belong to the same cluster and are assumed to be correlated. So, if you feel that there is correlation among all of the observations in the same state, then you should specify SUBJECT=STATE.

Statistical Procedures

PROC GENMOD/PROC GEE for repeated County-level data

Re: PROC GENMOD/PROC GEE for repeated County-level data

Re: PROC GENMOD/PROC GEE for repeated County-level data

PROC GEE (or proc GENMOD) does not converge

PROC GENMOD statistics

Maximum Cluster Size in proc genmod

Proc GenMod and ODS output

PROC GLM/PROC GLIMMIX/ PROC GENMOD

Follow Us

What is...

Statistical Procedures

Our biggest data and AI event of the year.

Follow Us

What is...