<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic PROC GENMOD/PROC GEE for repeated County-level data in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-GENMOD-PROC-GEE-for-repeated-County-level-data/m-p/882480#M43647</link>
    <description>&lt;P&gt;Hi everyone,&lt;/P&gt;&lt;P&gt;I am currently running county-level data (3130 counties) to study the association between prescription rates (continuous, independent variable) and a county being labeled as "high-risk" (dependent variable, can be 1=high-risk or 0=non-high-risk).&lt;/P&gt;&lt;P&gt;The prescription rates&amp;nbsp; are at the county level, and so is the outcome variable "high-risk". While I'm not looking at state-level data, it's important to bear in mind that counties are clustered within states, so we need to account for clustering of counties within states.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Based on this idea, should the "repeated" statement be for the county or the state? I am sharing 2 SAS codes below and would appreciate advise on what be more appropriate. Thank you so much!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;proc genmod data=dataset descending;&lt;DIV&gt;CLASS State &amp;nbsp;/param=ref;&lt;/DIV&gt;&lt;DIV&gt;model highrisk= Opioid_Rx_Rate &amp;nbsp;/dist=binomial link=logit;&lt;/DIV&gt;&lt;DIV&gt;repeated subject = State/type=exch;&lt;/DIV&gt;&lt;DIV&gt;estimate 'Opioid_Rx_Rate' Opioid_Rx_Rate 1 / exp;&lt;/DIV&gt;&lt;DIV&gt;run;&lt;/DIV&gt;quit;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;OR&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;proc genmod data=dataset descending;&lt;/SPAN&gt;&lt;DIV&gt;CLASS County /param=ref;&lt;/DIV&gt;&lt;DIV&gt;model highrisk= Opioid_Rx_Rate &amp;nbsp;/dist=binomial link=logit;&lt;/DIV&gt;&lt;DIV&gt;repeated subject =&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;County&amp;nbsp;&lt;/SPAN&gt;/type=exch;&lt;/DIV&gt;&lt;DIV&gt;estimate 'Opioid_Rx_Rate' Opioid_Rx_Rate 1 / exp;&lt;/DIV&gt;&lt;DIV&gt;run;&lt;/DIV&gt;&lt;SPAN&gt;quit;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 26 Jun 2023 16:51:39 GMT</pubDate>
    <dc:creator>johnrizk</dc:creator>
    <dc:date>2023-06-26T16:51:39Z</dc:date>
    <item>
      <title>PROC GENMOD/PROC GEE for repeated County-level data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-GENMOD-PROC-GEE-for-repeated-County-level-data/m-p/882480#M43647</link>
      <description>&lt;P&gt;Hi everyone,&lt;/P&gt;&lt;P&gt;I am currently running county-level data (3130 counties) to study the association between prescription rates (continuous, independent variable) and a county being labeled as "high-risk" (dependent variable, can be 1=high-risk or 0=non-high-risk).&lt;/P&gt;&lt;P&gt;The prescription rates&amp;nbsp; are at the county level, and so is the outcome variable "high-risk". While I'm not looking at state-level data, it's important to bear in mind that counties are clustered within states, so we need to account for clustering of counties within states.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Based on this idea, should the "repeated" statement be for the county or the state? I am sharing 2 SAS codes below and would appreciate advise on what be more appropriate. Thank you so much!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;proc genmod data=dataset descending;&lt;DIV&gt;CLASS State &amp;nbsp;/param=ref;&lt;/DIV&gt;&lt;DIV&gt;model highrisk= Opioid_Rx_Rate &amp;nbsp;/dist=binomial link=logit;&lt;/DIV&gt;&lt;DIV&gt;repeated subject = State/type=exch;&lt;/DIV&gt;&lt;DIV&gt;estimate 'Opioid_Rx_Rate' Opioid_Rx_Rate 1 / exp;&lt;/DIV&gt;&lt;DIV&gt;run;&lt;/DIV&gt;quit;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;OR&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;proc genmod data=dataset descending;&lt;/SPAN&gt;&lt;DIV&gt;CLASS County /param=ref;&lt;/DIV&gt;&lt;DIV&gt;model highrisk= Opioid_Rx_Rate &amp;nbsp;/dist=binomial link=logit;&lt;/DIV&gt;&lt;DIV&gt;repeated subject =&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;County&amp;nbsp;&lt;/SPAN&gt;/type=exch;&lt;/DIV&gt;&lt;DIV&gt;estimate 'Opioid_Rx_Rate' Opioid_Rx_Rate 1 / exp;&lt;/DIV&gt;&lt;DIV&gt;run;&lt;/DIV&gt;&lt;SPAN&gt;quit;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 26 Jun 2023 16:51:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-GENMOD-PROC-GEE-for-repeated-County-level-data/m-p/882480#M43647</guid>
      <dc:creator>johnrizk</dc:creator>
      <dc:date>2023-06-26T16:51:39Z</dc:date>
    </item>
    <item>
      <title>Re: PROC GENMOD/PROC GEE for repeated County-level data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-GENMOD-PROC-GEE-for-repeated-County-level-data/m-p/882496#M43648</link>
      <description>&lt;P&gt;&lt;SPAN&gt;The purpose of the SUBJECT= option in the REPEATED statement of PROC GENMOD is simply to distinguish those observations that are correlated from those that aren't. That is, it defines the clusters of correlated observations. Observations with the same value of the SUBJECT= effect belong to the same cluster and are assumed to be correlated. So, if you feel that there is correlation among all of the observations in the same state, then you should specify SUBJECT=STATE.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 26 Jun 2023 17:59:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-GENMOD-PROC-GEE-for-repeated-County-level-data/m-p/882496#M43648</guid>
      <dc:creator>StatDave</dc:creator>
      <dc:date>2023-06-26T17:59:21Z</dc:date>
    </item>
  </channel>
</rss>

