<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Proc genmod poisson dist. in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-genmod-poisson-dist/m-p/183606#M9525</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;&lt;BR /&gt;Hi all,&lt;/P&gt;&lt;P&gt;I would like to first thank you all for providing your expert advise and helping us understand SAS better.&amp;nbsp; I am new with these statistical modeling especially using proc genmod.&amp;nbsp; My variables are Region (0-9), race (0-3), rates, cases, population and year (1996-2012).&lt;/P&gt;&lt;P&gt;What I want to compare is rates among different racial groups across the regions (so compare 1-2, 1-3, 2-3, etc).&amp;nbsp; I was told in a SAS class that proc genmod with poisson distribution would be ideal.&amp;nbsp; We are dealing with not normal surveillance data and rates aren't best with other distributions. We want to control for overdispersion and get Relative Risk and Confidence intervals.&amp;nbsp; We would like to see the difference over time if possible but I know it may be difficult.&amp;nbsp; Does the model automatically takes the log or I should first get log and offset that. (data race; set race; Y=cases/population*100000; ln=log(Y); run; and&lt;/P&gt;&lt;P&gt;My model goes like this&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;proc genmod data=race;&lt;/P&gt;&lt;P&gt;class region race;&lt;/P&gt;&lt;P&gt;model Y=region race / dist=p link=log scale=pearson; (or 2nd model Y=region race / dist=p link=log offset=ln scale=pearson;)&lt;/P&gt;&lt;P&gt;repeated subject=region / type=unstr;&lt;/P&gt;&lt;P&gt;lsmeans region sex / exp cl tdiff e om;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;quit;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks CP&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Thu, 17 Jul 2014 17:25:12 GMT</pubDate>
    <dc:creator>cp</dc:creator>
    <dc:date>2014-07-17T17:25:12Z</dc:date>
    <item>
      <title>Proc genmod poisson dist.</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-genmod-poisson-dist/m-p/183606#M9525</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;&lt;BR /&gt;Hi all,&lt;/P&gt;&lt;P&gt;I would like to first thank you all for providing your expert advise and helping us understand SAS better.&amp;nbsp; I am new with these statistical modeling especially using proc genmod.&amp;nbsp; My variables are Region (0-9), race (0-3), rates, cases, population and year (1996-2012).&lt;/P&gt;&lt;P&gt;What I want to compare is rates among different racial groups across the regions (so compare 1-2, 1-3, 2-3, etc).&amp;nbsp; I was told in a SAS class that proc genmod with poisson distribution would be ideal.&amp;nbsp; We are dealing with not normal surveillance data and rates aren't best with other distributions. We want to control for overdispersion and get Relative Risk and Confidence intervals.&amp;nbsp; We would like to see the difference over time if possible but I know it may be difficult.&amp;nbsp; Does the model automatically takes the log or I should first get log and offset that. (data race; set race; Y=cases/population*100000; ln=log(Y); run; and&lt;/P&gt;&lt;P&gt;My model goes like this&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;proc genmod data=race;&lt;/P&gt;&lt;P&gt;class region race;&lt;/P&gt;&lt;P&gt;model Y=region race / dist=p link=log scale=pearson; (or 2nd model Y=region race / dist=p link=log offset=ln scale=pearson;)&lt;/P&gt;&lt;P&gt;repeated subject=region / type=unstr;&lt;/P&gt;&lt;P&gt;lsmeans region sex / exp cl tdiff e om;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;quit;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks CP&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 17 Jul 2014 17:25:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-genmod-poisson-dist/m-p/183606#M9525</guid>
      <dc:creator>cp</dc:creator>
      <dc:date>2014-07-17T17:25:12Z</dc:date>
    </item>
    <item>
      <title>Re: Proc genmod poisson dist.</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-genmod-poisson-dist/m-p/183607#M9526</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;My thoughts;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Change Y to cases.&amp;nbsp; Integer counts of the number of cases.&amp;nbsp; The offset then would be log(population/100000).&amp;nbsp; I see 'sex' in the lsmeans statement, but it is not in the model statement, so that will lead to non-execution. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Also, with region having 10 levels, you are estimating 45 parameters with an unstructured covariance matrix.&amp;nbsp; Using a rule of thumb of at least 10 observations per parameter, you will need a moderate sized dataset.&amp;nbsp; You may wish to consider a more restrictive covariance matrix--in particular type=ind, which assumes separate variances for each region, and that the regions are independent (well, uncorrelated).&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;As far as overdispersion, the first thing I would think of would be shifting to a negative binomial distribution.&amp;nbsp; If there is still an overdispersion problem, you will likely have to switch procedures to GLIMMIX.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Steve Denham&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 18 Jul 2014 14:36:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-genmod-poisson-dist/m-p/183607#M9526</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2014-07-18T14:36:36Z</dc:date>
    </item>
    <item>
      <title>Re: Proc genmod poisson dist.</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-genmod-poisson-dist/m-p/183608#M9527</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thank you for your help.&amp;nbsp; I greatly appreciate it.&amp;nbsp;&amp;nbsp; &lt;/P&gt;&lt;P&gt;I have modified the model and which seems to be working.&lt;/P&gt;&lt;P&gt;Proc genmod data=ctrace96;&lt;/P&gt;&lt;P&gt;class region race;&lt;/P&gt;&lt;P&gt;model cases=region sex/dist=p offset=lnn scale=pearson type3;&lt;/P&gt;&lt;P&gt;repeated subject=region/type=ind;&lt;/P&gt;&lt;P&gt;lsmeans region sex/exp cl diff e om; run; quit;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;This does comparison and produce confidence interval and odds ratio.&amp;nbsp; What I am interested in is Risk ratio and differences of those ratios.&amp;nbsp; Is there any option that I can include or any way to find RR? &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;How do we test if this model controls effectively for overdispersion?&amp;nbsp; I have also tried Glimmix but have not been able to produce a comparison table like proc genmod does.&amp;nbsp; As you suggested, overdispersion can be controlled by negative binomial dist.&amp;nbsp; Is it ok to just use DIST=NEGBIN in above model and keep rest same.&amp;nbsp; Output for the above model doesn't give goodness of fit table so can't figure out if overdispersion is an issue or not. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;CP&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 21 Jul 2014 18:46:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-genmod-poisson-dist/m-p/183608#M9527</guid>
      <dc:creator>cp</dc:creator>
      <dc:date>2014-07-21T18:46:54Z</dc:date>
    </item>
    <item>
      <title>Re: Proc genmod poisson dist.</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-genmod-poisson-dist/m-p/183609#M9528</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;As far as changing the distribution, you are on the right track.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Deviance or Pearson chi-square divided by degrees of freedom gives a measure of overdispersion.&amp;nbsp; In GLIMMIX you could check the Fit Statistics table for this parameter.&amp;nbsp; In GENMOD, you may have to do this by hand, and then refit using scale=&amp;lt;the overdispersion value you find&amp;gt;.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;As far as relative risk estimates, again it is outside my usual field.&amp;nbsp; Check for posts by &lt;A __default_attr="178104" __jive_macro_name="user" class="jive_macro jive_macro_user" data-objecttype="3" href="https://communities.sas.com/"&gt;&lt;/A&gt; that may address this.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Steve Denham&lt;/P&gt;&lt;P&gt;&lt;BR /&gt; &lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 22 Jul 2014 13:56:07 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-genmod-poisson-dist/m-p/183609#M9528</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2014-07-22T13:56:07Z</dc:date>
    </item>
  </channel>
</rss>

