<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Why does zero variance in raw data have the greatest variance in estimated model scale in GLMM in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Why-does-zero-variance-in-raw-data-have-the-greatest-variance-in/m-p/816418#M40296</link>
    <description>&lt;P&gt;Hi All,&lt;/P&gt;
&lt;P&gt;I have a question confusing me. The model is generalized linear model with negative binomial distribution. My data are insect counts from 10 dates and 7 locations. The counts are all zeros on 7/16/21. My question is that why the standard error on 7/16/21 is the greatest one, 484.43? As you can see the data scale standard error is still close to 0, which is consistent to the raw data. Another question is that the mean separation is based on the estimates on model scale, then is the mean separation still correct (0 variance on raw data, but greatest variance on model scaled values)?&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;proc glimmix data=two plots=residualpanel method=quad;
  by year;
  class date field_id;
  model coe_female = date field_id/dist=NB;
  nloptions tech=nrridg;
  lsmeans date/adjust=Tukey lines ilink plots=meanplot(join); 
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="RosieSAS_0-1654286924454.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/71990i82350127A6B4FB34/image-size/medium?v=v2&amp;amp;px=400" role="button" title="RosieSAS_0-1654286924454.png" alt="RosieSAS_0-1654286924454.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Thanks a lot!&lt;/P&gt;
&lt;P&gt;Rosie&lt;/P&gt;</description>
    <pubDate>Fri, 03 Jun 2022 20:13:51 GMT</pubDate>
    <dc:creator>RosieSAS</dc:creator>
    <dc:date>2022-06-03T20:13:51Z</dc:date>
    <item>
      <title>Why does zero variance in raw data have the greatest variance in estimated model scale in GLMM</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Why-does-zero-variance-in-raw-data-have-the-greatest-variance-in/m-p/816418#M40296</link>
      <description>&lt;P&gt;Hi All,&lt;/P&gt;
&lt;P&gt;I have a question confusing me. The model is generalized linear model with negative binomial distribution. My data are insect counts from 10 dates and 7 locations. The counts are all zeros on 7/16/21. My question is that why the standard error on 7/16/21 is the greatest one, 484.43? As you can see the data scale standard error is still close to 0, which is consistent to the raw data. Another question is that the mean separation is based on the estimates on model scale, then is the mean separation still correct (0 variance on raw data, but greatest variance on model scaled values)?&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;proc glimmix data=two plots=residualpanel method=quad;
  by year;
  class date field_id;
  model coe_female = date field_id/dist=NB;
  nloptions tech=nrridg;
  lsmeans date/adjust=Tukey lines ilink plots=meanplot(join); 
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="RosieSAS_0-1654286924454.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/71990i82350127A6B4FB34/image-size/medium?v=v2&amp;amp;px=400" role="button" title="RosieSAS_0-1654286924454.png" alt="RosieSAS_0-1654286924454.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Thanks a lot!&lt;/P&gt;
&lt;P&gt;Rosie&lt;/P&gt;</description>
      <pubDate>Fri, 03 Jun 2022 20:13:51 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Why-does-zero-variance-in-raw-data-have-the-greatest-variance-in/m-p/816418#M40296</guid>
      <dc:creator>RosieSAS</dc:creator>
      <dc:date>2022-06-03T20:13:51Z</dc:date>
    </item>
    <item>
      <title>Re: Why does zero variance in raw data have the greatest variance in estimated model scale in GLMM</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Why-does-zero-variance-in-raw-data-have-the-greatest-variance-in/m-p/816843#M40317</link>
      <description>&lt;P&gt;Is there anyone can give me a hint or insights of the confusing result? Any thought will be appreciated!&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13684"&gt;@Rick_SAS&lt;/a&gt;&amp;nbsp;,&amp;nbsp;@StatDave_sas.&lt;/P&gt;</description>
      <pubDate>Tue, 07 Jun 2022 12:28:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Why-does-zero-variance-in-raw-data-have-the-greatest-variance-in/m-p/816843#M40317</guid>
      <dc:creator>RosieSAS</dc:creator>
      <dc:date>2022-06-07T12:28:36Z</dc:date>
    </item>
    <item>
      <title>Re: Why does zero variance in raw data have the greatest variance in estimated model scale in GLMM</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Why-does-zero-variance-in-raw-data-have-the-greatest-variance-in/m-p/816849#M40318</link>
      <description>&lt;P&gt;Your question seems to be "&lt;SPAN&gt;Why does zero variance in raw data have the greatest variance in estimated model scale in GLMM."&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;In general, I don't think it does. It depends on the data. For your data, the reference category is the last category of the DATE variable. The mean of the data in the reference level is 0.86.&amp;nbsp; In contrast, the mean of the values in the DATE='16JUL2021'd is -13.7. This is far away from the reference mean, which is why the variance of the difference between the mean is so large.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt; So your result is not because the counts for&amp;nbsp;'16JUL2021'd are a constant (0), but because the constant is far from the average count for the reference category.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 07 Jun 2022 13:00:32 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Why-does-zero-variance-in-raw-data-have-the-greatest-variance-in/m-p/816849#M40318</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2022-06-07T13:00:32Z</dc:date>
    </item>
    <item>
      <title>Re: Why does zero variance in raw data have the greatest variance in estimated model scale in GLMM</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Why-does-zero-variance-in-raw-data-have-the-greatest-variance-in/m-p/816850#M40319</link>
      <description>Thanks for your reply, &lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13684"&gt;@Rick_SAS&lt;/a&gt;. I can understand why the mean value on 7/16/2021 is the smallest, -13.7, but not the standard error. Do you mean the standard error, 484.43, reflects the variance of the difference between the mean on 7/16/2021 and on 09/03/2021? That is totally different from my understanding that the standard errors in LS-mean table are the estimated standard errors of each particular level of the means on model scale, not relative values depending on the reference level. Then why the standard error from ILINK option on raw data scale is not the greatest anymore?</description>
      <pubDate>Tue, 07 Jun 2022 13:14:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Why-does-zero-variance-in-raw-data-have-the-greatest-variance-in/m-p/816850#M40319</guid>
      <dc:creator>RosieSAS</dc:creator>
      <dc:date>2022-06-07T13:14:11Z</dc:date>
    </item>
    <item>
      <title>Re: Why does zero variance in raw data have the greatest variance in estimated model scale in GLMM</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Why-does-zero-variance-in-raw-data-have-the-greatest-variance-in/m-p/816856#M40320</link>
      <description>&lt;P&gt;When I checked the estimated parameters, the stdr is 484.43 of 7/16/2021. The stdr of differences of DATA means when comparing 7/16/2021 to other dates, they are always 484.43. Because of the huge stdr, it is difficult to detect any significant difference between the counts on 7/16/2021 to other dates. Do you know why? Thanks!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="RosieSAS_0-1654608131918.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/72037i445EE45A2CA0C0F6/image-size/medium?v=v2&amp;amp;px=400" role="button" title="RosieSAS_0-1654608131918.png" alt="RosieSAS_0-1654608131918.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="RosieSAS_1-1654608159445.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/72038i9DBD980CE845E4BE/image-size/medium?v=v2&amp;amp;px=400" role="button" title="RosieSAS_1-1654608159445.png" alt="RosieSAS_1-1654608159445.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 07 Jun 2022 13:29:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Why-does-zero-variance-in-raw-data-have-the-greatest-variance-in/m-p/816856#M40320</guid>
      <dc:creator>RosieSAS</dc:creator>
      <dc:date>2022-06-07T13:29:18Z</dc:date>
    </item>
    <item>
      <title>Re: Why does zero variance in raw data have the greatest variance in estimated model scale in GLMM</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Why-does-zero-variance-in-raw-data-have-the-greatest-variance-in/m-p/818588#M40445</link>
      <description>&lt;P&gt;I suspect that the Hessian matrix is nearly singular - the zero count for that date isn't handled well by the log link.&amp;nbsp; I would be very tempted to remove the 7/16/21 records, if in fact there are nothing but zeroes observed then.&amp;nbsp; Also, since you have no RANDOM effects in your model, you may wish to do the analysis in PROC GENMOD, where you at least have the option of fitting a zero-inflated model.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;SteveDenham&lt;/P&gt;</description>
      <pubDate>Thu, 16 Jun 2022 14:57:15 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Why-does-zero-variance-in-raw-data-have-the-greatest-variance-in/m-p/818588#M40445</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2022-06-16T14:57:15Z</dc:date>
    </item>
  </channel>
</rss>

