<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: PROC GLIMMIX Issue with Residuals in SAS Procedures</title>
    <link>https://communities.sas.com/t5/SAS-Procedures/PROC-GLIMMIX-Issue-with-Residuals/m-p/25318#M5715</link>
    <description>As indicated elsewhere, you must have multiple observations for each ID (individual) if you have ID as a random effect. A statement such as plots=residualpanel should give you four graphs, including a 'normal' quantile plot (residual vs. quantile on a normal scale), residual vs. linear predictor (which is the estimate logit here), a histogram of residuals, and a boxplot. I prefer that you use plots=studentpanel to get the studentized residuals (actually conditional studentized residuals here). Easier to spot outliers. &lt;BR /&gt;
&lt;BR /&gt;
Since your response is binary (0/1), these diagnostic plots are challenging. Although you can do all the standard residual plots, but as stated by David Collett in Modelling Binary Data, "some of them become difficult to interpret." You can get strange looking residual plots. The Collett book has an excellent chapter on GLM diagnostics, although he does not deal with random effects (in that chapter).&lt;BR /&gt;
&lt;BR /&gt;
GLIMMIX does not (yet) have formal influence diagnostics (as found in MIXED).</description>
    <pubDate>Thu, 10 Mar 2011 16:29:23 GMT</pubDate>
    <dc:creator>lvm</dc:creator>
    <dc:date>2011-03-10T16:29:23Z</dc:date>
    <item>
      <title>PROC GLIMMIX Issue with Residuals</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/PROC-GLIMMIX-Issue-with-Residuals/m-p/25314#M5711</link>
      <description>Hi, my name is Andy and I'm analyzing a large dataset using SAS Proc Glimmix&lt;BR /&gt;
procedure. My dataset contains over 20,000 GPS records. I'm trying to&lt;BR /&gt;
evaluate why certain deer were observed during hunting season thus I've coded&lt;BR /&gt;
the deer that were observed with a "1" and those not observed with a "0." I&lt;BR /&gt;
coded the entire our that the deer was observed to encompass any hunter&lt;BR /&gt;
recording errors. My model is shown below: &lt;BR /&gt;
&lt;BR /&gt;
PROC GLIMMIX DATA=OBS METHOD=LAPLACE;&lt;BR /&gt;
 CLASS ID YEAR EXPOSURE HABITAT_VALUE;&lt;BR /&gt;
 MODEL OBSERVED (EVENT = '1') = EXPOSURE STEPLENGTH HABITAT_VALUE ELEVATION&lt;BR /&gt;
DIST_NEAREST_ROAD / DIST=BINARY LINK=LOGIT SOLUTION;&lt;BR /&gt;
   RANDOM ID YEAR;&lt;BR /&gt;
RUN;&lt;BR /&gt;
&lt;BR /&gt;
I want to see if the different independent variables influence the&lt;BR /&gt;
observation of deer throughout the hunting season. My question is what are&lt;BR /&gt;
the assumptions that I need to adhere to with logistic regression. I read&lt;BR /&gt;
that the data does not need to be normally distributed. I know "steplength"&lt;BR /&gt;
is extremely right skewed with the mean of 48 meters and a max value of 1,400&lt;BR /&gt;
meters. If normality is not an issue then I assumed the next step would be to&lt;BR /&gt;
at least examine the residuals and remove some of those extreme movements. I&lt;BR /&gt;
added the PLOT=RESIDUALPANEL option to my model with ODS GRAPHICS and plotted&lt;BR /&gt;
the residuals. The residuals looked very different than what I'd see in a&lt;BR /&gt;
PROC MIXED model and I was unable to interpret the plots to determine if I&lt;BR /&gt;
need to remove any outliers. Will I not receive a normal residual plot,&lt;BR /&gt;
similar to PROC MIXED? If so, how do you interpret residual plots from PROC&lt;BR /&gt;
GLIMMIX. Thank you very much!</description>
      <pubDate>Wed, 09 Mar 2011 14:17:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/PROC-GLIMMIX-Issue-with-Residuals/m-p/25314#M5711</guid>
      <dc:creator>Buck1480</dc:creator>
      <dc:date>2011-03-09T14:17:11Z</dc:date>
    </item>
    <item>
      <title>Re: PROC GLIMMIX Issue with Residuals</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/PROC-GLIMMIX-Issue-with-Residuals/m-p/25315#M5712</link>
      <description>Andy,&lt;BR /&gt;
&lt;BR /&gt;
You need to repost this in the Statistical forum.  There are readers there that may be able to help.&lt;BR /&gt;
&lt;BR /&gt;
My first thought is that a residual in a logistic regression is going to be bounded on the probability scale, so you probably want to plot using something like:&lt;BR /&gt;
&lt;BR /&gt;
proc glimmix plots=(ResidualPanel(marginal)&lt;BR /&gt;
                    ResidualPanel(unpack conditional);&lt;BR /&gt;
&lt;BR /&gt;
This will give the residuals both using the random effect predictors (conditional) and averaging over the random effects (marginal).  I don't know if influence statistics (Cook's D, DFFITS) are available for GLIMMIX.&lt;BR /&gt;
&lt;BR /&gt;
I have one question about the variable ID--does it refer to an individual deer, and if so are there repeated observations on that animal?  Then some spatial modeling might be in order as well, or grouping variances by animal, or, well, a whole bundle of things, but probably not relevant to your question about the plots.&lt;BR /&gt;
&lt;BR /&gt;
Good luck,&lt;BR /&gt;
&lt;BR /&gt;
SteveDenham</description>
      <pubDate>Wed, 09 Mar 2011 16:07:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/PROC-GLIMMIX-Issue-with-Residuals/m-p/25315#M5712</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2011-03-09T16:07:29Z</dc:date>
    </item>
    <item>
      <title>Re: PROC GLIMMIX Issue with Residuals</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/PROC-GLIMMIX-Issue-with-Residuals/m-p/25316#M5713</link>
      <description>Steve, &lt;BR /&gt;
&lt;BR /&gt;
    Yes, ID refers to an individual deer. I tried running the model with different covariance structures such as: VC (default), CS, AR(1), and UN. The default covariance structure (VC) provided me with the best fit model based on AICc. I've tried running the spatial power covariance structure in MIXED when I was analyzing movement data but would receive an error message stating that it stopped because of an infinite likelihood. I determined that the error was due having multiple lines of data for one indvidual deer. Unfortunately, I wasn't sure how to overcome this and was told by a statistician to use another covariance structure. Thank you for your help!</description>
      <pubDate>Wed, 09 Mar 2011 16:17:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/PROC-GLIMMIX-Issue-with-Residuals/m-p/25316#M5713</guid>
      <dc:creator>Buck1480</dc:creator>
      <dc:date>2011-03-09T16:17:49Z</dc:date>
    </item>
    <item>
      <title>Re: PROC GLIMMIX Issue with Residuals</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/PROC-GLIMMIX-Issue-with-Residuals/m-p/25317#M5714</link>
      <description>Aha!  The infinite likelihood caused by multiple lines per subject problem.&lt;BR /&gt;
&lt;BR /&gt;
You can fix this by respecifying the subject, so that instead of subject=ID, you use subject=ID*&lt;APPROPRIATE classification="" variable="" goes="" here=""&gt;.  Something makes each line unique, and it should be included on the CLASS statement.  A good guess would be one of the fixed effects, say exposure (just a guess, not sure at all).  If that is the situation then subject=ID*EXPOSURE might fix the infinite likelihood.  It might get more complex to the point that subject=ID*EXPOSURE*HABITAT_VALUE may be needed.&lt;BR /&gt;
&lt;BR /&gt;
This still doesn't address the residual plot problem.  I keep hoping someone will drop a hint in here.&lt;BR /&gt;
&lt;BR /&gt;
SteveDenham&lt;/APPROPRIATE&gt;</description>
      <pubDate>Thu, 10 Mar 2011 11:50:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/PROC-GLIMMIX-Issue-with-Residuals/m-p/25317#M5714</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2011-03-10T11:50:31Z</dc:date>
    </item>
    <item>
      <title>Re: PROC GLIMMIX Issue with Residuals</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/PROC-GLIMMIX-Issue-with-Residuals/m-p/25318#M5715</link>
      <description>As indicated elsewhere, you must have multiple observations for each ID (individual) if you have ID as a random effect. A statement such as plots=residualpanel should give you four graphs, including a 'normal' quantile plot (residual vs. quantile on a normal scale), residual vs. linear predictor (which is the estimate logit here), a histogram of residuals, and a boxplot. I prefer that you use plots=studentpanel to get the studentized residuals (actually conditional studentized residuals here). Easier to spot outliers. &lt;BR /&gt;
&lt;BR /&gt;
Since your response is binary (0/1), these diagnostic plots are challenging. Although you can do all the standard residual plots, but as stated by David Collett in Modelling Binary Data, "some of them become difficult to interpret." You can get strange looking residual plots. The Collett book has an excellent chapter on GLM diagnostics, although he does not deal with random effects (in that chapter).&lt;BR /&gt;
&lt;BR /&gt;
GLIMMIX does not (yet) have formal influence diagnostics (as found in MIXED).</description>
      <pubDate>Thu, 10 Mar 2011 16:29:23 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/PROC-GLIMMIX-Issue-with-Residuals/m-p/25318#M5715</guid>
      <dc:creator>lvm</dc:creator>
      <dc:date>2011-03-10T16:29:23Z</dc:date>
    </item>
  </channel>
</rss>

