BookmarkSubscribeRSS Feed
57078
Calcite | Level 5
Hi,

I am trying to fit a poisson regression in GENMOD and don't think the data fit the model very well after viewing the residuals, I am already using the DSCALE option for overdispersion as well. I think I should be using a zero-inflated model but I don't have a variable that predicts the zeros really. I am using income and categorical race to predict green space. I just read somewhere that negative binomial is better for data with lots of zeros maybe I should try this?

Also should the residual plots be a random cloud around zero for count data? What other diagnostics should I use? What should they look like?

Thanks! I just ran negative binomial and it did not converge.


Message was edited by: 57078
2 REPLIES 2
plf515
Lapis Lazuli | Level 10
> Hi,
>
> I am trying to fit a poisson regression in GENMOD and
> don't think the data fit the model very well after
> viewing the residuals, I am already using the DSCALE
> option for overdispersion as well. I think I should
> be using a zero-inflated model but I don't have a
> variable that predicts the zeros really. I am using
> income and categorical race to predict green space. I
> just read somewhere that negative binomial is better
> for data with lots of zeros maybe I should try this?
>
>
> Also should the residual plots be a random cloud
> around zero for count data? What other diagnostics
> should I use? What should they look like?
>
> Thanks!
>
> I just ran negative binomial and it did not converge.
>
>
>
> age was edited by: 57078

What data are you collecting, and on what geographic area?

I'm guessing "greenspace" is something like parks and so on. But income and race are individual variables ....

Why are there a lot of 0's?

Peter
StatDave
SAS Super FREQ
A plot of the standardized deviance residuals (STDRESDEV= option) or the likelihood residuals (RESLIK= option) can be useful. A plot of either against the predicted values should be randomly scattered about zero. Points beyond about 2 or -2 are not being fit well by the model. Plots of these residuals against the predictors can help you determine the proper form of the predictors in the model.

Note the apparent overdispersion may really mean that you need additional effects in your model. If there is overdispersion, you could use the REPEATED statement to fit the model using GEE (even if you have no repeated data). GEE corrects for overdispersion. For more, see this usage note:

http://support.sas.com/kb/22630

sas-innovate-2024.png

Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.

Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 1498 views
  • 0 likes
  • 3 in conversation