BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
juanvg1972
Pyrite | Level 9

Hi,

 

I am trying to generate a model to predict the number of absences of a school alumn. My inputs var are: math and literature marks of the alumns and their sex. The number of absences vary from 0 to 12. I am using a GLM model with 'proc genmod'

 

proc genmod data = test;
 class sexo;
 model diasau = nota_mates nota_lengua sexo / dist=Poisson;
 store out=modelo_glm;
run;

I have some doubts:

 

- How can I select the apropiate distibution: Poisson...?

-  Do I have to do some log transformation?

 

I am not an expert in GLM models, but I know that I have to review the model deviance vs degree of freedoms:

Thats what I get:

 

genmod.png

 

Any advice will be greatly appreciated.

 

Thanks in advance?

1 ACCEPTED SOLUTION

Accepted Solutions
StatDave
SAS Super FREQ

A count response is most typically modeled using the Poisson or the negative binomial distribution. The negative binomial distribution is used when the Poisson model shows evidence of overdispersion as discussed in this note. Since your "Value/DF" values are all less than 1, there doesn't seem to be any evidence of overdispersion. When fitting either of those distributions, the log link function is the default, so that the model is log(mean) = intercept + b1*x1 + b2*x2 + ... .  So, you should not apply any transformation to your response.

View solution in original post

1 REPLY 1
StatDave
SAS Super FREQ

A count response is most typically modeled using the Poisson or the negative binomial distribution. The negative binomial distribution is used when the Poisson model shows evidence of overdispersion as discussed in this note. Since your "Value/DF" values are all less than 1, there doesn't seem to be any evidence of overdispersion. When fitting either of those distributions, the log link function is the default, so that the model is log(mean) = intercept + b1*x1 + b2*x2 + ... .  So, you should not apply any transformation to your response.

SAS Innovate 2025: Register Now

Registration is now open for SAS Innovate 2025 , our biggest and most exciting global event of the year! Join us in Orlando, FL, May 6-9.
Sign up by Dec. 31 to get the 2024 rate of just $495.
Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 1400 views
  • 1 like
  • 2 in conversation