BookmarkSubscribeRSS Feed
mydamj
Calcite | Level 5

when the dependent variable is count (output of proc freq) and most of the independent variables are also count, what regression model is ideal to use. Also, what SAS procedure should be used?

 

example: Instead of individual data, grouped data(like county wise) data is available. 

2 REPLIES 2
sbxkoenk
SAS Super FREQ

BR, Koen

StatDave
SAS Super FREQ

If, at the subject level, the response is binary (such as yes/no, positive/negative) so that your data are aggregated binary data and you have the numerator and denominator counts making up the proportions, then you can fit a logistic model in procedures such as LOGISTIC, PROBIT, GENMOD, GAM, ADAPTIVEREG and others by using the events/trials syntax in the MODEL statement. The model assumes that the proportions represent a set of independent Bernoulli trials and have a binomial distribution. See the example in the Getting Started section of the PROC LOGISTIC documentation. But if your data are counts of some event over a period of time (such as person-years) such that the ratio of count/time is a rate that can exceed 1, then you would typically model the rate using a Poisson or negative binomial model with the log of the time variable as an offset. See the Poisson Regression example in the Getting Started section of the PROC GENMOD documentation. 

SAS Innovate 2025: Call for Content

Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 16. Read more here about why you should contribute and what is in it for you!

Submit your idea!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 441 views
  • 7 likes
  • 3 in conversation