BookmarkSubscribeRSS Feed
mydamj
Calcite | Level 5

when the dependent variable is count (output of proc freq) and most of the independent variables are also count, what regression model is ideal to use. Also, what SAS procedure should be used?

 

example: Instead of individual data, grouped data(like county wise) data is available. 

2 REPLIES 2
sbxkoenk
SAS Super FREQ

BR, Koen

StatDave
SAS Super FREQ

If, at the subject level, the response is binary (such as yes/no, positive/negative) so that your data are aggregated binary data and you have the numerator and denominator counts making up the proportions, then you can fit a logistic model in procedures such as LOGISTIC, PROBIT, GENMOD, GAM, ADAPTIVEREG and others by using the events/trials syntax in the MODEL statement. The model assumes that the proportions represent a set of independent Bernoulli trials and have a binomial distribution. See the example in the Getting Started section of the PROC LOGISTIC documentation. But if your data are counts of some event over a period of time (such as person-years) such that the ratio of count/time is a rate that can exceed 1, then you would typically model the rate using a Poisson or negative binomial model with the log of the time variable as an offset. See the Poisson Regression example in the Getting Started section of the PROC GENMOD documentation. 

sas-innovate-white.png

Missed SAS Innovate in Orlando?

Catch the best of SAS Innovate 2025 — anytime, anywhere. Stream powerful keynotes, real-world demos, and game-changing insights from the world’s leading data and AI minds.

 

Register now

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 889 views
  • 7 likes
  • 3 in conversation