BookmarkSubscribeRSS Feed
BTAinRVA
Quartz | Level 8

I've been working thru some of the chapters in the Handbook of Statistical Analyses using SAS book. In chapter 9 section 3.2 Analyzing FAP Data, they use Proc Genmod with the Gamma distribution because there are some rather large counts. I've done some googling but can't really find an explanation on when I should use the Gamma distribution as opposed to Poisson or Negative Binomial. When I analyze the FAP data using Poisson distribution I get very different results than with the Gamma.

 

Any thoughts greatly appreciated!

 

 

The code and data can be downloaded here:

http://support.sas.com/downloads/package.htm?pid=1693

3 REPLIES 3
Rick_SAS
SAS Super FREQ

I don't have the book, so I can't comment on that example. However, there is an example in the PROC GENMOD doc that discusses using a gamma link for survival-type data. Maybe relevant?

Ksharp
Super User

Yeah. Rick direct you an good example. Gamma is usually for Positive Value(like age , survival time ...).

Poisson or Negative Binomial is usually for sparse/discrete data, if the happened proabiltiy of event is very small , you should consider to use Poisson or Negative Binomial.

lvm
Rhodochrosite | Level 12 lvm
Rhodochrosite | Level 12

The gamma is a 2-parameter continuous distribution for 0 < y < infinity. Generally, it is used for right-skewed distributions (a special case is the 1-parameter exponential distribution). In principle, it would not be used for counts, which are discrete. The Poisson and negative binomial are two common and popular distributions for count data. However, these distributions can be problematic when the counts are very large. For example, with bacterial cell counts, one might have values such as 10^4, 10^5, 10^8, and so on. In principle, every value from 0 to 10^8 is possible with a discrete distribution, but you can see the problem in dealing with this as a discrete random variable. Thus, as an approximation, the gamma (or log-normal or Weibull) is often used as an approximation of the discrete distribution when there are very large counts or the range of counts is very large. The gamma has the desirable property that the variance is a function of the mean, which is one of the properties of typical discrete distributions. 

 

Be careful when using the gamma: it is defined only for values of y larger than 0. If you have any zeros, they become missing values in the analysis (something you probably don't want). There are more general versions of the gamma that allow for 0 values, but these are not available in GENMOD.

sas-innovate-2024.png

Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.

Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 3 replies
  • 2103 views
  • 3 likes
  • 4 in conversation