BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
stan
Quartz | Level 8

There is geometric distribution in PROC GLIMMIX. What about hypergeometric? As far as I understand it is the same as binomial but without replacement. Thank you in advance.

1 ACCEPTED SOLUTION

Accepted Solutions
SteveDenham
Jade | Level 19

A hypergeometric distribution is not available in GLIMMIX, probably because it is really hard to fix marginal totals and still do REML.  Maybe future releases will make it available.  For now, the only way to fit a mixed model with a hypergeometric distribution would be to write the necessary code in PROC NLMIXED.

I'm curious as to how a hypergeometric distribution could be implemented in the context of mixed models, where anything that defined the marginal total would be a random variable, rather than a fixed sum.

Steve Denham

View solution in original post

3 REPLIES 3
SteveDenham
Jade | Level 19

A hypergeometric distribution is not available in GLIMMIX, probably because it is really hard to fix marginal totals and still do REML.  Maybe future releases will make it available.  For now, the only way to fit a mixed model with a hypergeometric distribution would be to write the necessary code in PROC NLMIXED.

I'm curious as to how a hypergeometric distribution could be implemented in the context of mixed models, where anything that defined the marginal total would be a random variable, rather than a fixed sum.

Steve Denham

stan
Quartz | Level 8

Steve,

I wanted to model hypergeometric distribution as my response variable is a characteristic of cells and there is no need to put them back in a vial. We consider cells with a certain characteristic and count their percent / proportion within a certain cell type. So the analyzed cell type might be considered as a subtype/subclass of cells.

Robin R. High wrote "[...] the hypergeometric distribution converges to the binomial, and then with the binomial, as n gets large and p gets small, so that the mean = n*p remains constant, the binomial converges to the Poisson ...  and, the Poisson converges to the normal.  So, if the event of interest is somewhat "rare" and the denominator is very large  (a "measure" of total size...), the binomial and poisson [...] will likely give you similar results, including fitted values". (I hope I cite correctly.)

As I understand correctly the denominator in my case is the number of cells of a type T. It ranges from 1000 to 3000 cells. The number of cells of interest t (the numerator) ranges from few decades (or less) to hundreds.

So, if the hypergeometric distribution is unavailable in PROC GLIMMIX now, I'll model the response variable as binomially distributed. Do I guess correct?

SteveDenham
Jade | Level 19

Given everything you have, either a binomial as planned, or maybe even a beta, if all values are bounded away from zero and one.

Steve Denham

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

What is Bayesian Analysis?

Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 3 replies
  • 937 views
  • 1 like
  • 2 in conversation