BookmarkSubscribeRSS Feed
myotis
Calcite | Level 5
Hi, I am new to SAS with limited stats knowledge. I recorded bats at 4 different types of habitats and I visited the same four types of habiats at 5 different sites. I wanted to know whether bat counts differed among habitat types so my model was:

proc genmod;
class habitat site;
model bats=habitat site habitat*site; dist=poisson type3 scale=deviance;
run;

and the data was input as 3 comlumns: bats site habitat

My output is below and my question is:
WHY ARE THERE BLANKS IN THE OUTPUT?
I have read the SAS manual and it really wasn't helpful, it just mentions aliasing but does not say what I am doing wrong or if it is ok to have the blanks. Does it mean anything in terms of interpreting the data?

Thank you verry much to anywone who can help


Analysis Of Parameter Estimates

Standard Wald 95% Chi-
Parameter DF Estimate Error Confidence Limits Square Pr > ChiSq

Intercept 1 1.2510 0.2630 0.7354 1.7665 22.62 <.0001
habitat 1 1 -1.2719 0.5107 -2.2729 -0.2708 6.20 0.0128
habitat 2 1 -0.0273 0.3737 -0.7598 0.7052 0.01 0.9417
habitat 3 1 -1.9311 0.6999 -3.3028 -0.5594 7.61 0.0058
habitat 4 0 0.0000 0.0000 0.0000 0.0000 . .
site 1 1 0.2241 0.3751 -0.5111 0.9593 0.36 0.5502
site 2 1 -0.3330 0.3965 -1.1101 0.4441 0.71 0.4010
site 3 1 -0.3541 0.4510 -1.2380 0.5297 0.62 0.4323
site 4 1 -0.7664 0.4397 -1.6281 0.0954 3.04 0.0813
site 5 0 0.0000 0.0000 0.0000 0.0000 . .
site*hab 1 1 1 -0.2532 0.7274 -1.6788 1.1724 0.12 0.7278
site*hab 1 2 1 -1.8486 0.6939 -3.2087 -0.4885 7.10 0.0077
site*hab 1 3 1 0.3482 0.9877 -1.5877 2.2840 0.12 0.7245
site*hab 1 4 0 0.0000 0.0000 0.0000 0.0000 . .
site*hab 2 1 1 -1.1652 1.0638 -3.2503 0.9199 1.20 0.2734
site*hab 2 2 1 -1.6519 0.7785 -3.1777 -0.1261 4.50 0.0338
site*hab 2 3 1 1.2806 0.8699 -0.4243 2.9855 2.17 0.1410
site*hab 2 4 0 0.0000 0.0000 0.0000 0.0000 . .
site*hab 3 1 1 0.1689 0.8570 -1.5108 1.8486 0.04 0.8437
site*hab 3 2 1 -0.3576 0.6287 -1.5898 0.8746 0.32 0.5695
site*hab 3 3 1 1.1753 0.9074 -0.6033 2.9538 1.68 0.1953
site*hab 3 4 0 0.0000 0.0000 0.0000 0.0000 . .
site*hab 4 1 1 0.5127 0.7911 -1.0378 2.0632 0.42 0.5169
site*hab 4 2 1 0.7183 0.5803 -0.4190 1.8557 1.53 0.2158
site*hab 4 3 1 0.7792 1.1431 -1.4612 3.0195 0.46 0.4955
site*hab 4 4 0 0.0000 0.0000 0.0000 0.0000 . .
site*hab 5 1 0 0.0000 0.0000 0.0000 0.0000 . .
site*hab 5 2 0 0.0000 0.0000 0.0000 0.0000 . .
site*hab 5 3 0 0.0000 0.0000 0.0000 0.0000 . .
site*hab 5 4 0 0.0000 0.0000 0.0000 0.0000 . .
Dispersion 1 0.1289 0.0816 -0.0310 0.2888 Message was edited by: myotis
1 REPLY 1
Susan
Calcite | Level 5
By "blanks" do you mean the periods "." that indicate missing values and are associated with df=0 and Estimate=0?

If so, then those are not indicative of a problem per se (although you do, I believe, have a problem with your model specification, which I'll address below). The short answer is that by default GENMOD includes an intercept and uses GLM (reference-cell) coding where the last level of each factor is the reference. The parameter for the reference level (e.g., habitat 4) is *set to zero*; thus, the parameter estimate has SE=0, and the other statistics fall out accordingly. See the "Parameterization Used in PROC GENMOD" section in the SAS/STAT GENMOD documentation for a wealth of useful details.

However, I think you still have a problem: You have 20 observations. The MODEL statement instructs GENMOD to estimate the following parameters:

1 intercept
3 habitat
4 site
12 habitat*site
--------------------
20 parameters total (as listed in your output)

which leaves you with zero df for any stat tests.

To obtain a significance test of whether means are equal for all levels of some fixed-effects factor (like HABITAT), you need appropriate replications of HABITAT. I might guess that you are thinking of SITEs as replications of HABITAT. If so, then your model statement should look more like

model bats=habitat / dist=poisson type3 scale=deviance;

Something else to think about is whether your count data follow the Poisson distribution well. For the Poisson, the mean is equal to the variance; often counts have variance greater than the mean, which is known as over-dispersion. Sometimes a negative binomial is appropriate. Sometimes the counts are high enough that a normal distribution fits well. Sometimes there are too many zeros ("zero-inflated"). Counts are not always Poisson; you just have to dive in and find out.

HTH,
Susan

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 1240 views
  • 0 likes
  • 2 in conversation