BookmarkSubscribeRSS Feed
Cyprinus
Calcite | Level 5

Hi.

First of all, I do not consider myself being a stats-guru and my "extended" SAS and stats-knowledge is mostly self-tought. My question deals with the SAS procedure GLIMMIX and, more precisely, the generated output table.

My research question is related to animal sciences and with my statistical analysis I am trying to identify how different feeds and ingredients may interact with one another affect animal performance (my response variable).

 

I thought of analyzing my data using a full factorial GLMM. I am having 2 class variables and I came up with the following model:

 

ods graphics on;
proc glimmix
Data=Data2 plots =pearsonpanel;
class RefDiet TestIngr;
model ADCprotI=RefDiet TestIngr RefDiet*TestIngr / dist=beta
link=logit solution;
lsmeans RefDiet*TestIngr / adjust=bon ilink oddsratio;
run;
ods graphics off;

 

The beta distribution was chosen according to the residual panel, as it seems to provide a better fit and be more appropriate than a gaussian-shaped distribution (response variables are expressed as proportions with values between 0 and 1). With one specific response variable I am now getting standard errors of 0 for all class variables, resulting in infinite t values. As a result, all class variables turn out to likely have a highly significant effect on my response variable. However, something seems wrong here, but I am currently stumped... Does anyone has a good suggestion on how to overcome this issue or am I doing something completely wrong here? Please let me know if you need more information + thanks a lot!

 

 

The result for e.g. fixed effects looks like this (lines should be labelled with: RefDiet TestIngr RefDiet*TestIngr)

 

Type III Tests of Fixed Effects Effect
Num DF Den DF F Value Pr > F
116Infty<.0001
316Infty<.0001
316Infty<.0001

 

 

And for the parameter estimates (just the upper lines, the shown line names are: RefDiet 1   RefDiet 2   TestIngr   10 TestIngr   20 TestIngr   30...)

 

 

Parameter Estimates Effect RefDiet TestIngr Estimate Standard
Error DF t Value Pr > |t|

2.1766

016Infty<.0001
2.4569016Infty<.0001
-0.4995016-Infty<.0001
-0.1410016-Infty<.0001
-0.01848016-Infty<.0001
6 REPLIES 6
PaigeMiller
Diamond | Level 26

You get infinity for a t-test value when there is zero variability in the residuals. In other words, the model has fit the data exactly.

--
Paige Miller
Cyprinus
Calcite | Level 5

Thank you very much for your quick reply, Paige. In other words: would you consider it legitimate to report this outcome "as is" (standard-error = 0 and F-value infinity)? Of course this makes running a multiple comparison test a bit difficult (impossible)...

PaigeMiller
Diamond | Level 26

I'm sure there are some real-world situations where this is a legitimate outcome, but there are also some real-world situations where this is not a legitimate outcome.

 

The zero error situation could arise from mis-specifiying (actually over-specifying) the model (i.e. statistical analysis error), it could also arise from degenerate data (perhaps some error in data collection or data transmission or error in entering the data into the database or computer coding error).

 

It could be caused by a many different things.

--
Paige Miller
Ksharp
Super User

Why not use DIST=BINOMIAL since your link function is LOGIT ?

lvm
Rhodochrosite | Level 12 lvm
Rhodochrosite | Level 12

Your model is likely overspecified (too many terms). Also note (very important): the beta distribution is defined for 0 < y < 1. Thus, 0s and 1s are not allowed. All proportions equal to 0 or 1 become missing in the analysis. If you don't have 0s and 1s, then you are fine, but otherwise you are not using all your data. Be careful.

 

If your proportions are for a continuous variable, then binomial is not appropriate.

Cyprinus
Calcite | Level 5

Apologies for the late reply and thanks for the notification. Indeed, binomial would not be appropriate and, likely, I should go with a beta (proportions are for a continuous variable). The model stated above seemed to work for a bunch of variables, but is failing for the one listed above...

sas-innovate-2024.png

Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.

Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 6 replies
  • 5073 views
  • 0 likes
  • 4 in conversation