BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
fatemeh
Quartz | Level 8

Hello all,

Any help will be appreciated to help me find a solution for my problem.

My question is that if we apply proc mixed to find model for multilevel repeated measures while the response variable does not have normal distribution, will the estimates be biased? if yes how can i solve the issue? How can i find out what distribution my response has other than histogram? If my response has exponential or other distribution, how can i find the right distribution?

1 ACCEPTED SOLUTION

Accepted Solutions
PaigeMiller
Diamond | Level 26

There is no such assumption that the response variable has to be normally distributed.

 

In this type of modeling, the errors have to be independent and identically distributed according to some normal distribution.

 

https://blogs.sas.com/content/iml/2018/08/27/on-the-assumptions-and-misconceptions-of-linear-regress...

--
Paige Miller

View solution in original post

9 REPLIES 9
PaigeMiller
Diamond | Level 26

There is no such assumption that the response variable has to be normally distributed.

 

In this type of modeling, the errors have to be independent and identically distributed according to some normal distribution.

 

https://blogs.sas.com/content/iml/2018/08/27/on-the-assumptions-and-misconceptions-of-linear-regress...

--
Paige Miller
fatemeh
Quartz | Level 8
Thanks for your reply. But because the observations are repeated measures so they are correlated, should the errors be dependent?
PaigeMiller
Diamond | Level 26

I think properly specified repeated measures models handle this type of potential dependency between the repeated measures.

 

Again, it's the errors that have to be independent and normally distributed, not the values of the response variable.

--
Paige Miller
Ksharp
Super User
Yes. I think so. It is about R-side random effect .
Ksharp
Super User

You could check Goodness-of-test , like AIC BIC ....... which is smaller and better to fit data.
@Rick_SAS know more something .

@lvm @StatDave @SteveDenham 

SteveDenham
Jade | Level 19

The information criteria have to be used carefully.  In order to be effective in selecting a model (or a distribution), the data must be identical in the two instances.  This rules out comparing something with an identity link (like lognormal) to something with a log link (like exponential), or normal (identity) vs almost any other distribution modeled by GLIMMIX.

 

A good introduction to the assumptions in mixed modeling can be found in SAS for Mixed Models (Stroup et al.) or Generalized Linear and Nonlinear Models for Correlated Data: Theory and Applications Using SAS by Ed Vonesh.  The latter may be quite useful to you, given your short description of your data.

 

SteveDenham

jiltao
SAS Super FREQ

Independent errors is never one of the model assumptions for mixed models. Normal distribution with the variance covariance matrix of R is one of the model assumptions. You would use the REPEATED statement in PROC MIXED to model the correlated residuals. Or, you could use the RANDOM statement to model the the correlated observations in the response. 

If you do a residual analysis in PROC MIXED, how non-normal does it look like? One way is to use PROC GLIMMIX to model your data if a different distribution (from the exponential family) is more appropriate.

PaigeMiller
Diamond | Level 26

@jiltao wrote:

Independent errors is never one of the model assumptions for mixed models. Normal distribution with the variance covariance matrix of R is one of the model assumptions.

 

Thanks for the correction!

--
Paige Miller
fatemeh
Quartz | Level 8

Thanks a lot for your valuable responses,

My data is multilevel longitudinal repeated measures over time which is not growth data (is not longitudinal growth data) as i understood response variable does not need to be normally distributed, so if i apply glimmix and my response does not look normal how can i find what distribution should i select in dist section of model statement in proc glimmix?

SAS Innovate 2025: Call for Content

Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!

Submit your idea!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 9 replies
  • 2473 views
  • 10 likes
  • 5 in conversation