Hello, I am trying to model a continuous outcome variable which is highly skewed. I have several predictor variables in the model both continuous and categorical. The q-q plot of the residuals is shown below. As you can see, the normality assumption is clearly violated. I tried log transforming the outcome variable but it doesn't seem to fix the problem. Any body has an idea of how to remedy this issue ? Does the central limit theorem apply here? Thanks. Here is the code used: proc glmselect data=b; class a b c d e / param=reference; model y=a b c d e f ; output out=check r=residuals; run; proc univariate data=check; var residuals; histogram residuals / normal kernel; qqplot residuals / normal(mu=est sigma=est); run;
... View more