BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
Hank
Fluorite | Level 6

Hi,

I hope there is anyone to answer this one.. Smiley Happy

When a regression is done, you get lots of free diagrams and histograms that come along with the results. I have attached two examples below. I went to a SAS course in statistics/anova/regression, but I never figured out how to interpret the lowest middle graph in the fit diagnostics table (attached in first file).

Is there anyone who can explain how to interpret these results? I.e. what do "fit mean" and "residual" show, in proportion to "proportion less"? what should the desired shape/form for the curve/line be? How should I interpret the results in the attached fil, as there seems to be a nice nonlinear line?

Most importantly, what remedies and actions should be taken to act on an eventual outcome that is not desired?

These results comes from a regression where I have hoped to rule out anything unwanted in an OLS regression (i.e multicollinearity, heteroscedasticity etc. The response has been box-cox transformed to get linear residuals- which normality test show). Therefore I am a bit worried about the curvature on the graphs.

Best regards,

Hank 


Box-Cox_Y.PNGFit diagnostics.PNG
1 ACCEPTED SOLUTION

Accepted Solutions
Rick_SAS
SAS Super FREQ

There is a description of the Residual-Fit Spread Plot in the PROC REG doc:SAS/STAT(R) 9.3 User's Guide (search for "Residual-Fit")

Basically, you look at the heights of the two plots. If the left side (Fit) is taller than the right side (Residuals), then it means that the variables in the model explain a lot of the variation in the response variable.  If the right side (residuals) are taller, then it means that there is still a lot of unexplained variation.

View solution in original post

2 REPLIES 2
Rick_SAS
SAS Super FREQ

There is a description of the Residual-Fit Spread Plot in the PROC REG doc:SAS/STAT(R) 9.3 User's Guide (search for "Residual-Fit")

Basically, you look at the heights of the two plots. If the left side (Fit) is taller than the right side (Residuals), then it means that the variables in the model explain a lot of the variation in the response variable.  If the right side (residuals) are taller, then it means that there is still a lot of unexplained variation.

sas-innovate-2024.png

Available on demand!

Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 13606 views
  • 1 like
  • 2 in conversation