BookmarkSubscribeRSS Feed
j4sanford
Calcite | Level 5

I wanted to know if anyone has found a cut off determinant for assessing multi-collinearity. I used the proc corr statement (which I think is spearman's correlation) but everything I am reading seems to have found no consensus and varies anywhere from above 0.5 to 0.9. Anyone know what a good cut off is?

2 REPLIES 2
SAS_Rob
SAS Employee

You would be best served using the COLLIN option in Proc REG to assess collinearity.


The numbers listed in the "Collinearity Diagnostics" table represent the number
of eigenvalues extracted from the rescaled X`X matrix. They are listed from
largest to smallest.

You can look at the Condition Number to determine if there is any collinearity.
The condition indices are the square roots of the ratio of the largest
eigenvalue to each individual eigenvalue. The largest condition index is the
condition number of the scaled X matrix. Belsey, Kuh, and Welsch (1980) suggest
that, when this number is around 10, weak dependencies may be starting to
affect the regression estimates. When this number is larger than 100, the
estimates may have a fair amount of numerical error (although the statistical
standard error almost always is much greater than the numerical error).
For each variable, PROC REG produces the proportion of the variance of the
estimate accounted for by each principal component. A collinearity problem
occurs when a component associated with a high condition index contributes
strongly (variance proportion greater than about 0.5) to the variance of two or
more variables.

The VIF option in the MODEL statement provides the Variance Inflation Factors
(VIF). These factors measure the inflation in the variances of the parameter
estimates due to collinearities that exist among the regressor (dependent)
variables. There are no formal criteria for deciding if a VIF is large enough
to affect the predicted values, although some authorities (Myers 1990) state
that values exceeding 10 may be cause for concern. The variables with the
larger VIF values may indicate that those variables are the ones involved in
the collinearity.

PaigeMiller
Diamond | Level 26

@j4sanford wrote:

I wanted to know if anyone has found a cut off determinant for assessing multi-collinearity. I used the proc corr statement (which I think is spearman's correlation) but everything I am reading seems to have found no consensus and varies anywhere from above 0.5 to 0.9. Anyone know what a good cut off is?


It is not a good idea to create multiple threads with the same question. People read one thread but are unaware of the other thread, and they don't get the full answer.

 

May I suggest that all further discussion go into your original thread at https://communities.sas.com/t5/Mathematical-Optimization/Collinearity-Diagnostics-Using-the-Informat... where there are already other answers.

--
Paige Miller

SAS Innovate 2025: Call for Content

Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!

Submit your idea!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 1500 views
  • 0 likes
  • 3 in conversation