<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Tolerance and Variance Inflation Factor in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863311#M42687</link>
    <description>&lt;P&gt;A predictor is collinear if it is strongly correlated with is some linear combination of the other predictors. It is not like a correlation matrix where each cell of the matrix is a correlation of just one variable with one other variable. Collinearity can be more complex by involving multiple variables. So, a matrix of values like you want is not possible. You might find the results produced by the COLLIN or COLLINOINT options more helpful. As noted in "Collinearity Diagnostics" in the Details section of the PROC REG documentation, the results include the condition numbers, which are derived from the eigenvalues of X'X. If a condition number is much larger than 10, that indicates that collinearity exists. The eigenvector values displayed next to such a large condition number describe the relation among the individual predictors. See &lt;A href="http://support.sas.com/kb/32471" target="_self"&gt;this note&lt;/A&gt; that discusses collinearity in generalized linear models (the ordinary regression model is just one type) and shows an example of using the COLLIN and COLLINOINT options to detect and understand collinearity in a model.&lt;/P&gt;</description>
    <pubDate>Thu, 09 Mar 2023 21:56:53 GMT</pubDate>
    <dc:creator>StatDave</dc:creator>
    <dc:date>2023-03-09T21:56:53Z</dc:date>
    <item>
      <title>Tolerance and Variance Inflation Factor</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863264#M42682</link>
      <description>&lt;P&gt;According to ChatGPT (&lt;span class="lia-unicode-emoji" title=":thinking_face:"&gt;🤔&lt;/span&gt;) I can use the TOL and VIF options in PROC REG to assess multicollinearity.&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;PROC REG DATA=mydata;
MODEL dependent_variable = independent_variable1 independent_variable2 independent_variable3 / &lt;FONT color="#FF0000"&gt;&lt;STRONG&gt;TOL VIF&lt;/STRONG&gt;&lt;/FONT&gt;;
RUN;
&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;The tolerance and VIF values in the resulting table are for each&amp;nbsp;independent variable w/ the&amp;nbsp;dependent variable, correct?&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Screenshot 2023-03-09 at 11.33.35 AM.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/81273i2CDEFCE1FED36F7B/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Screenshot 2023-03-09 at 11.33.35 AM.png" alt="Screenshot 2023-03-09 at 11.33.35 AM.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;But that's not what I'm seeking. I'm interested in multicollinearity between&amp;nbsp;independent variables, not between each&amp;nbsp;independent variable and the&amp;nbsp;dependent variable.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I'm trying to produce a matrix similar to the correlation matrix in PROC CORR, but with Tolerance and VIF.&lt;/P&gt;
&lt;P&gt;Any recommendations for how to do this?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Thu, 09 Mar 2023 19:37:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863264#M42682</guid>
      <dc:creator>_maldini_</dc:creator>
      <dc:date>2023-03-09T19:37:55Z</dc:date>
    </item>
    <item>
      <title>Re: Tolerance and Variance Inflation Factor</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863311#M42687</link>
      <description>&lt;P&gt;A predictor is collinear if it is strongly correlated with is some linear combination of the other predictors. It is not like a correlation matrix where each cell of the matrix is a correlation of just one variable with one other variable. Collinearity can be more complex by involving multiple variables. So, a matrix of values like you want is not possible. You might find the results produced by the COLLIN or COLLINOINT options more helpful. As noted in "Collinearity Diagnostics" in the Details section of the PROC REG documentation, the results include the condition numbers, which are derived from the eigenvalues of X'X. If a condition number is much larger than 10, that indicates that collinearity exists. The eigenvector values displayed next to such a large condition number describe the relation among the individual predictors. See &lt;A href="http://support.sas.com/kb/32471" target="_self"&gt;this note&lt;/A&gt; that discusses collinearity in generalized linear models (the ordinary regression model is just one type) and shows an example of using the COLLIN and COLLINOINT options to detect and understand collinearity in a model.&lt;/P&gt;</description>
      <pubDate>Thu, 09 Mar 2023 21:56:53 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863311#M42687</guid>
      <dc:creator>StatDave</dc:creator>
      <dc:date>2023-03-09T21:56:53Z</dc:date>
    </item>
    <item>
      <title>Re: Tolerance and Variance Inflation Factor</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863585#M42706</link>
      <description>&lt;P&gt;Since you have proposed the "matrix version of the indicators of collinearity", I think it necessary for you to delve a bit deeper into the essence of tolerance and variance inflation factor (VIF).&lt;/P&gt;
&lt;P&gt;The essence I wish to talk about is:&amp;nbsp;&lt;STRONG&gt;How are the two statistics computed?&lt;/STRONG&gt; I think you will find it unnecessary to generate a "matrix version of the indicators of collinearity" once you have mastered the way that the two statistics are computed (at least I think so).&lt;/P&gt;
&lt;P&gt;Before we talk about the two statistics, let's talk about the definition of collinearity first. In short, for a given set of variables, namely &lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Eqn001.gif" style="width: 94px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/81366iC19EDC1B207A5F35/image-dimensions/94x31?v=v2" width="94" height="31" role="button" title="Eqn001.gif" alt="Eqn001.gif" /&gt;&lt;/span&gt;, if a series of constants, &lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Eqn001.gif" style="width: 85px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/81367i069B28E91A0EAF6F/image-dimensions/85x28?v=v2" width="85" height="28" role="button" title="Eqn001.gif" alt="Eqn001.gif" /&gt;&lt;/span&gt;is existent in making this equation true: &lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Eqn001.gif" style="width: 180px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/81368iC9324215BB1E3197/image-dimensions/180x26?v=v2" width="180" height="26" role="button" title="Eqn001.gif" alt="Eqn001.gif" /&gt;&lt;/span&gt;, then we say that collinearity exists among &lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Eqn001.gif" style="width: 94px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/81366iC19EDC1B207A5F35/image-dimensions/94x31?v=v2" width="94" height="31" role="button" title="Eqn001.gif" alt="Eqn001.gif" /&gt;&lt;/span&gt;.&lt;/P&gt;
&lt;P&gt;Let's talk about tolerance now. How is it computed? In short, if we want to calculate the tolerance of&amp;nbsp;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Eqn001.gif" style="width: 26px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/81369i17892B552D166051/image-dimensions/26x36?v=v2" width="26" height="36" role="button" title="Eqn001.gif" alt="Eqn001.gif" /&gt;&lt;/span&gt;&amp;nbsp;(p=1, 2, 3, ..., &lt;EM&gt;n&lt;/EM&gt;), then we use&amp;nbsp;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Eqn001.gif" style="width: 26px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/81369i17892B552D166051/image-dimensions/26x36?v=v2" width="26" height="36" role="button" title="Eqn001.gif" alt="Eqn001.gif" /&gt;&lt;/span&gt;&amp;nbsp;as the dependent variable and all of the other variables (i.e.&amp;nbsp;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Eqn001.gif" style="width: 284px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/81370iCD02443BAF8A2420/image-dimensions/284x39?v=v2" width="284" height="39" role="button" title="Eqn001.gif" alt="Eqn001.gif" /&gt;&lt;/span&gt;) as the independent variable to form a multivariate linear regression. We can calculate a regression coefficient for this linear regression.&amp;nbsp;&lt;STRONG&gt;Tolerance is the result of 1 subtracts that regression&lt;/STRONG&gt; &lt;STRONG&gt;coefficient.&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;As for variance inflation index, understanding that statistic would be easy once you've come to know the calculation formula of tolerance.&amp;nbsp;&lt;STRONG&gt;VIF is the inverse of tolerance (i.e. &lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Eqn001.gif" style="width: 72px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/81371i73E59612B39B5A76/image-size/large?v=v2&amp;amp;px=999" role="button" title="Eqn001.gif" alt="Eqn001.gif" /&gt;&lt;/span&gt;).&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;So now you can see, for each independent variable, we can generate a pair of statistics that measure its collinearity. Since the tolerance as well as VIF of each and every independent variable take both the information of that variable and other independent variables in the model into account simultaneously, there is no need for you to compute a matrix that specifically tells you the collinearity between one independent variable and another one.&lt;/P&gt;
&lt;P&gt;In addition to tolerance and VIF, conditional index is also another indicator of collinearity. The computation of condition index is more complex than tolerance and VIF, since eigenvectors are key players in the computation of conditional index. You need mathematical knowledge on matrix algebra before you can understand the computation of condition index. From a practical perspective, a condition index larger than 10 suggests collinearity, while a condition index larger than 30 suggests severe collinearity. You can let SAS compute condition index by using the method demonstrated by&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13633"&gt;@StatDave&lt;/a&gt;. In practice, SAS generates a table (two tables if you specify the collinoint option) and I usually look from the bottom of the table(s) to the top, since the largest condition index is at the bottom of the table(s). If the largest condition index is smaller than 10, then the result suggest that no collinearity exist.&lt;/P&gt;
&lt;P&gt;Collinearity is an aspect of statistical diagnostics (or regression diagnostics). You can refer to a book on multivariate statistics or a monograph on statistical diagnostics if you want more information. But please be aware of the fact that many statistical methods (like the computation of condition index) are based on matrix algebra, so it may be necessary in some circumstances that you master (at least know something about) matrix algebra before you study statistics.&lt;/P&gt;</description>
      <pubDate>Sat, 11 Mar 2023 08:21:08 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863585#M42706</guid>
      <dc:creator>Season</dc:creator>
      <dc:date>2023-03-11T08:21:08Z</dc:date>
    </item>
    <item>
      <title>Re: Tolerance and Variance Inflation Factor</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863593#M42707</link>
      <description>&lt;BLOCKQUOTE&gt;
&lt;P&gt;The tolerance and VIF values in the resulting table are for each independent variable w/ the dependent variable, correct?&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Incorrect.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;VIF is independent of Y. It is a measure of how much each independent variable is affected by multi-collinearity — specifically how much the variance of the estimate of the coefficient of that factor is inflated by multi-collinearity.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Same for tolerance. 1/Tolerance=VIF&lt;/P&gt;</description>
      <pubDate>Sat, 11 Mar 2023 12:09:56 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863593#M42707</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2023-03-11T12:09:56Z</dc:date>
    </item>
    <item>
      <title>Re: Tolerance and Variance Inflation Factor</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863840#M42713</link>
      <description>Is there supposed to be an "or" here: "A predictor is collinear if it is strongly correlated with, OR is some linear combination of the other predictors."&lt;BR /&gt;</description>
      <pubDate>Mon, 13 Mar 2023 17:41:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863840#M42713</guid>
      <dc:creator>_maldini_</dc:creator>
      <dc:date>2023-03-13T17:41:50Z</dc:date>
    </item>
    <item>
      <title>Re: Tolerance and Variance Inflation Factor</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863842#M42714</link>
      <description>&lt;P&gt;I can't speak for&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13633"&gt;@StatDave&lt;/a&gt;&amp;nbsp;but I would take the second ocurrence of the word "is" out of that sentence. So in my opinion it should read&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;"A predictor is collinear if it is strongly correlated with some linear combination of the other predictors"&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 13 Mar 2023 17:46:48 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863842#M42714</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2023-03-13T17:46:48Z</dc:date>
    </item>
    <item>
      <title>Re: Tolerance and Variance Inflation Factor</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863843#M42715</link>
      <description>Thanks</description>
      <pubDate>Mon, 13 Mar 2023 17:48:25 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863843#M42715</guid>
      <dc:creator>_maldini_</dc:creator>
      <dc:date>2023-03-13T17:48:25Z</dc:date>
    </item>
    <item>
      <title>Re: Tolerance and Variance Inflation Factor</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863848#M42716</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/10892"&gt;@PaigeMiller&lt;/a&gt;&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13633"&gt;@StatDave&lt;/a&gt;&amp;nbsp;Thanks for your help here!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;So the values resulted from the TOL and VIF options on the MODEL statement in PROC REG are for each factor, but are representative of the combined multicollinearity of all the independent variables in the model, correct?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;According to &lt;A href="https://www.amazon.com/Multivariable-Analysis-Practical-Clinicians-Researchers/dp/0521141079" target="_self"&gt;this textbook&lt;/A&gt;, “In general, small tolerance values, including those below 0.25, are worrisome, and those below 0.10 are serious.” And, “Variance inflation factors greater than 2.5 may be problematic, whereas values greater than ten are serious”&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Given the above, and my objective of wanting to diagnose multicollinearity that needs to be addressed, do the COLLIN or COLLINOINT options, and the condition numbers, offer additional important information that I should be considering?&lt;/P&gt;</description>
      <pubDate>Mon, 13 Mar 2023 18:14:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863848#M42716</guid>
      <dc:creator>_maldini_</dc:creator>
      <dc:date>2023-03-13T18:14:37Z</dc:date>
    </item>
    <item>
      <title>Re: Tolerance and Variance Inflation Factor</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863863#M42717</link>
      <description>Yes, they do provide useful additional information. As I noted earlier, the eigenvector values associated with any large condition number indicating collinearity tell you something about which variables are collinear.</description>
      <pubDate>Mon, 13 Mar 2023 18:34:23 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863863#M42717</guid>
      <dc:creator>StatDave</dc:creator>
      <dc:date>2023-03-13T18:34:23Z</dc:date>
    </item>
    <item>
      <title>Re: Tolerance and Variance Inflation Factor</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863868#M42718</link>
      <description>Thank you!</description>
      <pubDate>Mon, 13 Mar 2023 18:38:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Tolerance-and-Variance-Inflation-Factor/m-p/863868#M42718</guid>
      <dc:creator>_maldini_</dc:creator>
      <dc:date>2023-03-13T18:38:18Z</dc:date>
    </item>
  </channel>
</rss>

