<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Need advice on how to deal with multicollinearity in PROC LOGISTIC in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Need-advise-on-how-to-deal-with-multicollinearity-in-PROC/m-p/712733#M34449</link>
    <description>&lt;P&gt;Hello SAS experts,&lt;BR /&gt;My question is about multi-collinearity in logistic regression. I have two categorical and two continuous variables. I ran the original model using PROC LOGISTIC.&lt;BR /&gt;I wanted to run the full model (4 variables) including interactions, but the model becomes "saturated". I decided to run two separate analyses: 1) one for the two categorical variables + interactions and 2) the other for the continuous variables + interactions.&lt;BR /&gt;The problem with both analyses is the presence of multi-collinearity.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;I think this may be heresy, but, in order to show you how bad the multi-collinearity is, I ran the analysis for the categorical variables in Minitab 19 using the "Binary Logistic Regression" tool, because it provides a compact table with the VIF for each categorical variable and their interactions and automatically shows the diagnostics for the model.&lt;BR /&gt;The results of the full analysis are shown in the figure below.&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="VIF_TO_SAS.png" style="width: 320px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/53658i6EAA7EDC41AD636C/image-size/large?v=v2&amp;amp;px=999" role="button" title="VIF_TO_SAS.png" alt="VIF_TO_SAS.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;The no-interaction model for the categorical variables still shows multi-collinearity (VIF above 1 for all variables).&lt;/P&gt;&lt;P&gt;Given that multi-collinearity is caused by the explanatory variables being correlated, I thought the most simple solution for my data would be to run one logistic regression for each of the variables that I need to evaluate.&lt;BR /&gt;Is that an approach any of you could agree with? If not, is there a better solution you could suggest?&lt;/P&gt;&lt;P&gt;Thank you in advance.
I apologize for the Minitab output.&lt;BR /&gt;Regards,&lt;/P&gt;&lt;P&gt;Marcel&lt;/P&gt;</description>
    <pubDate>Wed, 20 Jan 2021 15:33:34 GMT</pubDate>
    <dc:creator>marcel</dc:creator>
    <dc:date>2021-01-20T15:33:34Z</dc:date>
    <item>
      <title>Need advice on how to deal with multicollinearity in PROC LOGISTIC</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Need-advise-on-how-to-deal-with-multicollinearity-in-PROC/m-p/712733#M34449</link>
      <description>&lt;P&gt;Hello SAS experts,&lt;BR /&gt;My question is about multi-collinearity in logistic regression. I have two categorical and two continuous variables. I ran the original model using PROC LOGISTIC.&lt;BR /&gt;I wanted to run the full model (4 variables) including interactions, but the model becomes "saturated". I decided to run two separate analyses: 1) one for the two categorical variables + interactions and 2) the other for the continuous variables + interactions.&lt;BR /&gt;The problem with both analyses is the presence of multi-collinearity.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;I think this may be heresy, but, in order to show you how bad the multi-collinearity is, I ran the analysis for the categorical variables in Minitab 19 using the "Binary Logistic Regression" tool, because it provides a compact table with the VIF for each categorical variable and their interactions and automatically shows the diagnostics for the model.&lt;BR /&gt;The results of the full analysis are shown in the figure below.&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="VIF_TO_SAS.png" style="width: 320px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/53658i6EAA7EDC41AD636C/image-size/large?v=v2&amp;amp;px=999" role="button" title="VIF_TO_SAS.png" alt="VIF_TO_SAS.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;The no-interaction model for the categorical variables still shows multi-collinearity (VIF above 1 for all variables).&lt;/P&gt;&lt;P&gt;Given that multi-collinearity is caused by the explanatory variables being correlated, I thought the most simple solution for my data would be to run one logistic regression for each of the variables that I need to evaluate.&lt;BR /&gt;Is that an approach any of you could agree with? If not, is there a better solution you could suggest?&lt;/P&gt;&lt;P&gt;Thank you in advance.
I apologize for the Minitab output.&lt;BR /&gt;Regards,&lt;/P&gt;&lt;P&gt;Marcel&lt;/P&gt;</description>
      <pubDate>Wed, 20 Jan 2021 15:33:34 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Need-advise-on-how-to-deal-with-multicollinearity-in-PROC/m-p/712733#M34449</guid>
      <dc:creator>marcel</dc:creator>
      <dc:date>2021-01-20T15:33:34Z</dc:date>
    </item>
    <item>
      <title>Re: Need advice to deal with multicollinearity in PROC LOGISTIC</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Need-advise-on-how-to-deal-with-multicollinearity-in-PROC/m-p/712739#M34450</link>
      <description>&lt;BLOCKQUOTE&gt;
&lt;P&gt;&lt;SPAN&gt;I thought the most simple solution for my data would be to run one logistic regression for each of the variables that I need to evaluate.&lt;/SPAN&gt;&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&lt;SPAN&gt;And then what? Once you have several separate logistic regressions, how do you continue the analysis to say what happens (or how the model predicts) using more than one (or maybe even all) of the variables?&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Please consider using Partial Least Squares Regression (PROC PLS), which is robust against multicollinearity. Oh ... wait ... that only works for continuous Y variables, not binary Y variables. You could use the Logistic Partial Least Squares method (&lt;A href="https://cedric.cnam.fr/fichiers/RC906.pdf" target="_blank"&gt;https://cedric.cnam.fr/fichiers/RC906.pdf&lt;/A&gt;) which is robust against multicollinearity and works well in my experience, but no SAS code is available, although I think there is an R package which does this.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 20 Jan 2021 15:20:44 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Need-advise-on-how-to-deal-with-multicollinearity-in-PROC/m-p/712739#M34450</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2021-01-20T15:20:44Z</dc:date>
    </item>
    <item>
      <title>Re: Need advice to deal with multicollinearity in PROC LOGISTIC</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Need-advise-on-how-to-deal-with-multicollinearity-in-PROC/m-p/712740#M34451</link>
      <description>&lt;P&gt;If there is collinearity among the predictors, it is important to determine what defines the collinearity. See &lt;A href="http://support.sas.com/kb/32471" target="_self"&gt;this note&lt;/A&gt; which produces collinearity statistics for a logistic regression and examines the eigenvectors to determine the nature of the collinearity in the model.&lt;/P&gt;</description>
      <pubDate>Wed, 20 Jan 2021 15:24:17 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Need-advise-on-how-to-deal-with-multicollinearity-in-PROC/m-p/712740#M34451</guid>
      <dc:creator>StatDave</dc:creator>
      <dc:date>2021-01-20T15:24:17Z</dc:date>
    </item>
    <item>
      <title>Re: Need advice to deal with multicollinearity in PROC LOGISTIC</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Need-advise-on-how-to-deal-with-multicollinearity-in-PROC/m-p/712951#M34457</link>
      <description>&lt;P&gt;Your comment is interesting, because I have quasi-complete separation of points, and for that reason I used the Firth correction. So now I have to find out whether the large VIFs are due to the quasi-complete separation or to multi-collinearity.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am having issues with the coding of my categorical variables when I try to use them with PROC GENMOD and PROC REG.&lt;/P&gt;&lt;P&gt;My original table has this formatting:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="table3.jpg" style="width: 149px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/53725i7868CDC961771AC6/image-size/large?v=v2&amp;amp;px=999" role="button" title="table3.jpg" alt="table3.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In order to use it with PROC GENMOD and PROC REG, I am planning to code it like this:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="table2.jpg" style="width: 149px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/53726i7BFC9BDCF6B40A35/image-size/large?v=v2&amp;amp;px=999" role="button" title="table2.jpg" alt="table2.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Is that the right way to do it?&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Marcel&lt;/P&gt;</description>
      <pubDate>Thu, 21 Jan 2021 01:20:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Need-advise-on-how-to-deal-with-multicollinearity-in-PROC/m-p/712951#M34457</guid>
      <dc:creator>marcel</dc:creator>
      <dc:date>2021-01-21T01:20:10Z</dc:date>
    </item>
    <item>
      <title>Re: Need advice to deal with multicollinearity in PROC LOGISTIC</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Need-advise-on-how-to-deal-with-multicollinearity-in-PROC/m-p/712952#M34458</link>
      <description>&lt;P&gt;Thank you Paige Miller. First, I will try to figure out if the large VIFs are due to the quasi-complete separation of points I found in my data.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Jan 2021 01:23:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Need-advise-on-how-to-deal-with-multicollinearity-in-PROC/m-p/712952#M34458</guid>
      <dc:creator>marcel</dc:creator>
      <dc:date>2021-01-21T01:23:30Z</dc:date>
    </item>
    <item>
      <title>Re: Need advice to deal with multicollinearity in PROC LOGISTIC</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Need-advise-on-how-to-deal-with-multicollinearity-in-PROC/m-p/713035#M34461</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/118515"&gt;@marcel&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;Your comment is interesting. Because I have a quasi-complete separation of points. For that reason I used the Firth correction. So now I have to find out if the large VIFs are due to the quasi-complete separation or multi-collinearity.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am having issues with the coding for my categorical variables to use them with proc genmod and proc reg.&lt;/P&gt;
&lt;P&gt;My original table has this formatting:&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="table3.jpg" style="width: 149px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/53725i7868CDC961771AC6/image-size/large?v=v2&amp;amp;px=999" role="button" title="table3.jpg" alt="table3.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In order to use it with proc genmod and proc reg I am planning to code it like this:&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="table2.jpg" style="width: 149px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/53726i7BFC9BDCF6B40A35/image-size/large?v=v2&amp;amp;px=999" role="button" title="table2.jpg" alt="table2.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Recoding variable values from a b c ... to 1 2 3 ... makes not the slightest bit of difference if they are still CLASS variables. If you are thinking of turning the variables into continuous variables this way, I think that's a mistake unless they REALLY are continuous or ordinal.&lt;/P&gt;
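&lt;P&gt;To illustrate, here is a minimal sketch that uses the SASHELP.HEART sample data in place of the table in the question (that substitution is an assumption). Sex and BP_Status are character variables there, and the CLASS statement accepts them as they are. Recoding the levels to 1 2 3 may change which level is treated as the reference, but it should give the same fitted model. PROC GENMOD has a CLASS statement as well, while PROC REG does not, so for PROC REG the dummy variables would have to be created first.&lt;/P&gt;
&lt;PRE&gt;/* Sketch only: SASHELP.HEART stands in for the original data */
proc logistic data=sashelp.heart;
  /* character levels are used directly, no numeric recoding needed */
  class sex bp_status / param=ref;
  model status(event='Dead') = sex bp_status;
run;&lt;/PRE&gt;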
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You say you are having "issues", but you don't specify what the "issues" are, or how this re-coding changes anything.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Jan 2021 11:45:51 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Need-advise-on-how-to-deal-with-multicollinearity-in-PROC/m-p/713035#M34461</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2021-01-21T11:45:51Z</dc:date>
    </item>
    <item>
      <title>Re: Need advice on how to deal with multicollinearity in PROC LOGISTIC</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Need-advise-on-how-to-deal-with-multicollinearity-in-PROC/m-p/713045#M34462</link>
      <description>&lt;P&gt;VIF is a PROC REG option. In PROC LOGISTIC, check the CORRB option, which prints the correlation matrix of the parameter estimates.&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13684"&gt;@Rick_SAS&lt;/a&gt;&amp;nbsp;also wrote a blog post about it (the covariance of the estimators).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
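&lt;P&gt;For the PROC REG side of that remark, here is a rough sketch under stated assumptions: the 0/1 response DEAD and the dummy variable FEMALE are names made up for this example, because PROC REG needs a numeric response and has no CLASS statement. These unweighted VIFs depend only on the predictors and ignore the weights implied by the logistic fit, so treat them as a quick screen rather than a definitive diagnostic.&lt;/P&gt;
&lt;PRE&gt;/* Sketch only: DEAD and FEMALE are hypothetical helper variables */
data heart;
  set sashelp.heart;
  dead   = (status = 'Dead');    /* numeric 0/1 response */
  female = (sex = 'Female');     /* dummy variable replacing CLASS sex */
run;

proc reg data=heart;
  /* VIF and TOL request variance inflation factors and tolerances */
  model dead = female agechddiag ageatstart height weight diastolic / vif tol;
run;
quit;&lt;/PRE&gt;
&lt;P&gt;The CORRB request in PROC LOGISTIC looks like this:&lt;/P&gt;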
&lt;PRE&gt;proc logistic data=sashelp.heart;
  class sex;
  /* CORRB prints the correlation matrix of the parameter estimates */
  model status = sex agechddiag ageatstart height weight diastolic / corrb;
run;&lt;/PRE&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="x.png" style="width: 788px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/53734i10E6D51B19E362D3/image-size/large?v=v2&amp;amp;px=999" role="button" title="x.png" alt="x.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 21 Jan 2021 12:40:07 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Need-advise-on-how-to-deal-with-multicollinearity-in-PROC/m-p/713045#M34462</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2021-01-21T12:40:07Z</dc:date>
    </item>
  </channel>
</rss>

