<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to get the multicolinearity in SAS to deal with Simpson Paradox? in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737540#M229953</link>
    <description>&lt;OL&gt;
&lt;LI&gt;Some people prefer to look at Tolerance, others look at VIF. Doesn't really matter.&lt;/LI&gt;
&lt;LI&gt;Yes, 1.59 for RunTime indicates the multicollinearity is not having a major impact on the estimate of the RunTime coefficient. Opposite is true for RunPulse, big effect of multicollinearity.&lt;/LI&gt;
&lt;/OL&gt;</description>
    <pubDate>Wed, 28 Apr 2021 11:09:33 GMT</pubDate>
    <dc:creator>PaigeMiller</dc:creator>
    <dc:date>2021-04-28T11:09:33Z</dc:date>
    <item>
      <title>Getting multicolinearity dealing with Simpson Paradox using SAS?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737532#M229950</link>
      <description>&lt;P&gt;Hi all SAS Experts,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;When regressing nonexperimental data, I faced the Simpson Paradox. Simple description from this &lt;A href="https://stats.stackexchange.com/questions/1580/regression-coefficients-that-flip-sign-after-including-other-predictors?newreg=96eee013a9d04e22bd58fa82fa62043a" target="_self"&gt;post&lt;/A&gt;:&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;Imagine&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;You run a linear regression with four numeric predictors (IV1, ..., IV4)&lt;/LI&gt;
&lt;LI&gt;When only IV1 is included as a predictor the standardised beta is&lt;CODE&gt;+.20&lt;/CODE&gt;&lt;/LI&gt;
&lt;LI&gt;When you also include IV2 to IV4 the sign of the standardised regression coefficient of IV1 flips to&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;CODE&gt;-.25&lt;/CODE&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;(i.e., it's become negative)&lt;/LI&gt;
&lt;/UL&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;In my case, when I run the linear regression:&lt;/P&gt;
&lt;P&gt;y=b1x1+b2x2+b3x3.&lt;/P&gt;
&lt;P&gt;b1 get the positive value. However, when I add the variable x4 to the regression, b1 becomes negative.&lt;/P&gt;
&lt;P&gt;I look at a &lt;A href="https://stats.stackexchange.com/questions/1580/regression-coefficients-that-flip-sign-after-including-other-predictors?newreg=96eee013a9d04e22bd58fa82fa62043a" target="_self"&gt;comment of this topic,&lt;/A&gt;one of the reason maybe due to the multicollinearity among variables.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Phil_NZ_0-1619605616279.png" style="width: 999px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/58764iCC4508735FA44BD3/image-size/large?v=v2&amp;amp;px=999" role="button" title="Phil_NZ_0-1619605616279.png" alt="Phil_NZ_0-1619605616279.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;I am wondering how to check whether Simpson's paradox in my case is caused by multicollinearity? How to generate the multicollinearity table among variables using SAS?&lt;/P&gt;
&lt;P&gt;Many thanks and warm regards.&lt;/P&gt;</description>
      <pubDate>Wed, 28 Apr 2021 10:38:57 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737532#M229950</guid>
      <dc:creator>Phil_NZ</dc:creator>
      <dc:date>2021-04-28T10:38:57Z</dc:date>
    </item>
    <item>
      <title>Re: How to get the multicolinearity in SAS to deal with Simpson Paradox?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737533#M229951</link>
      <description>&lt;P&gt;Multicollinearity can be checked via the VIF calculations in PROC REG, high values of VIF indicate multicollinearity is a serious problem. (The issue of what counts as "high values of VIF" isn't that clear, some people say high is &amp;gt;10 others say high is &amp;gt;5, and I have also seen &amp;gt;3)&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Example:&amp;nbsp;&lt;A href="https://documentation.sas.com/doc/en/pgmsascdc/9.4_3.4/statug/statug_reg_details24.htm" target="_blank"&gt;https://documentation.sas.com/doc/en/pgmsascdc/9.4_3.4/statug/statug_reg_details24.htm&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 28 Apr 2021 10:38:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737533#M229951</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2021-04-28T10:38:54Z</dc:date>
    </item>
    <item>
      <title>Re: How to get the multicolinearity in SAS to deal with Simpson Paradox?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737538#M229952</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/10892"&gt;@PaigeMiller&lt;/a&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thank you for your reply and the SAS document. When reading the document, there are two things popping up on my head:&lt;/P&gt;
&lt;P&gt;1. I am wondering why they have the VIF but they still show the Tolerance&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Phil_NZ_0-1619607815423.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/58765iA4FE5DD7CAD387E2/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Phil_NZ_0-1619607815423.png" alt="Phil_NZ_0-1619607815423.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;2. So, VIF shows the multicollinearity between this variable and all other regressors, am I correct? For example, in the Table above, 1.59 is the multicollinearity between RunTime and all other independent variables?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Many thanks and warm regards.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 28 Apr 2021 11:05:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737538#M229952</guid>
      <dc:creator>Phil_NZ</dc:creator>
      <dc:date>2021-04-28T11:05:50Z</dc:date>
    </item>
    <item>
      <title>Re: How to get the multicolinearity in SAS to deal with Simpson Paradox?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737540#M229953</link>
      <description>&lt;OL&gt;
&lt;LI&gt;Some people prefer to look at Tolerance, others look at VIF. Doesn't really matter.&lt;/LI&gt;
&lt;LI&gt;Yes, 1.59 for RunTime indicates the multicollinearity is not having a major impact on the estimate of the RunTime coefficient. Opposite is true for RunPulse, big effect of multicollinearity.&lt;/LI&gt;
&lt;/OL&gt;</description>
      <pubDate>Wed, 28 Apr 2021 11:09:33 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737540#M229953</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2021-04-28T11:09:33Z</dc:date>
    </item>
    <item>
      <title>Re: Getting multicolinearity dealing with Simpson Paradox using SAS?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737558#M229956</link>
      <description>&lt;P&gt;That is normal and not really due to multicolinearity. When you add another variable into model the coefficient of v1 would change , it is all depend on data and GLM . So nothing to worry about . MODEL is build on data . If your data was not real good , Paradox would happen.&lt;/P&gt;
&lt;P&gt;I also talk to&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13684"&gt;@Rick_SAS&lt;/a&gt;&amp;nbsp; about this topic (coefficient is changed). He also has the same opinion with me .&lt;/P&gt;</description>
      <pubDate>Wed, 28 Apr 2021 12:41:17 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737558#M229956</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2021-04-28T12:41:17Z</dc:date>
    </item>
    <item>
      <title>Re: Getting multicolinearity dealing with Simpson Paradox using SAS?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737561#M229957</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/18408"&gt;@Ksharp&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;That is normal and not really due to multicolinearity. When you add another variable into model the coefficient of v1 would change , it is all depend on data and GLM . So nothing to worry about . MODEL is build on data . If your data was not real good , Paradox would happen.&lt;/P&gt;
&lt;P&gt;I also talk to&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13684"&gt;@Rick_SAS&lt;/a&gt;&amp;nbsp; about this topic (coefficient is changed). He also has the same opinion with me .&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;If there is no multicollinearity, and all variables are orthogonal to each other, the coefficient of x1 will not change when another variable is added to the model. So the only explanation that I know of for coefficients changing is (partial) multicollinearity. What is another explanation for coefficients changing (and signs of the coefficient changing as well)?&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;I think it is something to worry about, especially if the coefficients change sign when variables are added.&lt;/P&gt;</description>
      <pubDate>Wed, 28 Apr 2021 12:57:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737561#M229957</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2021-04-28T12:57:42Z</dc:date>
    </item>
    <item>
      <title>Re: Getting multicolinearity dealing with Simpson Paradox using SAS?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737566#M229959</link>
      <description>"and all variables are orthogonal to each other, "&lt;BR /&gt;In real world , any two variables have correlation . You will NOT be able to find orthogonal variables .&lt;BR /&gt;&lt;BR /&gt;"if the coefficients change sign when variables are added."&lt;BR /&gt;It is all about model and data. &lt;BR /&gt;For any kind of statistic model , when you add one more variable, the variance of model would be assign to this variable (more or less). it is inevitable .</description>
      <pubDate>Wed, 28 Apr 2021 13:14:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737566#M229959</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2021-04-28T13:14:39Z</dc:date>
    </item>
    <item>
      <title>Re: Getting multicolinearity dealing with Simpson Paradox using SAS?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737573#M229965</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/18408"&gt;@Ksharp&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;"and all variables are orthogonal to each other, "&lt;BR /&gt;In real world , any two variables have correlation . You will NOT be able to find orthogonal variables .&lt;BR /&gt;&lt;BR /&gt;&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;If we are discussing the math, you can have orthogonal variables and understand the effect of adding another variable. If the variables are orthogonal, adding a variable does not cause the other coefficients to change. If the variables are not orthogonal, the effect is that adding another variable causes coefficients to change, and the only reason I know is multicollinearity.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In the real world, you can have orthogonal variables in a designed experiment.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;&lt;BR /&gt;"if the coefficients change sign when variables are added."&lt;BR /&gt;It is all about model and data. &lt;BR /&gt;For any kind of statistic model , when you add one more variable, the variance of model would be assign to this variable (more or less). it is inevitable .&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Okay, I agree with the above if you don't have orthogonal variables. But earlier you said it is "&lt;SPAN&gt;nothing to worry about"&amp;nbsp;&lt;/SPAN&gt;and you said it is "&lt;SPAN&gt;not really due to multicolinearity", those are the parts I disagree with. In my opinion, it is multicollinearity, and it is something to worry about.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 28 Apr 2021 13:36:07 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737573#M229965</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2021-04-28T13:36:07Z</dc:date>
    </item>
    <item>
      <title>Re: Getting multicolinearity dealing with Simpson Paradox using SAS?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737576#M229968</link>
      <description>So how do you judge multicollinearity?  rho=0.3 or rho=o.8 ? event with rho=0.1 , the coefficient would also be change into another sign. Sometimes with good data , the coefficient would not change, sometimes would be .It is depend on your data , it is not the fault of GLM .</description>
      <pubDate>Wed, 28 Apr 2021 13:46:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737576#M229968</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2021-04-28T13:46:29Z</dc:date>
    </item>
    <item>
      <title>Re: Getting multicolinearity dealing with Simpson Paradox using SAS?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737749#M230018</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/10892"&gt;@PaigeMiller&lt;/a&gt;&amp;nbsp; and&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/18408"&gt;@Ksharp&lt;/a&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thank you so much for the discussion so far. I really and truly appreciate this. I try my best to deal with my data to see what really going on?&lt;/P&gt;
&lt;P&gt;First, I run the regression by putting variable by variable to see which variable cause the reversion in sign (I ran it on Stata because Stata provide me the tool to arrange the aesthetic table quickly by using &lt;STRONG&gt;est sto&lt;/STRONG&gt; and &lt;STRONG&gt;esttab&lt;/STRONG&gt; code.&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Phil_NZ_1-1619647482061.png" style="width: 999px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/58837i4F231D5B1080376F/image-size/large?v=v2&amp;amp;px=999" role="button" title="Phil_NZ_1-1619647482061.png" alt="Phil_NZ_1-1619647482061.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;As can be seen, the coefficients of my variable of interest pt changed its sign right at column 8. So, it seems that&amp;nbsp;wCAP_INT wGRO_MAR are the two caused the sign reversion. So following&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/10892"&gt;@PaigeMiller&lt;/a&gt;&amp;nbsp;'s suggestion, I get the VIF based on the code&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;proc reg data=merge_treat_con_copy;
	model wINV_DAY= pt wFIRM_SIZE LNGDP UNEMPLOYMENT  INFLATION 
		wTANGIBILITY FCF wLOG_MVE wCAP_INT wGRO_MAR/ tol vif collin;
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;And the result is as below:&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Phil_NZ_2-1619648011174.png" style="width: 999px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/58838iC9590C3D5609F9A1/image-size/large?v=v2&amp;amp;px=999" role="button" title="Phil_NZ_2-1619648011174.png" alt="Phil_NZ_2-1619648011174.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;However, in my regression above, I control for firm (variable &lt;STRONG&gt;Type&lt;/STRONG&gt;) and year (variable &lt;STRONG&gt;yr&lt;/STRONG&gt;) fixed effect, but I do not know how to put the firm and year fixed effect to proc reg above. I normally know we can put it in proc glm&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;proc glm data=merge_treat_con_copy;
    class type yr;
    model wINV_DAY= pt wFIRM_SIZE LNGDP UNEMPLOYMENT  INFLATION 
		wTANGIBILITY FCF wLOG_MVE wCAP_INT wGRO_MAR type yr/solution ss3;
run;
quit;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;But I cannot run this regression because the log shows that:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;ERROR: Number of levels for some effects &amp;gt; 32767.&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;Following &lt;A href="https://communities.sas.com/t5/Statistical-Procedures/PROC-GLM-and-number-of-levels-gt-32767/td-p/720344#M34874" target="_self"&gt;this discussion&lt;/A&gt;,&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13879"&gt;@Reeza&lt;/a&gt;&amp;nbsp; said that it may be because my &lt;STRONG&gt;Type&lt;/STRONG&gt; variable having more than 32K value (and it is true). But I have not yet known about the &lt;STRONG&gt;absorb&lt;/STRONG&gt; yet.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;So, can you please tell me how to get the vif and else in proc glm&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Many thanks and warm regards,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 28 Apr 2021 22:29:23 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737749#M230018</guid>
      <dc:creator>Phil_NZ</dc:creator>
      <dc:date>2021-04-28T22:29:23Z</dc:date>
    </item>
    <item>
      <title>Re: Getting multicolinearity dealing with Simpson Paradox using SAS?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737838#M230059</link>
      <description>&lt;BLOCKQUOTE&gt;
&lt;P&gt;So, can you please tell me how to get the vif and else in proc glm&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;I doubt there is real value in having &amp;gt;32767 levels of a variable. So, my advice is to re-think your design, or combine some of the levels such that there are much fewer than 32767 distinct levels.&lt;/P&gt;</description>
      <pubDate>Thu, 29 Apr 2021 10:25:46 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737838#M230059</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2021-04-29T10:25:46Z</dc:date>
    </item>
    <item>
      <title>Re: Getting multicolinearity dealing with Simpson Paradox using SAS?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737848#M230066</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/10892"&gt;@PaigeMiller&lt;/a&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thank you very much for your help so far.&lt;/P&gt;
&lt;P&gt;I think it is reasonable in my case regarding having &amp;gt;32767 distinct companies. Because I am dealing with all firms in 64 countries from 1990 to 2019, so it makes sense to me (I also compared to some international studies). I just sum up the total number of distinct firms in my sample and it is up to 39,049 firms. So when you control for firm fixed effect, there would be 39,049 - 1 =39048 levels, which is higher than 32767.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;It seems that it is the time to consider the absorb option? Does it make sense to you,&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/10892"&gt;@PaigeMiller&lt;/a&gt;&amp;nbsp; ?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Warm regards and thanks.&lt;/P&gt;</description>
      <pubDate>Thu, 29 Apr 2021 10:53:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737848#M230066</guid>
      <dc:creator>Phil_NZ</dc:creator>
      <dc:date>2021-04-29T10:53:10Z</dc:date>
    </item>
    <item>
      <title>Re: Getting multicolinearity dealing with Simpson Paradox using SAS?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737854#M230069</link>
      <description>&lt;P&gt;ABSORB allows the model to be fit. You cannot get predicted values if you use the ABSORB statement. You cannot get least squares means of the ABSORBed variables. ABSORB does not help you understand multicollinearity, and it does not allow you to reduce multicollinearity.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;With regard to the fact that you have &amp;gt;39,000 companies, I am still skeptical that having a variable with &amp;gt;39,000 levels in your model is truly valuable. But I don't know your exact problem as well as you know it. Only you can make that decision. If it was me at this point, I would try to combine companies by some criteria, such as Manufacturing, Financial, Retail, ... (you can determine what set of categories makes best sense to you) or by the first three digits of the SIC code, and see if that produces a good fit, and see if the multicollinearity is low. I suspect with &amp;gt;39,000 levels, you will always have high multicollinearity, but again, I don't have your data.&lt;/P&gt;</description>
      <pubDate>Thu, 29 Apr 2021 11:01:09 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Getting-multicolinearity-dealing-with-Simpson-Paradox-using-SAS/m-p/737854#M230069</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2021-04-29T11:01:09Z</dc:date>
    </item>
  </channel>
</rss>

