<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Indicator Variable in PROC REG in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107519#M5695</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Ok, let's focus on predictive modeling for this question.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;When we create a regression model for prediction, don't all coefficients included in the model need to be significant?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Fri, 26 Oct 2012 18:01:58 GMT</pubDate>
    <dc:creator>kurofufu</dc:creator>
    <dc:date>2012-10-26T18:01:58Z</dc:date>
    <item>
      <title>Indicator Variable in PROC REG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107508#M5684</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I created an indicator variable X1, coded 0 for group A and 1 for group B, and ran PROC REG on X1 along with other continuous X variables. Both the intercept and X1's coefficient are significant. But if I code X1 as 1 for group A and 0 for group B, the intercept is no longer significant. What is happening here, and how can this be explained?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Oct 2012 06:53:20 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107508#M5684</guid>
      <dc:creator>kurofufu</dc:creator>
      <dc:date>2012-10-26T06:53:20Z</dc:date>
    </item>
    <item>
      <title>Re: Indicator Variable in PROC REG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107509#M5685</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;When the parameterization of your model changes, the meaning of your parameters changes, and thus their statistical significance can change as well.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;So in the first case, the intercept means "what is Y for group A when all your continuous variables are 0", and in the second case, the intercept means "what is Y for group B when all your continuous variables are 0".&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Oct 2012 13:26:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107509#M5685</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2012-10-26T13:26:39Z</dc:date>
    </item>
    <item>
      <title>Re: Indicator Variable in PROC REG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107510#M5686</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Let me use regression equations to explain my question.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;First case:&amp;nbsp; Y = b0 + b1 * X1 + ...&lt;/P&gt;&lt;P&gt;2nd case:&amp;nbsp;&amp;nbsp; Y = c0 + c1 * X1 + ...&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;b0 and b1 are significant.&lt;/P&gt;&lt;P&gt;c0 is not significant; c1 is significant.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;So which one should I use? Which one is correct?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Oct 2012 16:32:22 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107510#M5686</guid>
      <dc:creator>kurofufu</dc:creator>
      <dc:date>2012-10-26T16:32:22Z</dc:date>
    </item>
    <item>
      <title>Re: Indicator Variable in PROC REG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107511#M5687</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;They are both correct, as I already explained. The intercepts b0 and c0 are not measuring the same thing.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Your equations leave out the term that accounts for the main effect of changing from group A to group B or vice versa.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;So the first case is really Y = b0 + b1*(group=B) + b2*x1 + b3*x2 + ...&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;and c0 = b0 + b1*(group=B) and b0 = c0 + c1 * (group=A) &amp;lt;=== c0 is not equal to b0; they are to be interpreted differently because they measure different things.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Oct 2012 16:41:32 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107511#M5687</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2012-10-26T16:41:32Z</dc:date>
    </item>
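    <!--
    Editor's sketch of the relationship described in the reply above, kept in an XML comment so the feed structure is unchanged. It fits the same made-up data under both codings of the indicator and checks that c0 = b0 + b1 and c1 = minus b1, with the slope on the continuous variable unchanged. The data, the variable names, and the small ols/solve helpers are assumptions made for illustration; they are not the poster's data and not what PROC REG does internally.

```python
# Illustrative sketch only: made-up data, plain-Python least squares.

def solve(A, b):
    """Solve the square linear system A x = b by Gauss-Jordan elimination."""
    n = len(b)
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]  # augmented matrix
    for col in range(n):
        # Partial pivoting: bring the largest entry in this column to the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(n):
            if r != col:
                factor = M[r][col] / M[col][col]
                M[r] = [a - factor * p for a, p in zip(M[r], M[col])]
    return [M[i][n] / M[i][i] for i in range(n)]


def ols(X, y):
    """Least-squares coefficients via the normal equations (X'X) beta = X'y."""
    k = len(X[0])
    xtx = [[sum(row[i] * row[j] for row in X) for j in range(k)] for i in range(k)]
    xty = [sum(row[i] * yi for row, yi in zip(X, y)) for i in range(k)]
    return solve(xtx, xty)


# Hypothetical data: two groups and one continuous predictor.
group = ["A", "A", "A", "A", "B", "B", "B", "B"]
x = [1.0, 2.0, 3.0, 4.0, 1.0, 2.0, 3.0, 4.0]
y = [2.1, 2.9, 4.2, 4.8, 5.0, 6.1, 6.9, 8.2]

# First coding: X1 = 0 for group A, 1 for group B. Columns: intercept, X1, x.
X_first = [[1.0, 1.0 if g == "B" else 0.0, xi] for g, xi in zip(group, x)]
# Second coding: X1 = 1 for group A, 0 for group B.
X_second = [[1.0, 1.0 if g == "A" else 0.0, xi] for g, xi in zip(group, x)]

b0, b1, b2 = ols(X_first, y)
c0, c1, c2 = ols(X_second, y)

print("first coding :", b0, b1, b2)
print("second coding:", c0, c1, c2)
print("c0 minus (b0 + b1):", c0 - (b0 + b1))  # zero up to rounding
print("c1 plus b1        :", c1 + b1)          # zero up to rounding
```

    Under the first coding the intercept is the group A intercept; under the second it is the group B intercept. A test of "intercept = 0" is therefore a test about a different quantity in each fit, which is why its significance can change while the fitted model does not.
    -->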
    <item>
      <title>Re: Indicator Variable in PROC REG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107512#M5688</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I understand that c0 is not equal to b0 and that the two equations are algebraically equivalent. But since c0 is not significant, how can we adopt the second equation?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Oct 2012 16:48:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107512#M5688</guid>
      <dc:creator>kurofufu</dc:creator>
      <dc:date>2012-10-26T16:48:10Z</dc:date>
    </item>
    <item>
      <title>Re: Indicator Variable in PROC REG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107513#M5689</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Don't you need to report significance information when presenting a regression equation?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Oct 2012 17:03:51 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107513#M5689</guid>
      <dc:creator>kurofufu</dc:creator>
      <dc:date>2012-10-26T17:03:51Z</dc:date>
    </item>
    <item>
      <title>Re: Indicator Variable in PROC REG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107514#M5690</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Yes, the equations are equivalent. The parts of the equation are not equivalent.&lt;/P&gt;&lt;PRE __jive_macro_name="quote" class="jive_text_macro jive_macro_quote"&gt;
&lt;P&gt;But since c0 is not significant, how can we adopt the second equation?&lt;/P&gt;
&lt;/PRE&gt;&lt;P&gt;It's just as valid as the first equation. You continue to confuse the validity of the equation with the meaning of its individual terms.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;It is up to you to understand how to interpret it properly. Perhaps instead of reporting intercepts, which is causing this confusion, you should report the value and the statistical significance of the delta between group A and group B, which I think is simply c0-b0. That seems like a better quantity to report.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Oct 2012 17:09:13 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107514#M5690</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2012-10-26T17:09:13Z</dc:date>
    </item>
    <item>
      <title>Re: Indicator Variable in PROC REG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107515#M5691</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Sir, we are creating a regression equation for prediction, not for comparison of two groups.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Oct 2012 17:12:32 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107515#M5691</guid>
      <dc:creator>kurofufu</dc:creator>
      <dc:date>2012-10-26T17:12:32Z</dc:date>
    </item>
    <item>
      <title>Re: Indicator Variable in PROC REG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107516#M5692</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;The way you have stated your problem, you are very much comparing two groups.&amp;nbsp; I believe a clearer statement of your objectives is needed, as it is clear you are missing PaigeMiller's point, which seems perfectly obvious to me.&amp;nbsp; Your parameterization of the indicator variables means that the two groups will have different intercept-like terms (overall intercept plus intercept due to group).&amp;nbsp; Consequently, it is not at all surprising that the results are significant in one case and not in the other. See PaigeMiller's response:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;They are both correct, as I already explained. The intercepts b0 and c0 are not measuring the same thing.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt; &lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Your equations leave out the term that accounts for the main effect of changing from group A to group B or vice versa.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt; &lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;So the first case is really Y = b0 + b1*(group=B) + b2*x1 + b3*x2 + ...&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt; &lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;and c0 = b0 + b1*(group=B) and b0 = c0 + c1 * (group=A) &amp;lt;=== c0 is not equal to b0; they are to be interpreted differently because they measure different things.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;So, you need to think along the following lines: Are the responses in the two groups parallel, so that the equations for the two groups differ only in the intercept?&amp;nbsp; Or is there an interaction between group and the other predictor variables?&amp;nbsp; In this case, I would strongly recommend using one of the SAS procedures that has a CLASS statement for your regression, such as GLM, MIXED, GENMOD, or GLIMMIX, and not using indicator variables. 
&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Steve Denham&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Oct 2012 17:39:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107516#M5692</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2012-10-26T17:39:18Z</dc:date>
    </item>
    <item>
      <title>Re: Indicator Variable in PROC REG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107517#M5693</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;PRE __jive_macro_name="quote" class="jive_text_macro jive_macro_quote"&gt;
&lt;P&gt;Sir, we are creating a regression equation for prediction, not for comparison of two groups.&lt;/P&gt;
&lt;/PRE&gt;&lt;P&gt;Okay, then why the concern about the different intercepts? As you said, the models are equivalent. Either will give you the same predicted values.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;As I have pointed out, and now as Steve seems to be pointing out, you can create models for prediction, or you can create models for understanding the individual terms (or both). Do NOT confuse the two. If you want a predictive model, then you choose either one and report the overall F as its level of significance. If you want to understand the individual terms, you report the tests of the individual model coefficients, with appropriate interpretation. (And of course you can do both.)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;You keep wandering back and forth between obtaining a predictive model and obtaining an understanding of the individual terms.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Oct 2012 17:44:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107517#M5693</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2012-10-26T17:44:18Z</dc:date>
    </item>
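    <!--
    Editor's sketch of the point made in the reply above: either coding of the indicator gives the same predicted values and the same overall F statistic. The data, names, and helper functions are assumptions made for illustration and are not taken from the thread; the overall F is computed here as (SSR / p) / (SSE / (n - p - 1)) for a model with an intercept and p other terms.

```python
# Illustrative sketch only: made-up data, plain-Python least squares.

def solve(A, b):
    """Solve the square linear system A x = b by Gauss-Jordan elimination."""
    n = len(b)
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]  # augmented matrix
    for col in range(n):
        # Partial pivoting: bring the largest entry in this column to the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(n):
            if r != col:
                factor = M[r][col] / M[col][col]
                M[r] = [a - factor * p for a, p in zip(M[r], M[col])]
    return [M[i][n] / M[i][i] for i in range(n)]


def ols(X, y):
    """Least-squares coefficients via the normal equations (X'X) beta = X'y."""
    k = len(X[0])
    xtx = [[sum(row[i] * row[j] for row in X) for j in range(k)] for i in range(k)]
    xty = [sum(row[i] * yi for row, yi in zip(X, y)) for i in range(k)]
    return solve(xtx, xty)


def fit_and_overall_f(X, y):
    """Return fitted values and the overall F for a model whose first column is the intercept."""
    beta = ols(X, y)
    fitted = [sum(bj * xj for bj, xj in zip(beta, row)) for row in X]
    n = len(y)
    p = len(X[0]) - 1  # number of terms other than the intercept
    y_mean = sum(y) / n
    sse = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))  # error sum of squares
    ssr = sum((fi - y_mean) ** 2 for fi in fitted)           # model sum of squares
    return fitted, (ssr / p) / (sse / (n - p - 1))


# Hypothetical data: two groups and one continuous predictor.
group = ["A", "A", "A", "A", "B", "B", "B", "B"]
x = [1.0, 2.0, 3.0, 4.0, 1.0, 2.0, 3.0, 4.0]
y = [2.1, 2.9, 4.2, 4.8, 5.0, 6.1, 6.9, 8.2]

X_first = [[1.0, 1.0 if g == "B" else 0.0, xi] for g, xi in zip(group, x)]   # A=0, B=1
X_second = [[1.0, 1.0 if g == "A" else 0.0, xi] for g, xi in zip(group, x)]  # A=1, B=0

fitted_first, f_first = fit_and_overall_f(X_first, y)
fitted_second, f_second = fit_and_overall_f(X_second, y)

max_gap = max(abs(u - v) for u, v in zip(fitted_first, fitted_second))
print("largest difference in predicted values:", max_gap)
print("overall F, first coding :", f_first)
print("overall F, second coding:", f_second)
```

    Only the individual coefficient tests depend on the coding; quantities that describe the fit as a whole do not.
    -->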
    <item>
      <title>Re: Indicator Variable in PROC REG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107518#M5694</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;GLM also produces the same result as REG.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Oct 2012 17:45:32 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107518#M5694</guid>
      <dc:creator>kurofufu</dc:creator>
      <dc:date>2012-10-26T17:45:32Z</dc:date>
    </item>
    <item>
      <title>Re: Indicator Variable in PROC REG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107519#M5695</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Ok, let's focus on predictive modeling for this question.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;When we create a regression model for prediction, don't all coefficients included in the model need to be significant?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Oct 2012 18:01:58 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107519#M5695</guid>
      <dc:creator>kurofufu</dc:creator>
      <dc:date>2012-10-26T18:01:58Z</dc:date>
    </item>
    <item>
      <title>Re: Indicator Variable in PROC REG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107520#M5696</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;With regard to the intercept(s), I would say "No". Leave them in the model, even if they are not statistically significant. (I expect others to disagree with this, but that is my position on the matter.)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;You might want to read "Analysis of Messy Data, Volume 1, Designed Experiments" by Milliken and Johnson. Even though yours is not a designed experiment, they talk about relevant issues in Chapter 9. In fact, they speak of the "Means Model", which is a distinctly different parameterization from the model you get through SAS. In the "Means Model", all these issues go away. There is a distinct coefficient for the intercept of Group A, and a distinct coefficient for the intercept of Group B. And then it doesn't matter whether you set A to be 0 and B to be 1, or the other way around.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Oct 2012 18:27:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107520#M5696</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2012-10-26T18:27:42Z</dc:date>
    </item>
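    <!--
    Editor's sketch of one reading of the "Means Model" idea mentioned in the reply above: drop the overall intercept and give each group its own indicator column, so each group gets its own intercept coefficient. This is an assumption about how the idea maps onto this two-group problem with a common slope, not a reproduction of Milliken and Johnson's treatment. The data, names, and helpers are made up for illustration.

```python
# Illustrative sketch only: made-up data, plain-Python least squares.

def solve(A, b):
    """Solve the square linear system A x = b by Gauss-Jordan elimination."""
    n = len(b)
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]  # augmented matrix
    for col in range(n):
        # Partial pivoting: bring the largest entry in this column to the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(n):
            if r != col:
                factor = M[r][col] / M[col][col]
                M[r] = [a - factor * p for a, p in zip(M[r], M[col])]
    return [M[i][n] / M[i][i] for i in range(n)]


def ols(X, y):
    """Least-squares coefficients via the normal equations (X'X) beta = X'y."""
    k = len(X[0])
    xtx = [[sum(row[i] * row[j] for row in X) for j in range(k)] for i in range(k)]
    xty = [sum(row[i] * yi for row, yi in zip(X, y)) for i in range(k)]
    return solve(xtx, xty)


# Hypothetical data: two groups and one continuous predictor.
group = ["A", "A", "A", "A", "B", "B", "B", "B"]
x = [1.0, 2.0, 3.0, 4.0, 1.0, 2.0, 3.0, 4.0]
y = [2.1, 2.9, 4.2, 4.8, 5.0, 6.1, 6.9, 8.2]

# Separate-intercepts coding: one column per group, no overall intercept column.
X_means = [[1.0 if g == "A" else 0.0, 1.0 if g == "B" else 0.0, xi]
           for g, xi in zip(group, x)]
int_a, int_b, slope = ols(X_means, y)

# The two reference codings from the thread, for comparison.
X_first = [[1.0, 1.0 if g == "B" else 0.0, xi] for g, xi in zip(group, x)]   # A=0, B=1
X_second = [[1.0, 1.0 if g == "A" else 0.0, xi] for g, xi in zip(group, x)]  # A=1, B=0
b0, b1, b2 = ols(X_first, y)
c0, c1, c2 = ols(X_second, y)

print("group A intercept:", int_a, "(b0 from the first coding:", b0, ")")
print("group B intercept:", int_b, "(c0 from the second coding:", c0, ")")
print("common slope     :", slope)
```

    In this coding each coefficient is labelled by the group it belongs to, so there is no reference group and no 0/1 choice that changes what the intercept terms mean.
    -->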
    <item>
      <title>Re: Indicator Variable in PROC REG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107521#M5697</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks for the great answer, PaigeMiller.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 26 Oct 2012 18:33:02 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Indicator-Variable-in-PROC-REG/m-p/107521#M5697</guid>
      <dc:creator>kurofufu</dc:creator>
      <dc:date>2012-10-26T18:33:02Z</dc:date>
    </item>
  </channel>
</rss>

