<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Proc Logistic for categorical variables in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194411#M10342</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;You recommend backward or forward selection? I don't want to remove a variable. I want to remove that level from a variable. It may overestime / underestimate my predicted probability. &lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Thu, 28 May 2015 18:16:31 GMT</pubDate>
    <dc:creator>Ujjawal</dc:creator>
    <dc:date>2015-05-28T18:16:31Z</dc:date>
    <item>
      <title>Proc Logistic for categorical variables</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194409#M10340</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I am using CLASS statement with PARAM =REF option in proc logistic to include categorical variables. My question - When i run PROC LOGISTIC with Selection = STEPWISE, it does not check significance of LEVELS (GROUPS) of a categorical variable. It only checks whether a caterical variable as a whole is significant or not. In other words, even if a category of a categorical variable is insignificant, it does not exclude it. But if create dummy variables with reference category manually, it removes the dummy variable that is insignificant. I understand it considers it as a separate variable itself. But isnt it statistically incorrect? Any workaround?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 27 May 2015 18:11:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194409#M10340</guid>
      <dc:creator>Ujjawal</dc:creator>
      <dc:date>2015-05-27T18:11:31Z</dc:date>
    </item>
    <item>
      <title>Re: Proc Logistic for categorical variables</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194410#M10341</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Everything in STEPWISE is, at best, highly questionable and, at worst, outright wrong.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;However, here, you have shown that you can make stepwise behave in either of two ways: Treat the categorical variable as a single variable or treat each level as a single variable.&amp;nbsp; I recommend the first. Perhaps you want to exclude any variable that is insignificant at &lt;EM&gt;any&lt;/EM&gt; level?&amp;nbsp; I think that would be an (added) mistake, but you could certainly do it by&amp;nbsp; hand (e.g. by removing that variable from the list). &lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 28 May 2015 10:55:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194410#M10341</guid>
      <dc:creator>plf515</dc:creator>
      <dc:date>2015-05-28T10:55:18Z</dc:date>
    </item>
    <item>
      <title>Re: Proc Logistic for categorical variables</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194411#M10342</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;You recommend backward or forward selection? I don't want to remove a variable. I want to remove that level from a variable. It may overestime / underestimate my predicted probability. &lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 28 May 2015 18:16:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194411#M10342</guid>
      <dc:creator>Ujjawal</dc:creator>
      <dc:date>2015-05-28T18:16:31Z</dc:date>
    </item>
    <item>
      <title>Re: Proc Logistic for categorical variables</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194412#M10343</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;It appears that you want to collapse levels within a categorical variable, but I may be misinterpreting.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Why would you want to do that?&amp;nbsp; Please explain.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Steve Denham&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 28 May 2015 18:27:01 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194412#M10343</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2015-05-28T18:27:01Z</dc:date>
    </item>
    <item>
      <title>Re: Proc Logistic for categorical variables</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194413#M10344</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;It's a marketing (churn) model. Most of the significant variables are continuous and only two character variables are appearing and they make sense in terms of business logic and statistical significance. So i was just checking their significance so i put them in CLASS statement with PARAM = REF option. And run stepwise, some levels are coming out insignificant at 5% level, even 10% level. SO i thought better to ignore these categories (levels). But SAS does not check levels while selecting variables via STEPWISE or any selection technique. I guess it's better to ignore these levels and make model more &lt;SPAN style="font-size: 13.3333330154419px;"&gt;parsimonious&amp;nbsp; &lt;/SPAN&gt;with low degree of freedom.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 28 May 2015 18:53:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194413#M10344</guid>
      <dc:creator>Ujjawal</dc:creator>
      <dc:date>2015-05-28T18:53:54Z</dc:date>
    </item>
    <item>
      <title>Re: Proc Logistic for categorical variables</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194414#M10345</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;It sounds like you are looking for the SPLIT option, which is supported in the CLASS statement of HPLOGISITC and HPGENSELECT.&lt;/P&gt;&lt;P&gt;&lt;A class="active_link" href="http://support.sas.com/documentation/cdl/en/stathpug/67524/HTML/default/viewer.htm#stathpug_introcom_stat_sect003.htm" title="http://support.sas.com/documentation/cdl/en/stathpug/67524/HTML/default/viewer.htm#stathpug_introcom_stat_sect003.htm"&gt;SAS/STAT(R) 13.2 User's Guide: High-Performance Procedures&lt;/A&gt;&lt;/P&gt;&lt;P&gt;I think most (all?) of the HP regression procedures that support variable selection also support the SPLIT option.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 28 May 2015 19:30:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194414#M10345</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2015-05-28T19:30:42Z</dc:date>
    </item>
    <item>
      <title>Re: Proc Logistic for categorical variables</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194415#M10346</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks! No, i don't want any interaction between variables. It's a dummy variable with K-1 coding. Setting one value as a reference category. And then evaluating significance of each categories of a variable.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 28 May 2015 21:21:51 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194415#M10346</guid>
      <dc:creator>Ujjawal</dc:creator>
      <dc:date>2015-05-28T21:21:51Z</dc:date>
    </item>
    <item>
      <title>Re: Proc Logistic for categorical variables</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194416#M10347</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;No, I don't think it's better to delete some levels of a categorical variable.&amp;nbsp; That winds up being an uninterpretable model.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;E.g. suppose the variable is race and you have White, Black, Asian, Other.&amp;nbsp; Suppose only White and Asian are significant.&amp;nbsp; Then if you delete the other levels, you are comparing Whites to Asians without controlling for Black or Other.&amp;nbsp; Keep all levels.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Parsimony is often the enemy&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 28 May 2015 22:13:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194416#M10347</guid>
      <dc:creator>plf515</dc:creator>
      <dc:date>2015-05-28T22:13:55Z</dc:date>
    </item>
    <item>
      <title>Re: Proc Logistic for categorical variables</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194417#M10348</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I was wrong. The doc says that the SPLIT option is only available for HPREG.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 29 May 2015 13:12:13 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194417#M10348</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2015-05-29T13:12:13Z</dc:date>
    </item>
    <item>
      <title>Re: Proc Logistic for categorical variables</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194418#M10349</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Good point, Peter, about parsimony.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Proceeding from the maxim "All models are wrong, but some models are useful" using parsimony as the only tool to select a model is, at least to me, akin to choosing the nearest rock as a weapon when a dragon attacks, while ten feet farther away is a sword designed especially for dragon slaying.&amp;nbsp; It may take a little more work to get to the sword, and it takes some skill to use it, but one is far likelier to be happy with the results.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Steve Denham&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 29 May 2015 13:24:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/194418#M10349</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2015-05-29T13:24:39Z</dc:date>
    </item>
    <item>
      <title>Re: Proc Logistic for categorical variables</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/231060#M12179</link>
      <description>&lt;P style="margin: 0in 0in 10pt;"&gt;&lt;FONT color="#000000" face="Calibri" size="3"&gt;It is the same thing. Since your reference level is not part of your regression (dropped), removing insignificant dummy is essentially the same as combining it with your reference. So you just have new reference variable implicitly.&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 21 Oct 2015 21:31:59 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Proc-Logistic-for-categorical-variables/m-p/231060#M12179</guid>
      <dc:creator>eisforendo</dc:creator>
      <dc:date>2015-10-21T21:31:59Z</dc:date>
    </item>
  </channel>
</rss>

