<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Is it a sensible practice to collapse categories of a predictor? in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Is-it-a-sensible-practice-to-collapse-categories-of-a-predictor/m-p/61115#M2847</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Depends on the context of the data. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If for example level 1 is age&amp;lt;30 and level 2 is age between 31 and 60 and level 3 is age &amp;gt;60 then you're simply recoding to age &amp;lt;30 and age&amp;gt;30 which is okay, as long as you don't introduce a bias into your data. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If you're combining things that don't make sense ie level 1 is unknown and level 2 is Grade 1 then there it doesn't make sense. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Basically, check if it makes logistic sense to collapse them from a business or interpretative perspective and check if the distribution is significantly different with the predictor ( a chi square usually works) to make sure you interpret things correctly. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I work in Health Care and we routinely do this.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Wed, 07 Sep 2011 01:18:10 GMT</pubDate>
    <dc:creator>Reeza</dc:creator>
    <dc:date>2011-09-07T01:18:10Z</dc:date>
    <item>
      <title>Is it a sensible practice to collapse categories of a predictor?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Is-it-a-sensible-practice-to-collapse-categories-of-a-predictor/m-p/61114#M2846</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I used logistic regression to run a model. An explanatory variable called 'location' has 3 levels (1, 2, 3). Level 3 is the reference group. For this analysis, the estimated regression coefficients for level 1 and level 2 are 1.102 and 1.111 respectively. As the values are very close, is it sensible to combine these two levels into a single level to make the model simpler? Or it is better to keep the two levels as what they are separate?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 07 Sep 2011 00:32:23 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Is-it-a-sensible-practice-to-collapse-categories-of-a-predictor/m-p/61114#M2846</guid>
      <dc:creator>Ruth</dc:creator>
      <dc:date>2011-09-07T00:32:23Z</dc:date>
    </item>
    <item>
      <title>Is it a sensible practice to collapse categories of a predictor?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Is-it-a-sensible-practice-to-collapse-categories-of-a-predictor/m-p/61115#M2847</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Depends on the context of the data. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If for example level 1 is age&amp;lt;30 and level 2 is age between 31 and 60 and level 3 is age &amp;gt;60 then you're simply recoding to age &amp;lt;30 and age&amp;gt;30 which is okay, as long as you don't introduce a bias into your data. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If you're combining things that don't make sense ie level 1 is unknown and level 2 is Grade 1 then there it doesn't make sense. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Basically, check if it makes logistic sense to collapse them from a business or interpretative perspective and check if the distribution is significantly different with the predictor ( a chi square usually works) to make sure you interpret things correctly. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I work in Health Care and we routinely do this.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 07 Sep 2011 01:18:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Is-it-a-sensible-practice-to-collapse-categories-of-a-predictor/m-p/61115#M2847</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2011-09-07T01:18:10Z</dc:date>
    </item>
  </channel>
</rss>

