<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to reduce number of levels in an input variable in SAS EM in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-reduce-number-of-levels-in-an-input-variable-in-SAS-EM/m-p/361617#M5365</link>
    <description>&lt;P&gt;Usually I'd say use some sort of binning, but you have nominal variables.&lt;/P&gt;
&lt;P&gt;Ideally, you need to group them in some logical manner, for example cities into states, though that becomes redundant here since you already have a state variable. Another option is some sort of spatial relationship, especially if you want to be able to interpret the results afterwards.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you're looking for rules to create these groups, it becomes a data mining problem in itself; decision trees and clustering are two ways to approach it.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Thu, 25 May 2017 14:18:26 GMT</pubDate>
    <dc:creator>Reeza</dc:creator>
    <dc:date>2017-05-25T14:18:26Z</dc:date>
    <item>
      <title>How to reduce number of levels in an input variable in SAS EM</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-reduce-number-of-levels-in-an-input-variable-in-SAS-EM/m-p/361602#M5363</link>
      <description>&lt;P&gt;Hey&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have a dataset where some of the input variables have a lot of levels, e.g. School_city with 513 levels and School_state with 49 levels. How can I reduce the number of levels in an input variable, or in some way group levels together, in SAS Enterprise Miner?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'm fairly new to SAS EM, so I need some help figuring this out.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;/LB&lt;/P&gt;</description>
      <pubDate>Thu, 25 May 2017 13:56:46 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-to-reduce-number-of-levels-in-an-input-variable-in-SAS-EM/m-p/361602#M5363</guid>
      <dc:creator>Lbind91</dc:creator>
      <dc:date>2017-05-25T13:56:46Z</dc:date>
    </item>
    <item>
      <title>Re: How to reduce number of levels in an input variable in SAS EM</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-reduce-number-of-levels-in-an-input-variable-in-SAS-EM/m-p/361617#M5365</link>
      <description>&lt;P&gt;Usually I'd say use some sort of binning, but you have nominal variables.&lt;/P&gt;
&lt;P&gt;Ideally, you need to group them in some logical manner, for example cities into states, though that becomes redundant here since you already have a state variable. Another option is some sort of spatial relationship, especially if you want to be able to interpret the results afterwards.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you're looking for rules to create these groups, it becomes a data mining problem in itself; decision trees and clustering are two ways to approach it.&lt;/P&gt;
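&lt;P&gt;To make the idea concrete, here is a rough sketch in plain Python rather than SAS EM. It is not a decision tree: it is a simpler stand-in that ranks the levels of a variable by their mean target value and cuts the ranked list into a few groups of roughly equal size, which is close in spirit to how a tree groups the levels of a nominal input. The helper names (group_levels, add_grouped_columns), the "_grp" suffix, the target column "y", and the toy rows are all assumptions made up for illustration.&lt;/P&gt;

```python
from collections import defaultdict


def group_levels(rows, var, target, n_groups):
    """Map each level of var to a group id by ranking levels on mean target."""
    totals = defaultdict(lambda: [0.0, 0])  # level: [sum of target, row count]
    for row in rows:
        stats = totals[row[var]]
        stats[0] += row[target]
        stats[1] += 1
    # Rank levels from lowest to highest mean target; ties break on the level name.
    ranked = sorted(totals, key=lambda lvl: (totals[lvl][0] / totals[lvl][1], lvl))
    # Cut the ranked list into n_groups chunks of roughly equal size.
    return {lvl: i * n_groups // len(ranked) for i, lvl in enumerate(ranked)}


def add_grouped_columns(rows, variables, target, n_groups):
    """Return copies of rows with one new column per variable, named var + '_grp'."""
    mappings = {var: group_levels(rows, var, target, n_groups) for var in variables}
    return [
        {**row, **{var + "_grp": mappings[var][row[var]] for var in variables}}
        for row in rows
    ]


# Hypothetical toy data: the column names echo the question, the values are made up.
rows = [
    {"School_city": "A", "School_state": "X", "y": 1},
    {"School_city": "A", "School_state": "X", "y": 1},
    {"School_city": "B", "School_state": "X", "y": 0},
    {"School_city": "C", "School_state": "Y", "y": 0},
    {"School_city": "D", "School_state": "Y", "y": 1},
    {"School_city": "D", "School_state": "Y", "y": 0},
]
grouped = add_grouped_columns(rows, ["School_city", "School_state"], "y", 2)
```

&lt;P&gt;Each grouped variable gets its own column name derived from the original variable, so the results for several variables can sit side by side in the same table. With real data you would want to check that each group contains enough rows before trusting it.&lt;/P&gt;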
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 25 May 2017 14:18:26 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-to-reduce-number-of-levels-in-an-input-variable-in-SAS-EM/m-p/361617#M5365</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2017-05-25T14:18:26Z</dc:date>
    </item>
    <item>
      <title>Re: How to reduce number of levels in an input variable in SAS EM</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-reduce-number-of-levels-in-an-input-variable-in-SAS-EM/m-p/361623#M5366</link>
      <description>&lt;P&gt;Thanks for the input, Reeza.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I tried using a decision tree to consolidate the levels. It worked for each variable separately and grouped the levels, but I cannot figure out how to get the 5 consolidated variables back into the data.&lt;/P&gt;&lt;P&gt;The output from each separate decision tree is just _NODE_ for the newly derived variable. Can I change the names so they won't all have the same name?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 25 May 2017 14:28:20 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-to-reduce-number-of-levels-in-an-input-variable-in-SAS-EM/m-p/361623#M5366</guid>
      <dc:creator>Lbind91</dc:creator>
      <dc:date>2017-05-25T14:28:20Z</dc:date>
    </item>
  </channel>
</rss>

