<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic PROC HPSPLIT with a unbalanced outcome and 3 categories in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-HPSPLIT-with-a-unbalanced-outcome-and-3-categories/m-p/479728#M24953</link>
    <description>&lt;P&gt;Hello&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;With PROC HPSPLIT there are some options for dealing with dichotomous outcome variables that are very unbalanced.&amp;nbsp; But what if the outcome has 3 (or more) levels and they are unbalanced?&amp;nbsp; I could not find any options to deal with this. For instance, using SAS 9.4 on Windows I did this:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data new;
        set sashelp.bweight;
        count + 1;

		if weight &amp;lt; 1500 then bwcat = "1: Very low";
		else if weight &amp;lt; 2500 then bwcat = "2: low";
		else bwcat = "3: Normal";
run;&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;and then&lt;/P&gt;&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;proc hpsplit data = new seed = 123;
   class black boy married momedlevel momsmoke bwcat;
   model bwcat = black boy married momedlevel momsmoke momage momwtgain visit cigsperday;
   output out=hpsplout;
run;&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;the result is not good.&amp;nbsp; None of the very low BW babies are correctly classified, and less than 2% of the low BW babies are correctly classified. For a dichotomous outcome, we can play with the sensitivity level in scoring, but that has no real analogue here.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any thoughts or suggestions are welcome.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Peter&lt;/P&gt;</description>
    <pubDate>Thu, 19 Jul 2018 21:41:35 GMT</pubDate>
    <dc:creator>plf515</dc:creator>
    <dc:date>2018-07-19T21:41:35Z</dc:date>
    <item>
      <title>PROC HPSPLIT with a unbalanced outcome and 3 categories</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-HPSPLIT-with-a-unbalanced-outcome-and-3-categories/m-p/479728#M24953</link>
      <description>&lt;P&gt;Hello&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;With PROC HPSPLIT there are some options for dealing with dichotomous outcome variables that are very unbalanced.&amp;nbsp; But what if the outcome has 3 (or more) levels and they are unbalanced?&amp;nbsp; I could not find any options to deal with this. For instance, using SAS 9.4 on Windows I did this:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data new;
        set sashelp.bweight;
        count + 1;

		if weight &amp;lt; 1500 then bwcat = "1: Very low";
		else if weight &amp;lt; 2500 then bwcat = "2: low";
		else bwcat = "3: Normal";
run;&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;and then&lt;/P&gt;&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;proc hpsplit data = new seed = 123;
   class black boy married momedlevel momsmoke bwcat;
   model bwcat = black boy married momedlevel momsmoke momage momwtgain visit cigsperday;
   output out=hpsplout;
run;&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;the result is not good.&amp;nbsp; None of the very low BW babies are correctly classified, and less than 2% of the low BW babies are correctly classified. For a dichotomous outcome, we can play with the sensitivity level in scoring, but that has no real analogue here.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any thoughts or suggestions are welcome.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Peter&lt;/P&gt;</description>
      <pubDate>Thu, 19 Jul 2018 21:41:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-HPSPLIT-with-a-unbalanced-outcome-and-3-categories/m-p/479728#M24953</guid>
      <dc:creator>plf515</dc:creator>
      <dc:date>2018-07-19T21:41:35Z</dc:date>
    </item>
  </channel>
</rss>

