BookmarkSubscribeRSS Feed
plf515
Lapis Lazuli | Level 10

Hello

 

With PROC HPSPLIT there are some options for dealing with dichotomous outcome variables that are very unbalanced.  But what if the outcome has 3 (or more) levels and they are unbalanced?  I could not find any options to deal with this. For instance, using SAS 9.4 on Windows I did this:

 

data new;
        set sashelp.bweight;
        count + 1;

		if weight < 1500 then bwcat = "1: Very low";
		else if weight < 2500 then bwcat = "2: low";
		else bwcat = "3: Normal";
run;

and then

proc hpsplit data = new seed = 123;
   class black boy married momedlevel momsmoke bwcat;
   model bwcat = black boy married momedlevel momsmoke momage momwtgain visit cigsperday;
   output out=hpsplout;
run;

the result is not good.  None of the very low BW babies are correctly classified, and less than 2% of the low BW babies are correctly classified. For a dichotomous outcome, we can play with the sensitivity level in scoring, but that has no real analogue here.

 

Any thoughts or suggestions are welcome. 

 

Peter

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 0 replies
  • 1073 views
  • 0 likes
  • 1 in conversation