BookmarkSubscribeRSS Feed
plf515
Lapis Lazuli | Level 10

Hello

 

With PROC HPSPLIT there are some options for dealing with dichotomous outcome variables that are very unbalanced.  But what if the outcome has 3 (or more) levels and they are unbalanced?  I could not find any options to deal with this. For instance, using SAS 9.4 on Windows I did this:

 

data new;
        set sashelp.bweight;
        count + 1;

		if weight < 1500 then bwcat = "1: Very low";
		else if weight < 2500 then bwcat = "2: low";
		else bwcat = "3: Normal";
run;

and then

proc hpsplit data = new seed = 123;
   class black boy married momedlevel momsmoke bwcat;
   model bwcat = black boy married momedlevel momsmoke momage momwtgain visit cigsperday;
   output out=hpsplout;
run;

the result is not good.  None of the very low BW babies are correctly classified, and less than 2% of the low BW babies are correctly classified. For a dichotomous outcome, we can play with the sensitivity level in scoring, but that has no real analogue here.

 

Any thoughts or suggestions are welcome. 

 

Peter

SAS Innovate 2025: Save the Date

 SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!

Save the date!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 0 replies
  • 1181 views
  • 0 likes
  • 1 in conversation