Dear Group,
I am attempting a classification tree for a medical outcome. Both the outcome variable and the independent variables are binary.
My tree looks as if SAS is reading the independent variables as continuous, although each one is actually coded 0|1. For example, there are input values of 0.10 and 0.01. I do not understand that, and the SAS example I read online for classification trees shows different output.
Does anyone have an idea of what could be going on here? Also, I was expecting a chi-square statistic or a p-value. (?)
My code:
data pam.inhib;
proc hpsplit data=pam.copyinhib;
class inhibition;
model inhibition =
acephate chlorpyrifos ddvp diazinon dimethoate methomyl mevinphos carbaryl carbofuran oxydemeton_methyl handler;
grow chaid;
run;
I've attached the tree.
Thank you!
best, pam
There's a lot of output there. Can you specify which output shows the issue, perhaps by page and table or graph?
Specifically related to:
My tree looks as if SAS is reading the independent variables as continuous, although each one is actually coded 0|1. For example, there are input values of 0.10 and 0.01. I do not understand that, and the SAS example I read online for classification trees shows different output.
All of the predictor variables are considered as continuous unless you also specify them in the CLASS statement. See the descriptions of the CLASS and MODEL statements in the PROC HPSPLIT documentation.
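Applied to the code in the original post, that would look roughly like the sketch below: every binary predictor is listed in the CLASS statement alongside the target. The data set and variable names are copied from the post. The stray DATA statement is left out, which assumes that pam.copyinhib already exists and holds the 0|1 variables.

```sas
/* Sketch only: names come from the original post and    */
/* pam.copyinhib is assumed to exist already.            */
proc hpsplit data=pam.copyinhib;
   /* List the target AND every 0|1 predictor in CLASS   */
   /* so that none of them is treated as continuous.     */
   class inhibition
         acephate chlorpyrifos ddvp diazinon dimethoate methomyl
         mevinphos carbaryl carbofuran oxydemeton_methyl handler;
   model inhibition =
         acephate chlorpyrifos ddvp diazinon dimethoate methomyl
         mevinphos carbaryl carbofuran oxydemeton_methyl handler;
   grow chaid;
run;
```

With the predictors declared in CLASS, the splits should be reported in terms of the levels 0 and 1 rather than numeric cut points.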
Got it, great!
Thank you very much.
pam
Hi Reeza,
Thanks for getting back to me. The problem is fixed (it was the CLASS statement).
I am new to this. I see that SAS is splitting the tree only once now that I've fixed that statement. My new question: is there a way to ask SAS to make more than one split?
best, pam