deleted_user
Not applicable
Dear All,
In order to get started with decision trees, I would like to have my computer produce a decision tree from a data set with one dependent variable (disease yes/no) and several independent variables (I would usually run logistic regression on such data). The tree should "decide" which variables are important risk factors for the disease. Any hints for a complete beginner in decision trees who is a fairly experienced SAS user (no access to SAS Enterprise Miner)?
Thanks in advance
1 REPLY
deleted_user
Not applicable
You might want to run PROC ARBOR directly; it is the procedure behind the Decision Tree node in EM (Enterprise Miner). See the documentation at http://support.sas.com/documentation/onlinedoc/miner/em43/proc_arbor.pdf (an older version, but still mostly valid).

For your case, the following generic code should work.

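proc arbor data=;        /* training data set */
  input ...;             /* independent variables (candidate risk factors) */
  target disease;        /* dependent variable (disease yes/no) */
  save importance=;      /* data set that receives the variable importance results */
  score data= out=;      /* data set to score, and data set for the scored output */
quit;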

The SAVE IMPORTANCE= data set contains the variable importance results, which are based on the splits in the tree. The SCORE OUT= data set contains the scoring output, such as predictions, for the data set named in SCORE DATA=.
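
Once the importance data set exists, ranking the variables takes only Base SAS. The sketch below is one possible way to do it, not part of the original suggestion. It assumes the tree was saved with save importance=work.imp; (a hypothetical data set name) and that the importance measure is stored in a column named IMPORTANCE. Neither is confirmed here, so check the PROC CONTENTS output first and adjust the names.

```sas
/* Assumption: the tree was fit with  save importance=work.imp;      */
/* (work.imp is a hypothetical name chosen for this illustration).   */

/* List the columns that the importance data set actually contains.  */
proc contents data=work.imp;
run;

/* Rank the variables from most to least important.                  */
/* IMPORTANCE is an assumed column name; replace it with whatever    */
/* PROC CONTENTS reports for your release.                           */
proc sort data=work.imp out=work.imp_sorted;
  by descending importance;
run;

/* Print the ranked list without the observation number column.      */
proc print data=work.imp_sorted noobs;
run;
```

Variables near the top of the printed list are the ones the tree relied on most for its splits, which makes them the natural candidates to compare against the significant terms of a logistic regression.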

