deleted_user
Not applicable
Dear All,
To get started with decision trees, I would like to produce a decision tree from a data set with one dependent variable (disease yes/no) and several independent variables (I would usually run logistic regression on these data). The tree should "decide" which variables are important risk factors for the disease. Any hints for a complete beginner in decision trees who is a fairly experienced SAS user (no access to SAS Enterprise Miner)?
Thanks in advance
1 REPLY 1
deleted_user
Not applicable
You might want to run PROC ARBOR directly; it is the procedure that runs behind the Decision Tree node in Enterprise Miner (EM). Please refer to the documentation at http://support.sas.com/documentation/onlinedoc/miner/em43/proc_arbor.pdf (an older version, but still mostly valid).

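For your case, the following generic code should work. Fill in your own data set names after each = and list your independent variables in the INPUT statement.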

proc arbor data=;
input ...;
target disease;
save importance=;
score data= out=;
quit;

The SAVE IMPORTANCE= dataset contains variable importance results based on splits. The SCORE OUT= dataset contains the output results such as predictions for the score dataset DATA=.
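As an illustration only, here is the same template with the blanks filled in. All data set and variable names (work.patients, work.new_patients, age, bmi, smoker, sex) are made up for the example, and the LEVEL= options are written from memory of the ARBOR documentation rather than taken from the template above, so please check them against the PDF before relying on them.

```sas
/* Sketch only: every data set and variable name below is hypothetical. */
proc arbor data=work.patients;
   /* Assumed option: LEVEL= declares the measurement level of the inputs. */
   input age bmi / level=interval;      /* continuous candidate risk factors  */
   input smoker sex / level=nominal;    /* categorical candidate risk factors */
   /* Assumed option: LEVEL=BINARY asks for a classification tree on yes/no.  */
   target disease / level=binary;
   save importance=work.tree_importance;                 /* split-based importance   */
   score data=work.new_patients out=work.tree_scored;    /* predictions for new data */
quit;

/* Inspect which variables the tree found important. */
proc print data=work.tree_importance;
run;
```

Variables that the tree never uses for a split should show up with low or zero importance in the printed data set, which is one rough way to read off the candidate risk factors.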
