Zachary
Obsidian | Level 7

I think I need the Code Node to accomplish this, but I am not sure.

I am running a Decision Tree on a set of data composed of about 10,000 records. The dependent variable is nominal/dichotomous, and I expect about 90% of the data will be predicted to be a 0 while the other 10% will be predicted to be a 1.

I then want to add a second Decision Tree to the project. That one will use a different, continuous dependent variable, and it will be fit only on the cases that the first Decision Tree predicts to be 0 (again, around 9,000 cases).

It sounds kind of basic, but kind of not. Either way, can anyone suggest the best way to do things in Enterprise Miner for this? Thank you. Much appreciated.

FOLLOW-UP:

Actually, the TwoStage node seems like it will suit my needs well. However, I am modeling two different dependent/response variables: one categorical and one continuous. So perhaps it is not usable.

WendyCzika
SAS Employee

I think TwoStage will do exactly what you want. Set both dependent variables as Targets in your Input Data node, and in the TwoStage node, change the Filter property to Non-Events and the Value Model to Tree. Just be sure you are using the correct Order (Ascending) for the nominal target so that 0s are treated as the event. The predicted non-0s will then be excluded from the second tree, where you are modeling the continuous target.
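For anyone who wants to see the filtering logic spelled out, here is a minimal sketch in plain Python. It is not Enterprise Miner code and not how the TwoStage node is implemented internally. It assumes a single numeric input and uses one-split "stumps" in place of full trees. The toy records and every function name (`fit_stump`, `two_stage`, and so on) are made up for illustration.

```python
from statistics import mean

def sse(ys):
    """Sum of squared errors around the leaf mean (regression loss)."""
    m = mean(ys)
    return sum((y - m) ** 2 for y in ys)

def misclassified(ys):
    """Count of 0/1 cases outside the leaf's majority class (classification loss)."""
    ones = sum(ys)
    return min(ones, len(ys) - ones)

def fit_stump(xs, ys, loss):
    """Return (threshold, left_targets, right_targets) for the best single split.

    Assumes xs holds at least two distinct values, so both leaves are non-empty.
    """
    best = None
    for t in sorted(set(xs))[1:]:
        left = [y for x, y in zip(xs, ys) if x < t]
        right = [y for x, y in zip(xs, ys) if x >= t]
        score = loss(left) + loss(right)
        if best is None or score < best[0]:
            best = (score, t, left, right)
    return best[1:]

def make_predictor(threshold, left_value, right_value):
    return lambda x: left_value if x < threshold else right_value

def two_stage(records):
    """records: list of (x, binary_target, continuous_target) tuples."""
    xs = [r[0] for r in records]

    # Stage 1: classify the binary target using all records.
    t1, left, right = fit_stump(xs, [r[1] for r in records], misclassified)
    majority = lambda leaf: int(2 * sum(leaf) > len(leaf))
    stage1 = make_predictor(t1, majority(left), majority(right))

    # Filter: keep only the cases stage 1 predicts as 0.
    kept = [r for r in records if stage1(r[0]) == 0]

    # Stage 2: model the continuous target on the kept cases only.
    t2, left, right = fit_stump([r[0] for r in kept], [r[2] for r in kept], sse)
    stage2 = make_predictor(t2, mean(left), mean(right))
    return stage1, stage2, kept

# Hypothetical toy data: (input, binary target, continuous target).
records = [
    (1, 0, 10.0), (2, 0, 12.0), (3, 0, 11.0), (4, 0, 9.0),
    (5, 0, 50.0), (6, 0, 52.0), (7, 0, 48.0), (8, 0, 51.0),
    (9, 1, 0.0), (10, 1, 0.0),
]
stage1, stage2, kept = two_stage(records)
```

The point of the sketch is only the order of operations: the second model never sees the cases the first model predicts as 1, which is what the Filter setting described above does for you inside the node.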
