After run 'rule induction' in EM12.1, from the results I got the first rip with a 100 pure node(say node 7) , but the next rips didn't take the 7 node rows away. Anyone knows why? Thanks.
Can you explain more what you mean by the 7 node rows were not taken away?
Thanks,
Wendy
Actual situation was that node 4, in rip4, had 100% pure (see below a tested sample), but was not removed (it said: no leaf was rippd from the model), I expected node 4's rows/observations should be removed for next rips, or further modeling process.
RIP4 Leaf Table: Threshold= 100
No leaf was ripped from the model.
Predicted: Predicted:
Node N target=0 target=1
4 484942 1.0000 0.0000
11 50101 0.9993 0.0007
...
...
....
I see, thank you for clarifying.
My only thought is that with the large sample size for node 4, maybe the purity isn't exactly 100% as shown, but actually 99.999 or something and that precision is just not displayed here. You could try changing the Purity Threshold property to 99 and see if it rips node 4 as expected in that case.
April 27 – 30 | Gaylord Texan | Grapevine, Texas
Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and save with the early bird rate—just $795!
Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.
Find more tutorials on the SAS Users YouTube channel.