After running the Rule Induction node in EM 12.1, the results showed the first rip with a 100% pure node (say node 7), but the subsequent rips didn't remove the node 7 rows. Does anyone know why? Thanks.
Can you explain more what you mean by the 7 node rows were not taken away?
Thanks,
Wendy
The actual situation was that node 4, in RIP4, was 100% pure (see the tested sample below) but was not removed (the output said "No leaf was ripped from the model."). I expected node 4's rows/observations to be removed before the next rips and any further modeling.
RIP4 Leaf Table: Threshold= 100
No leaf was ripped from the model.

              Predicted:  Predicted:
Node       N  target=0    target=1
   4  484942    1.0000      0.0000
  11   50101    0.9993      0.0007
...
...
....
I see, thank you for clarifying.
My only thought is that with the large sample size for node 4, maybe the purity isn't exactly 100% as shown, but actually 99.999 or something and that precision is just not displayed here. You could try changing the Purity Threshold property to 99 and see if it rips node 4 as expected in that case.
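To show how much room the four-decimal display leaves, here is a rough Python sketch (not SAS code, and not how Enterprise Miner computes anything internally; the function name is made up for illustration). It assumes the leaf table simply rounds the proportion to four decimals, and counts how many target=1 rows a node of a given size could contain while still printing as 1.0000.

```python
def max_hidden_minority_rows(n_rows: int, decimals: int = 4) -> int:
    """Largest number of minority-class rows for which the majority
    proportion still prints as 1.0000 at the given precision.

    Assumption: the displayed value is the proportion rounded to
    `decimals` places, which is a guess about the report format.
    """
    shown_as_pure = f"{1.0:.{decimals}f}"
    k = 0
    # Keep adding minority rows until the rounded display changes.
    while f"{(n_rows - (k + 1)) / n_rows:.{decimals}f}" == shown_as_pure:
        k += 1
    return k


n = 484942  # N for node 4 in the RIP4 leaf table
k = max_hidden_minority_rows(n)
purity_pct = 100 * (n - k) / n
print(f"Up to {k} target=1 rows would still display as 1.0000")
print(f"Purity with {k} such rows: {purity_pct:.5f}%")
```

If that assumption holds, node 4 could contain a small number of target=1 rows and still show 1.0000 / 0.0000, which would put it just under a threshold of 100 but well above 99.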