BookmarkSubscribeRSS Feed
jwoods
Calcite | Level 5

Hi,

 

I have experience in SAS but I'm having trouble using SAS EM. I have a few questions:

 

1. How is purity measured/ranked?

2. Is the purity of a node based on the training data or the validation data? 

3. Why is the purity of a node indicated by training/validation (answer to number 2) data?

4. How do you know (by looking at a tree), which nodes are the purist?

 

Thanks

1 REPLY 1
WendyCzika
SAS Employee

You can use Gini impurity as the splitting criterion when growing the decision tree, which has formula:

 

 

 

SAS Innovate 2025: Call for Content

Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 16. Read more here about why you should contribute and what is in it for you!

Submit your idea!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 1062 views
  • 0 likes
  • 2 in conversation