pvareschi
Quartz | Level 8

Re: Predicting Analytics on Big Data

What is the purpose and effect of the NBINS option when used with the DECISIONTREE statement (page 3-89 of the course text)?

The reference on page 3.70 states: "specifies the number of bins to use in the calculation of the decision tree. The number of bins affects the accuracy of the tree."

What does "use in the calculation of the decision tree" mean? Does it affect how the splitting algorithm works?

Accepted Solutions
gcjfernandez
SAS Employee

My Answer:

In IMSTAT decision tree modeling, interval input variables are binned during the split search, and the optimal split point is determined using information gain. Users can choose the number of bins (2 to 10) used to transform each interval input variable into a binned variable. In other words, NBINS controls how finely each interval input is discretized before candidate splits are evaluated.
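To illustrate why the number of bins can affect the accuracy of the tree, here is a minimal Python sketch of a binned split search. It is not the IMSTAT implementation: the equal-width binning, the function names (`entropy`, `bin_edges`, `best_binned_split`), and the toy data are all assumptions made for illustration. The only ideas taken from the answer above are that interval inputs are binned and that the split is chosen by information gain.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (base 2) of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def bin_edges(values, nbins):
    """Interior boundaries of nbins equal-width bins.

    Equal-width binning is an assumption; the thread does not say
    how IMSTAT forms its bins.
    """
    lo, hi = min(values), max(values)
    width = (hi - lo) / nbins
    return [lo + i * width for i in range(1, nbins)]


def best_binned_split(values, labels, nbins):
    """Return (cut, gain) for the bin boundary with the highest information gain."""
    parent = entropy(labels)
    n = len(labels)
    best_cut, best_gain = None, 0.0
    for cut in bin_edges(values, nbins):
        left = [y for x, y in zip(values, labels) if x < cut]
        right = [y for x, y in zip(values, labels) if x >= cut]
        if not left or not right:
            continue  # a boundary that separates nothing is not a split
        gain = (parent
                - (len(left) / n) * entropy(left)
                - (len(right) / n) * entropy(right))
        if gain > best_gain:
            best_cut, best_gain = cut, gain
    return best_cut, best_gain


# Hypothetical toy data: the classes separate cleanly between x = 7 and x = 8.
x = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
y = [0, 0, 0, 0, 0, 0, 0, 1, 1, 1]

cut2, gain2 = best_binned_split(x, y, nbins=2)
cut10, gain10 = best_binned_split(x, y, nbins=10)
print("nbins=2: ", cut2, gain2)
print("nbins=10:", cut10, gain10)
```

With 2 bins the only candidate boundary is the midpoint 5.5, which cannot separate the classes cleanly. With 10 bins one boundary falls between 7 and 8, so the search can find the clean split. Under these assumptions, more bins give the search more candidate split points at the cost of more work per variable.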

 

