BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
pvareschi
Quartz | Level 8

Re: Predicting Analytics on Big Data

How is the Out-Of-Bag Error calculated for a random forest fitted through RANDOMWOODS in Proc IMSTAT (see page 3-123 of the course text)?

Is it an average of the errors on each out-of-bag sample, calculated as Average Square Error?

1 ACCEPTED SOLUTION

Accepted Solutions
gcjfernandez
SAS Employee
Yes you are correct. It is the mean of ASE of all the out-of-bag samples.

View solution in original post

2 REPLIES 2
gcjfernandez
SAS Employee
Yes you are correct. It is the mean of ASE of all the out-of-bag samples.
lovelmark
Calcite | Level 5

Out of bag error is simply error computed on samples not seen during training. Out-of-bag estimate for the generalization error is the error rate of the out-of-bag classifier on the training set (compare it with known yi's). In Breiman's original implementation of the random forest algorithm, each tree is trained on about 2/3 of the total training data. As the forest is built, each tree can thus be tested (similar to leave one out cross validation) on the samples not used in building that tree. This is the out of bag error estimate - an internal error estimate of a random forest as it is being constructed.

 

 

This is a knowledge-sharing community for learners in the Academy. Find answers to your questions or post here for a reply.
To ensure your success, use these getting-started resources:

Estimating Your Study Time
Reserving Software Lab Time
Most Commonly Asked Questions
Troubleshooting Your SAS-Hadoop Training Environment

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 2 replies
  • 1104 views
  • 0 likes
  • 3 in conversation