Wondering if pruning gives up too much valuable information.
Yes, the model becomes smaller, and supposedly more understandable.
But all those tips being whacked contain at least pairs of variables that ought to be acknowledged.
What are your thoughts on this conundrum?
Thanks!
Nicholas Kormanik
This is true in any modeling, the larger the model (in this case the more branches), the better it will fit (apparently).
But there is also a concept called overfitting, which would usually lead to smaller models, as overfitting is not good and essentially is fitting noise. Those extra branches that should get pruned may in fact be overfitting. How can you tell if something is overfit? Usually, by either crossvalidation, or by fitting the model to a training data set, and then evaluating its performance on validation (and test) data sets
Don’t miss the livestream kicking off May 7. It’s free. It’s easy. And it’s the best seat in the house.
Join us virtually with our complimentary SAS Innovate Digital Pass. Watch live or on-demand in multiple languages, with translations available to help you get the most out of every session.
Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.