aha123
Obsidian | Level 7

For my dataset, I ran the Regression, Neural Network, Decision Tree, and Gradient Boosting nodes. The Gradient Boosting node always has the highest MSE, even higher than Regression. Even after I changed N Iterations to 200, it is still the worst one. Could you suggest what other parameters I can try to improve the outcome? I also ran my dataset in R and Python, and gradient boosting in both gives very good results, so I don't know what is wrong with SAS's Gradient Boosting node. My problem is a regression problem, not a classification one.

4 REPLIES
WendyCzika
SAS Employee

One thing to try would be using a smaller value for Leaf Fraction; the current value could be too high for the node to find splits.

Babloo
Rhodochrosite | Level 12

May I know what Leaf Fraction is and why we need to use a smaller value for it?

Babloo
Rhodochrosite | Level 12

May I know what Leaf Fraction is in Gradient Boosting?

WendyCzika
SAS Employee

Use the Leaf Fraction property to specify the smallest number of training observations a new branch may have, expressed as the proportion of the number N of available training observations in the data. N may be less than the total number of observations in the data set because observations with a missing target value are excluded.

 

So setting this to a smaller value lowers the minimum branch size, which allows more splits and grows larger trees.
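To make the arithmetic concrete, here is a minimal Python sketch of how a leaf fraction turns into a minimum branch size. The function name `min_leaf_size` is hypothetical, and rounding up with a floor of one observation is an assumption; the exact rounding rule the node uses is not stated here.

```python
import math

def min_leaf_size(targets, leaf_fraction):
    """Smallest number of training observations a new branch may hold.

    Hypothetical helper for illustration only; not part of any SAS API.
    """
    # N counts only observations with a non-missing target.
    n_available = sum(1 for t in targets if t is not None)
    # Assumption: round up, and never allow fewer than 1 observation.
    # round(..., 9) guards against floating-point noise before ceil().
    return max(1, math.ceil(round(leaf_fraction * n_available, 9)))

# 10,000 rows, 500 of them with a missing target, so N = 9,500.
targets = [1.5] * 9500 + [None] * 500

for fraction in (0.1, 0.01, 0.001):
    print(fraction, min_leaf_size(targets, fraction))
```

With N = 9,500, a fraction of 0.1 requires 950 observations per branch, while 0.001 requires only 10, so far more candidate splits are allowed and the trees can grow larger.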

