BookmarkSubscribeRSS Feed
subham
Calcite | Level 5

Came to know from articles that e miner automatically selects the model that has the least average square error (or missclassification) on the validation data set. This is called stopped training method which definitely helps to ensure NN does not overfit. So it's mean that even if the model converges after say 56th iteration, it may select the final model with 39th iteration if validation dataset has minimum error (or missclassification) on this particular iteration. So my confusion is if I assign training and test data set (insetad of validation) then will the e miner select the last model (that means weight estimate after 56th iteration in my example) ? Since there is a direct involvement of the validation data on building the model, how correct the method is? I mean the model might be different if I select some other validation data set.

1 REPLY 1
WendyCzika
SAS Employee

If you don't use a validation partition for early stopping, then yes, as you say the neural network model will likely overfit and not generalize well to new data.  So you really want to use a validation partition when building your neural network.  

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 1067 views
  • 0 likes
  • 2 in conversation