BookmarkSubscribeRSS Feed
subham
Calcite | Level 5

Came to know from articles that e miner automatically selects the model that has the least average square error (or missclassification) on the validation data set. This is called stopped training method which definitely helps to ensure NN does not overfit. So it's mean that even if the model converges after say 56th iteration, it may select the final model with 39th iteration if validation dataset has minimum error (or missclassification) on this particular iteration. So my confusion is if I assign training and test data set (insetad of validation) then will the e miner select the last model (that means weight estimate after 56th iteration in my example) ? Since there is a direct involvement of the validation data on building the model, how correct the method is? I mean the model might be different if I select some other validation data set.

1 REPLY 1
WendyCzika
SAS Employee

If you don't use a validation partition for early stopping, then yes, as you say the neural network model will likely overfit and not generalize well to new data.  So you really want to use a validation partition when building your neural network.  

SAS Innovate 2025: Save the Date

 SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!

Save the date!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 1222 views
  • 0 likes
  • 2 in conversation