BookmarkSubscribeRSS Feed
subham
Calcite | Level 5

Came to know from articles that e miner automatically selects the model that has the least average square error (or missclassification) on the validation data set. This is called stopped training method which definitely helps to ensure NN does not overfit. So it's mean that even if the model converges after say 56th iteration, it may select the final model with 39th iteration if validation dataset has minimum error (or missclassification) on this particular iteration. So my confusion is if I assign training and test data set (insetad of validation) then will the e miner select the last model (that means weight estimate after 56th iteration in my example) ? Since there is a direct involvement of the validation data on building the model, how correct the method is? I mean the model might be different if I select some other validation data set.

1 REPLY 1
WendyCzika
SAS Employee

If you don't use a validation partition for early stopping, then yes, as you say the neural network model will likely overfit and not generalize well to new data.  So you really want to use a validation partition when building your neural network.  

hackathon24-white-horiz.png

2025 SAS Hackathon: There is still time!

Good news: We've extended SAS Hackathon registration until Sept. 12, so you still have time to be part of our biggest event yet – our five-year anniversary!

Register Now

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 1476 views
  • 0 likes
  • 2 in conversation