BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
ajkalale
Obsidian | Level 7

Is the test data and the validation dataset the same?

1 ACCEPTED SOLUTION

Accepted Solutions
Reeza
Super User

The terms can vary a bit. But the idea is three datasets:

 

  1. training - used to develop a model
  2. validation - used to calibrate model. If model doesn’t perform well, revisit model and use this data to validate
  3. test - used in final stages, to get final model accuracy  numbers. Used to prevent over fitting and provide an 'honest assessment’. You would not retrain a model if this didn’t score well 

View solution in original post

1 REPLY 1
Reeza
Super User

The terms can vary a bit. But the idea is three datasets:

 

  1. training - used to develop a model
  2. validation - used to calibrate model. If model doesn’t perform well, revisit model and use this data to validate
  3. test - used in final stages, to get final model accuracy  numbers. Used to prevent over fitting and provide an 'honest assessment’. You would not retrain a model if this didn’t score well 

hackathon24-white-horiz.png

2025 SAS Hackathon: There is still time!

Good news: We've extended SAS Hackathon registration until Sept. 12, so you still have time to be part of our biggest event yet – our five-year anniversary!

Register Now

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 1 reply
  • 1858 views
  • 1 like
  • 2 in conversation