Hello everyone. It is my understanding that if one wants to build multiple time series models on a time series that went from 2000 to today (2015) monthly; and one wanted to use that information to forecast 3 months in the future, it is common approach to split your data into "Train" and "test" datasets.
Your test dataset would be the last 3 months of your time series (jan 2015, feb 2015, march 2015) (pretend we're already in april for simplicity sake). You would then 'define' your model on your Training dataset, and then compute it's errors against your test dataset.
This way you could try out many multiple models and pick the one with the lowest "Forecast prediction error".
However my question is: By ignorning those last 3 months of data, how do you then use that model to forecast values later in time? Example: say you wanted to forecast April-June. Is it standard procedure to apply the same model (that wasn't built on the last 3 months) to the April - June forecast period? Or is it standard procedure to 're-build' the model on the entire time series (2000 to 2015 March) and then use that model to forecast into April-June?
I am pretty confused by this and any clarification would be helpful.
Registration is open! SAS is returning to Vegas for an AI and analytics experience like no other! Whether you're an executive, manager, end user or SAS partner, SAS Innovate is designed for everyone on your team. Register for just $495 by 12/31/2023.
If you are interested in speaking, there is still time to submit a session idea. More details are posted on the website.