Hello everyone. It is my understanding that if one wants to build multiple time series models on a time series that went from 2000 to today (2015) monthly; and one wanted to use that information to forecast 3 months in the future, it is common approach to split your data into "Train" and "test" datasets.
Your test dataset would be the last 3 months of your time series (jan 2015, feb 2015, march 2015) (pretend we're already in april for simplicity sake). You would then 'define' your model on your Training dataset, and then compute it's errors against your test dataset.
This way you could try out many multiple models and pick the one with the lowest "Forecast prediction error".
However my question is: By ignorning those last 3 months of data, how do you then use that model to forecast values later in time? Example: say you wanted to forecast April-June. Is it standard procedure to apply the same model (that wasn't built on the last 3 months) to the April - June forecast period? Or is it standard procedure to 're-build' the model on the entire time series (2000 to 2015 March) and then use that model to forecast into April-June?
I am pretty confused by this and any clarification would be helpful.
Thanks all!
Yes: Standard procedure to re-fit the winning model on the entire time series and then use that model to forecast.
Yes: Standard procedure to re-fit the winning model on the entire time series and then use that model to forecast.
Hello -
Attached presentation might be of interest.
Thanks,
Udo
SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!
Learn how to run multiple linear regression models with and without interactions, presented by SAS user Alex Chaplin.
Find more tutorials on the SAS Users YouTube channel.