bph0103
Calcite | Level 5

I am building OLS regression models on historical data to forecast quarterly balances for banking products (loans, mortgages, deposits, etc.) for the Dodd-Frank/CCAR exercises. One problem we have run into is that the value for the last historical period (the jump-off point from which the forecast starts) is sometimes unexpectedly high or low, possibly because of a business action such as a temporary interest rate change for marketing purposes, or an unusual event in the marketplace. Because the forecast starts from that out-of-the-ordinary jump-off point, the forecast itself comes out unusually high or low. Any ideas on how to adjust for this in the model? An initial thought is to forecast from an earlier data point whose growth rate is within a certain acceptable range. Any thoughts, ideas, or references to scholarly papers on this topic would be helpful. Thanks.
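To make the initial thought above concrete, here is a minimal sketch of restarting the forecast from the most recent quarter whose growth rate is acceptable. It is written in Python purely for illustration, not SAS; the function name `pick_jump_off`, the ±5% band, and the sample balances are all assumptions made up for the example.

```python
def pick_jump_off(balances, min_growth=-0.05, max_growth=0.05):
    """Return the index of the most recent quarter whose quarter-over-quarter
    growth rate lies inside [min_growth, max_growth] (band is an assumption)."""
    # Walk backward from the latest quarter toward the start of the series.
    for i in range(len(balances) - 1, 0, -1):
        growth = balances[i] / balances[i - 1] - 1.0
        if min_growth <= growth <= max_growth:
            return i
    # No quarter qualified; fall back to the first observation.
    return 0


# Made-up quarterly balances; the last quarter spikes by roughly 15%.
balances = [100.0, 102.0, 103.5, 105.0, 121.0]
jump_off_index = pick_jump_off(balances)
jump_off_value = balances[jump_off_index]
```

With this data the spike quarter is skipped and the forecast would start from the quarter before it. The acceptable band would have to come from the product's own growth history.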

4 REPLIES
PGStats
Opal | Level 21

I don't have time to figure out what a "Dodd-Frank/CCAR exercise" is, but identifying outlier points in OLS regression is best done with PROC ROBUSTREG. Check the OUTLIER= option in the OUTPUT statement.
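For readers who want to see the general idea of flagging outliers against a robust fit, here is a rough Python sketch. It is not what PROC ROBUSTREG computes: it swaps in a Theil-Sen line (median of pairwise slopes) and a median-absolute-deviation cutoff as a simpler stand-in. The function name `flag_outliers`, the cutoff of 3, and the data are assumptions for illustration only.

```python
from itertools import combinations
from statistics import median


def flag_outliers(x, y, cutoff=3.0):
    """Fit a Theil-Sen line to (x, y) and flag points whose residual is more
    than `cutoff` robust standard deviations from the median residual.
    Assumes the x values are distinct."""
    points = list(zip(x, y))
    # Theil-Sen slope: median of the slopes over all pairs of points.
    slope = median((yj - yi) / (xj - xi)
                   for (xi, yi), (xj, yj) in combinations(points, 2))
    intercept = median(yi - slope * xi for xi, yi in points)
    residuals = [yi - (intercept + slope * xi) for xi, yi in points]
    center = median(residuals)
    # 1.4826 * MAD estimates the standard deviation under normal errors.
    scale = 1.4826 * median(abs(r - center) for r in residuals)
    if scale == 0:
        raise ValueError("residual scale is zero; cannot standardize")
    return [abs(r - center) / scale > cutoff for r in residuals]


# Made-up quarterly series with an unusually high final quarter.
quarters = [0, 1, 2, 3, 4, 5, 6, 7]
balances = [100.0, 102.5, 103.8, 106.1, 108.3, 109.7, 112.2, 130.0]
flags = flag_outliers(quarters, balances)
```

Only the final quarter is flagged for this series. In SAS, the OUTLIER= variable written by the OUTPUT statement plays the role of `flags` here.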

PG
bph0103
Calcite | Level 5

Thanks for the reply, PG. Actually, Dodd-Frank/CCAR can be put aside. Basically, I have a historical time series of quarterly data that I used to develop an OLS regression equation, and I am using that equation to forecast future time points. The problem is that the last historical time point IS an outlier. So my question is: if I still want to use that last (outlier) point as the jump-off point for the forecast, is there a way to adjust either that point or the resulting forecast to account for it? Currently my forecast is unreasonable because the jump-off (last historical) point was unreasonable to start with.

PGStats
Opal | Level 21

The idea would be to reduce the importance of that last point in your forecasting equation, either by removing it, downweighting it, or diluting it by using more terms in your model. 
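As a rough illustration of the removing and downweighting options, here is a small weighted least squares sketch in Python (not SAS). It assumes a simple linear trend in time, which may not match the actual forecasting equation; the function name `weighted_line_fit`, the weight of 0.1, and the data are all made up for the example.

```python
def weighted_line_fit(x, y, w):
    """Weighted least squares for y = a + b*x; returns (a, b)."""
    total_w = sum(w)
    x_bar = sum(wi * xi for wi, xi in zip(w, x)) / total_w
    y_bar = sum(wi * yi for wi, yi in zip(w, y)) / total_w
    s_xy = sum(wi * (xi - x_bar) * (yi - y_bar) for wi, xi, yi in zip(w, x, y))
    s_xx = sum(wi * (xi - x_bar) ** 2 for wi, xi in zip(w, x))
    slope = s_xy / s_xx
    return y_bar - slope * x_bar, slope


# Made-up data: a clean trend of +2 per quarter, then a spike in the last quarter.
quarters = list(range(8))
balances = [100.0, 102.0, 104.0, 106.0, 108.0, 110.0, 112.0, 130.0]

equal_weights = [1.0] * 8          # ordinary least squares
downweighted = [1.0] * 7 + [0.1]   # last point counts one tenth (assumed weight)
removed = [1.0] * 7 + [0.0]        # weight 0 is the same as dropping the point

a_ols, b_ols = weighted_line_fit(quarters, balances, equal_weights)
a_down, b_down = weighted_line_fit(quarters, balances, downweighted)
a_drop, b_drop = weighted_line_fit(quarters, balances, removed)
```

The fitted slope moves from the spike-inflated OLS value toward the underlying trend as the last point's weight shrinks. How much weight to give it is a judgment call.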

PG
mkeintz
PROC Star

You could use, say, the 4-quarter trailing average for the jump-off date.
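A minimal sketch of that suggestion, in Python rather than SAS and with made-up numbers; the function name `trailing_average_jump_off` is just for illustration.

```python
def trailing_average_jump_off(balances, window=4):
    """Average of the last `window` observations, used in place of the
    final observation as the forecast's starting value."""
    if len(balances) < window:
        raise ValueError("not enough history for the requested window")
    recent = balances[-window:]
    return sum(recent) / window


# Made-up quarterly balances; the last quarter is unusually high.
balances = [100.0, 102.0, 103.5, 105.0, 121.0]
jump_off = trailing_average_jump_off(balances)
```

Here the jump-off becomes 107.875 instead of 121.0. One trade-off to keep in mind is that on a steadily growing series a trailing average sits below the latest level, so the window length matters.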

 


