BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
niam
Quartz | Level 8

I have come across this strange case in SAS Time Series Forecasting System and it seems that SAS is displaying wrong estimates.

Here is a simple example. 

When I fit a quadratic trend to my dataset using TSFS, I get the following estimates:

 

Model Parameter Estimate Std. Error T Prob>|T|
Intercept 3.85465 0.3471 11.1041 <.0001
Linear Trend -0.03373 0.0089 -3.7788 0.0003
Quadratic Trend -0.00117 0.000353 -3.3153 0.0013
Model Variance (sigma squared) 5.08917 . . .

Now, when I run a simple regression (proc reg) on the same dataset, using proc reg, I get totally different results

 

Parameter Estimates
Variable DF Parameter
Estimate
Standard
Error
t Value Pr> |t|
Intercept 1 2.52733 0.71316 3.54 0.0006
t 1 0.08578 0.03465 2.48 0.0152
t2 1 -0.00117 0.00035340 -3.32 0.0013

Interestingly, the oredicted values shown by TSFS system seem to be following the estimates shown by the PROC REG. 

Here is the predicted values of TSFS for the first three observations

 

Obs PERIOD ACTUAL Predicted value
for INCIDENTS
Upper 95% Confidence
Limit
Lower 95% Confidence
Limit
Prediction error
for INCIDENTS
Prediction standard
error
Normalized prediction
error for INCIDENTS
Linear Trend Quadratic Trend
1 NOV2009 1.0000 2.6119 7.0335 -1.8096 -1.6119 2.2559 -0.7145 -50 2500
2 DEC2009 2.0000 2.6942 7.1157 -1.7273 -0.6942 2.2559 -0.3077 -49 2401
3 JAN2010 4.0000 2.7741 7.1956 -1.6474 1.2259 2.2559 0.5434 -48 2304

 

The values in the fourth column (Predicted value for INCIDENTS) are based the parameter estimates form PROC REG and not the estimates from TSFS. for example, for first observations:

 

2.52733+ 0.08578*1 -0.00117*(1^2)=2.6119

 

I noticed that this discrepancy between the estimates of proc OLS and TSFS happens only when there are high order terms included in the model (such as quadratic or cubic). 

Why is this happening and how can I get correct estimates using TSFS. 

 

1 ACCEPTED SOLUTION

Accepted Solutions
dw_sas
SAS Employee

Hi,

 

The differences you observed in the parameter estimates between the TSFS results and PROC REG are due to a difference in the definition of the linear and quadratic time trend terms.  For the quadratic time trend curve, the TSFS defines the _LINEAR_ regressor as  t - c, and the _QUAD_ regressor as ( t - c ) **2, whereas your PROC REG step uses t and t**2, respectively.  Details on the definitions of the time trend terms used by the TSFS can be found in the following section of the documentation:

 

http://support.sas.com/documentation/cdl/en/etsug/68148/HTML/default/viewer.htm#etsug_tffordet_sect0...

 

For your data (and most of the other series I tested), c=50.  You can determine the value of "c" used by the TSFS by looking at the Forecast Table in the Model Viewer.  The last two columns of this table contain the values of the _LINEAR_ and _QUAD_ variables.

 

To reproduce the parameter estimates observed in the TSFS by using PROC REG, you will need to define your time trend variables in the same manner as was used in the TSFS.  Please note that the predicted values will be the same between the TSFS and PROC REG, regardless of which definition of the time trend terms is used.

 

I hope this helps!

DW

View solution in original post

2 REPLIES 2
dw_sas
SAS Employee

Hi,

 

The differences you observed in the parameter estimates between the TSFS results and PROC REG are due to a difference in the definition of the linear and quadratic time trend terms.  For the quadratic time trend curve, the TSFS defines the _LINEAR_ regressor as  t - c, and the _QUAD_ regressor as ( t - c ) **2, whereas your PROC REG step uses t and t**2, respectively.  Details on the definitions of the time trend terms used by the TSFS can be found in the following section of the documentation:

 

http://support.sas.com/documentation/cdl/en/etsug/68148/HTML/default/viewer.htm#etsug_tffordet_sect0...

 

For your data (and most of the other series I tested), c=50.  You can determine the value of "c" used by the TSFS by looking at the Forecast Table in the Model Viewer.  The last two columns of this table contain the values of the _LINEAR_ and _QUAD_ variables.

 

To reproduce the parameter estimates observed in the TSFS by using PROC REG, you will need to define your time trend variables in the same manner as was used in the TSFS.  Please note that the predicted values will be the same between the TSFS and PROC REG, regardless of which definition of the time trend terms is used.

 

I hope this helps!

DW

niam
Quartz | Level 8

Thank you very much for your response! It was really helpful.

 

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

Multiple Linear Regression in SAS

Learn how to run multiple linear regression models with and without interactions, presented by SAS user Alex Chaplin.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 962 views
  • 0 likes
  • 2 in conversation