About SASCom1

SASCom1 · ‎07-22-2025

You can use any estimation method when fitting the ARIMA model to the input series to obtain the ARMA parameters for the prewhitening filter. However, when you compute the prewhitened series manually, use METHOD=CLS in the filtering step: residuals computed with METHOD=CLS match those obtained from applying the prewhitening filter in PROC ARIMA, so the final cross correlations match. Other estimation methods produce different forecasts and therefore different residuals.

SASCom1 · ‎07-21-2025

In your manual computation, the ARMA parameters come from METHOD=ML, while the direct PROC ARIMA step uses the default METHOD=CLS. To avoid using different parameters in the manual computation, you can save the parameters from the direct PROC ARIMA step instead of estimating the model again in another step. Also, regardless of which estimation method is used to obtain the ARMA prewhitening filter, you may want to use the default METHOD=CLS when applying the filter to the response and input series in the manual computation.

Here is a modified program that recomputes the cross correlations with prewhitening in PROC ARIMA, following the steps outlined earlier. The recomputed cross correlations match those from PROC ARIMA directly.

    proc iml;
       phi = {1 -0.5};
       theta = {1 0.8};
       Ytseries = armasim(phi, theta, 125, 1, 100, -1234321);
       create Yt from Ytseries[colname={'Yt'}];
       append from Ytseries;
    quit;

    data test;
       Xt = 100; nl = 0; al = 0;
       do i = 1 to 100;
          a = rannor(12345);
          n = 0.75 * nl + 0.5 * al + a;
          al = a;
          nl = n;
          z = n + 1;
          Xt = Xt + z;
          date = intnx('month', '1jan1988'd, i-1);
          format date monyy.;
          output;
       end;
       drop nl al i a n z;
    run;

    data test_new;
       retain date Xt;
       set test;
    run;

    data final;
       merge test_new yt;
       dXt = dif(Xt);
    run;

    proc print data = final; run;

    /* cross correlations with prewhitening in PROC ARIMA directly */
    proc arima data = final;
       identify var = Xt(1);
       estimate p = 1 q = 1 /* method = ml */ outest = est(where=(_type_='EST'));
       identify var = Yt crosscorr = (Xt(1))
                outcov = cov1(where = (crossvar ne ' ') keep = lag corr crossvar rename = (corr = corr1));
    run; quit;

    /* recompute cross correlations with prewhitening in steps */

    /* step 1. save ARMA parameters from the above model estimates */
    data _null_;
       set est;
       call symput('ma', MA1_1);
       call symput('ar', AR1_1);
    run;

    /* step 2. prewhitening */
    /* since first differencing is specified for input, create differenced x */
    data dif;
       set final;
       dif1x = dif(Xt);
    run;

    /* centering y and centering differenced x series */
    proc means data = dif;
       var dif1x Yt;
       output out = outm mean = mdif1x my;
    run;

    proc print data = outm; run;

    data centerdif;
       set dif;
       if _n_ = 1 then set outm;
       cy = Yt - my;
       cdif1x = dif1x - mdif1x;
    run;

    proc print data = centerdif; run;

    /* prewhiten x series using previously fitted model, and save residuals; */
    /* x used is first differenced, and then centered */
    proc arima data = centerdif;
       identify var = cdif1x;
       estimate p = 1 q = 1 noconstant ar = &ar ma = &ma noest;
       forecast out = outx(rename = (residual = residx) keep = residual) lead = 0;
    run;

    /* prewhiten y series using the same previously fitted model, and save residuals; */
    /* y used is centered */
    proc arima data = centerdif;
       identify var = cy;
       estimate p = 1 q = 1 noconstant ar = &ar ma = &ma noest;
       forecast out = outy(rename = (residual = residy) keep = residual) lead = 0;
    run;

    proc print data = outx; run;
    proc print data = outy; run;

    /* combine the residuals for the two prewhitened x and y */
    data combine;
       set outx;
       set outy;
    run;

    proc print data = combine; run;

    /* step 3. using PROC ARIMA to compute crosscorrelations on the two prewhitened series */
    proc arima data = combine;
       identify var = residy crosscorr = residx
                outcov = cov2(where = (crossvar ne ' ') keep = crossvar lag corr rename = (corr = corr2));
    run; quit;

    /* compare the crosscorrelations from the original ARIMA step   */
    /* with the crosscorrelations from the ARIMA step run on the    */
    /* prewhitened series. The crosscorrelations are the same.      */
    data compare;
       merge cov1 cov2;
       by lag;
       drop crossvar;
    run;

    proc print data = compare; run;

SASCom1 · ‎07-18-2025

1. If you use PROC ARIMA to compute the cross correlations between the prewhitened series, the procedure should automatically exclude observations that are missing because of the different differencing orders, so you do not need to delete them manually (although you can if you want to).

2. Whether or not you specify the NOCONSTANT option in the ARMA model for the input, the NOCONSTANT option is specified in the filtering stage because the series (or differenced series) have already been centered. If you instead leave the constant to be estimated, its estimate is likely to be very close to zero because the series have been centered, so the impact may not be significant.

I hope this helps.

SASCom1 · ‎07-17-2025

@sasalex2024 , yes your understanding of the steps looks correct to me. If you follow the steps but the resulting cross correlations computed do not match those from PROC ARIMA directly, please let me know.

SASCom1 · ‎07-16-2025

When you estimate the model for the input series and save the parameters, you do not take an extra step to center the input series. The model is estimated exactly as specified, and the estimated ARMA parameters are the ones used later in the prewhitening filter. You save the ARMA parameter estimates from this step, not the residuals.

Centering is done only when the prewhitening filter is applied to the response and to the input. That is, the prewhitening filter is applied to the centered response series and the centered input series (or the centered differenced series if a differencing order is specified). You obtain the residuals after applying the prewhitening filter to both the centered response and the centered input.

To avoid confusion, I will use the following example in the documentation to illustrate the steps to recompute the cross correlations with prewhitening in PROC ARIMA. Note that the example specifies no differencing for either y or x in any of the statements. If differencing orders are specified, the prewhitening steps are the same, except that the series need to be differenced before the prewhitening filter is applied (you can create the differenced series and then treat them the same way as in the no-differencing case).

SAS Help Center: Model for Series J Data from Box and Jenkins

The relevant code in the PROC ARIMA documentation example is the following:

    proc arima data=seriesj;
       /*--- Look at the input process ----------------------------*/
       identify var=x;
       /*--- Fit a model for the input ----------------------------*/
       estimate p=3 plot;
       /*--- Cross-correlation of prewhitened series ---------------*/
       identify var=y crosscorr=(x) nlag=12;
    run;

(1). When PROC ARIMA processes the first IDENTIFY statement and the ESTIMATE statement for the input x, the AR(3) model is fit to the x series directly. This is standard IDENTIFY-ESTIMATE statement processing. The estimated parameters from this model are used in the prewhitening filter later. The prewhitening filter is shown in Output 8.3.5:

    Output 8.3.5: Prewhitening Filter
    Autoregressive Factors
    Factor 1: 1 - 1.97607 B**(1) + 1.37499 B**(2) - 0.34336 B**(3)

To reproduce this step, specify the same IDENTIFY and ESTIMATE statements as above, and use the OUTEST= option in the ESTIMATE statement to save the estimated AR parameters.

(2). In the next IDENTIFY statement, the series are prewhitened with the above filter before the cross correlations are computed. Before the filter is applied, the series y and x are centered, that is, centered y = y - mean(y) and centered x = x - mean(x). The prewhitening AR filter is then applied to centered y and centered x, and the resulting residuals are the prewhitened series.

To reproduce these results yourself, you can fit an AR(3) model with the NOCONSTANT option to the centered y and centered x, but fix the AR parameters at the estimates above by using the AR= option and the NOEST option in the ESTIMATE statement. For example, say you have created the centered x as cx and the centered y as cy in the data set:

    proc arima data = ;
       identify var = cx ;
       estimate p=3 noconstant ar = 1.97607 -1.37499 0.34336 noest ;

Then use the FORECAST statement to save the residuals (prewhitened x):

       forecast out = outx(keep = residual) lead = 0 ;

Do the same for the centered y, and save the residuals (prewhitened y) into a data set. Now you have prewhitened y and prewhitened x, that is, the two residual series saved above.

Finally, the cross correlations are computed from the two prewhitened series. You can use the following step with the IDENTIFY statement:

    proc arima ;
       identify var=residualy crosscorr=residualx ;

The cross correlations from the above step would be the same as those obtained in the PROC ARIMA steps in the documentation example.

*Note: If you specify differencing for x and y, for example, if you specify the following instead:

    proc arima data=;
       identify var=x(1);
       estimate p = 3;
       identify var=y(12) crosscorr=(x(1)) ;

then the only change you need to make in the above steps is this: instead of creating the centered x and centered y shown above, you first create difx = x(1) and dif12y = y(12), and then center difx and dif12y to obtain the two centered differenced series. All other steps follow the same logic as in the above example.

I hope this helps. Please let me know if you have further questions.
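The steps described in this post can be put together as a rough sketch for the Series J example. This is only an illustration under some assumptions: it assumes the seriesj data set from the documentation example is available with variables x and y, and the data set names (est, means, centered, outx, outy, both), the centered variables cx and cy, and the macro variables ar1-ar3 are names made up for this sketch.

```sas
/* Step 1: fit AR(3) to x directly and save the AR estimates */
proc arima data=seriesj;
   identify var=x noprint;
   estimate p=3 outest=est(where=(_type_='EST'));
run; quit;

data _null_;
   set est;
   call symputx('ar1', AR1_1);
   call symputx('ar2', AR1_2);
   call symputx('ar3', AR1_3);
run;

/* Step 2: center x and y */
proc means data=seriesj noprint;
   var x y;
   output out=means mean=mx my;
run;

data centered;
   if _n_ = 1 then set means(keep=mx my);
   set seriesj;
   cx = x - mx;
   cy = y - my;
run;

/* Step 3: apply the fixed AR(3) filter (NOEST) to each centered series */
proc arima data=centered;
   identify var=cx noprint;
   estimate p=3 noconstant ar=&ar1 &ar2 &ar3 noest noprint;
   forecast lead=0 out=outx(keep=residual rename=(residual=residx));
run; quit;

proc arima data=centered;
   identify var=cy noprint;
   estimate p=3 noconstant ar=&ar1 &ar2 &ar3 noest noprint;
   forecast lead=0 out=outy(keep=residual rename=(residual=residy));
run; quit;

/* Step 4: cross correlations of the two prewhitened series */
data both;
   merge outx outy;   /* one-to-one merge, no BY statement */
run;

proc arima data=both;
   identify var=residy crosscorr=(residx) nlag=12;
run; quit;
```

The filtering steps rely on the default METHOD=CLS, and the cross correlations printed by the last step can be compared by eye with those from the documentation example.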

SASCom1 · ‎07-15-2025

Hi @sasalex2024 Your steps look fine. However, some details in step 2 and step 3 are not mentioned, so I will add them here. After you have estimated the ARMA model for the input, you save the ARMA parameter estimates to be used when applying the prewhitening filter. When applying the prewhitening filter to the response and input, you center the series (or center the differenced series if a differencing order is specified) before applying the filter. I hope this helps. Please let me know if you have further questions.

SASCom1 · ‎07-11-2025

@sasalex2024 , If you want to specify different differencing orders for the response and input series, you can specify your desired differencing orders directly in the VAR= option and the CROSSCORR= option in the IDENTIFY statement.

If you specify an ARIMA model for the input before the IDENTIFY statement for the response variable with the CROSSCORR= option, and you specify the same differencing order for the input there as in the CROSSCORR= option, there should be no ambiguity in the differencing order applied to the input prior to prewhitening. The orders specified in the VAR= and CROSSCORR= options in the IDENTIFY statement for the response (the input order also being the same as in the earlier VAR= option in the IDENTIFY statement for the input) are the differencing orders used prior to prewhitening. The section in the documentation you referenced is only meant to explain what differencing order is used before the prewhitening filter is applied in the event that the IDENTIFY statement for the input variable specifies a different differencing order than the CROSSCORR= option in the IDENTIFY statement for the response.

If the IDENTIFY statement for the response variable with the CROSSCORR= option is followed by an ESTIMATE statement with the P= and/or Q= option, then the differencing orders used on the response and input variables during the estimation stage are exactly those specified in the VAR= option and the CROSSCORR= option in that IDENTIFY statement.

I am not aware of discussions of contexts in which you would want to specify different differencing orders in estimation. You may research the literature to see if you can find more detailed discussions. However, transfer function identification is a complicated process, and it may take some trials to decide on the appropriate model to fit to certain data. If, after the identification stage, you decide from the results that you should specify different differencing orders for estimation, you can always specify your final desired differencing orders in another IDENTIFY statement followed by an ESTIMATE statement to get your final estimation results. I hope this helps.

SASCom1 · ‎05-27-2025

In PROC AUTOREG, in addition to the ARCH/GARCH model specification, you can also use the HETERO statement to specify the error variance function:
https://go.documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/etsug/etsug_autoreg_syntax11.htm

The HETERO statement can also be used together with a GARCH model to add variables to the variance function of the standard GARCH equation, as discussed here:
https://go.documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/etsug/etsug_autoreg_details12.htm#etsug.autoreg.heterogarch

PROC MODEL allows more flexible specification of error variance structures, as discussed here:
https://go.documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/etsug/etsug_model_sect133.htm

If you have a specific form of heteroscedasticity in mind but are not sure how to specify your model, please provide more details.
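As a minimal sketch of the two PROC AUTOREG specifications mentioned above, the following uses simulated data. The data set het, the variables x, z, and y, and the variance pattern are all made up for illustration; the simulated errors have no GARCH effects, so the second fit is only meant to show the syntax.

```sas
/* Hypothetical data: the error variance increases with z */
data het;
   call streaminit(101);
   do t = 1 to 400;
      x = rand('normal');
      z = rand('uniform');
      e = sqrt(0.5 + 2*z) * rand('normal');
      y = 1 + 0.8*x + e;
      output;
   end;
run;

/* Error variance function specified with the HETERO statement alone */
proc autoreg data=het;
   model y = x;
   hetero z / link=linear;
run;

/* GARCH(1,1) with z added to the conditional variance equation */
proc autoreg data=het;
   model y = x / garch=(p=1, q=1);
   hetero z;
run;
```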

SASCom1 · ‎04-04-2025

Hi @SAS_Illyrian We did some further investigation, and our test confirmed the following:

(1). If you specify only the P= option and no Q= option, PROC VARMAX automatically forecasts the future independent variables if they are not provided in the data set, using the same VAR model (the same order as in the P= option) as for the dependent variables. It then obtains forecasts of the dependent variables using those forecasted independent variables for the future periods.

(2). If you specify the Q= option (with or without the P= option), then unfortunately the procedure does not automatically obtain forecasts when future independent variables are not provided in the data set, and it issues the warning you observed in the log. This is a bug in the procedure, and we are working on a fix so that in the future it behaves the same as in case (1) above.

In the meantime, if you want future forecasts in case (2), you may need to first forecast the future independent variables yourself, save those forecasts, and append them to the end of the data. Then you can run PROC VARMAX on the appended data set to obtain future forecasts of the dependent variables. I hope this helps.
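The workaround for case (2) might look roughly like the sketch below. Everything here is an assumption for illustration: the simulated data set hist, the variables y1, y2, x, and date, the AR(1) model used to forecast x with PROC ARIMA, and the 6-period horizon. Any reasonable forecasting method for the independent variable could be substituted.

```sas
/* Hypothetical monthly data: two dependent series and one independent series */
data hist;
   call streaminit(11);
   x_lag = 0; y1_lag = 0; y2_lag = 0;
   do i = 1 to 120;
      date = intnx('month', '01jan2015'd, i-1);
      x  = 0.7*x_lag + rand('normal');
      y1 = 0.5*y1_lag + 0.2*y2_lag + 0.8*x + rand('normal');
      y2 = 0.3*y2_lag + 0.4*x + rand('normal');
      output;
      x_lag = x; y1_lag = y1; y2_lag = y2;
   end;
   format date monyy7.;
   keep date x y1 y2;
run;

/* 1. Forecast the independent variable 6 periods ahead */
proc arima data=hist;
   identify var=x noprint;
   estimate p=1 noprint;
   forecast lead=6 id=date interval=month out=xfc noprint;
run; quit;

/* 2. Keep only the future rows and use the forecast as the value of x */
data future;
   set xfc;
   where x is missing;
   x = forecast;
   keep date x;
run;

/* 3. Append the future rows; y1 and y2 are missing in those rows */
data appended;
   set hist future;
run;

/* 4. Run PROC VARMAX on the appended data */
proc varmax data=appended;
   id date interval=month;
   model y1 y2 = x / p=1 q=1;
   output out=yfc lead=6;
run;
```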

SASCom1 · ‎03-28-2025

Hi @Taliah I think this is probably what you are confused about. If you look at the regression model with an AR error process discussed in the PROC AUTOREG documentation:

SAS Help Center: Autoregressive Error Model

you can see that there is a negative sign in front of the AR parameter φ1 in the PROC AUTOREG specification. If you write out the complete model with AR(1) error:

    y_t = x_t′β + ν_t
    ν_t = ϵ_t − φ1*ν_{t-1}

then, since ν_{t-1} = y_{t-1} − x_{t-1}′β, this implies that

    y_t = x_t′β − φ1*(y_{t-1} − x_{t-1}′β) + ϵ_t      (1)

Note the negative sign in front of φ1 in the above equation. This is the opposite sign for the AR parameter compared with the usual ARIMA model expression, as in PROC ARIMA:

    y_t = x_t′β + ϵ_t/ϕ(B),  where ϕ(B) = 1 − ϕ1*B for the AR(1) case.

If you multiply both sides of the above equation by (1 − ϕ1*B), you get the following:

    y_t − ϕ1*y_{t-1} = x_t′β − ϕ1*x_{t-1}′β + ϵ_t

which implies

    y_t = ϕ1*y_{t-1} + x_t′β − ϕ1*x_{t-1}′β + ϵ_t      (2)

If you compare (1) and (2), you can see that the two specifications have opposite signs on the AR parameter ϕ1. I hope this helps.
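One way to see the sign difference in practice is to fit the same regression with AR(1) errors in both procedures. The sketch below uses simulated data; the data set sim, the variable names, and the true AR coefficient of 0.6 (written in the PROC ARIMA convention) are assumptions for illustration. Under the two conventions compared above, the AR(1) estimate reported by PROC AUTOREG would be expected to be roughly the negative of the one reported by PROC ARIMA.

```sas
/* Hypothetical data: regression with AR(1) errors, nu_t = 0.6*nu_{t-1} + eps_t */
data sim;
   call streaminit(2025);
   nu_lag = 0;
   do t = 1 to 500;
      x   = rand('normal');
      eps = rand('normal');
      nu  = 0.6*nu_lag + eps;
      y   = 1 + 2*x + nu;
      nu_lag = nu;
      output;
   end;
   keep t x y;
run;

/* PROC AUTOREG convention: nu_t = eps_t - phi1*nu_{t-1} */
proc autoreg data=sim;
   model y = x / nlag=1 method=ml;
run;

/* PROC ARIMA convention: (1 - phi1*B) nu_t = eps_t */
proc arima data=sim;
   identify var=y crosscorr=(x) noprint;
   estimate p=1 input=(x) method=ml;
run; quit;
```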

SASCom1 · ‎03-26-2025

Hi @SAS_Illyrian When future values of the exogenous variables are provided in the data set, PROC VARMAX computes future forecasts using those supplied values. When future values of the exogenous variables are not provided in the input data set, PROC VARMAX is expected to compute future forecasts for the exogenous variables using the same VAR/VMA/VARMA model specification (that is, the same P= and Q= specification) as for the dependent variables, and then obtain forecasts for the dependent variables using those forecasted exogenous variables for the future periods. This should be done automatically, and you do not need to do anything additional manually. So the warning message you observed, and the fact that you do not get forecasts when future exogenous variables are not provided, may indicate an issue that needs further investigation. Can you contact Technical Support at https://support.sas.com/en/technical-support.html#contact so that we can look into this issue further? Thanks.

SASCom1 · ‎03-12-2025

To determine the correct code to specify, you need to know which series you want to test for heteroskedasticity. As mentioned earlier, the heteroskedasticity tests are all designed to test for heteroskedasticity in the residuals from the regression model specified in the procedure. If you specify

    y = const + beta*x ;
    fit y / white breusch = (1 z1 z2);

you are testing for heteroskedasticity in the residuals from the regression of y on const and x, i.e.,

    residual = y - const^ - beta^*x ;

If you have already obtained a residual series from this regression, or from some other regression, and you now specify

    residual = const + beta*time ;
    fit residual / white breusch = (1 time) ;

you are then testing for heteroskedasticity in the residual, call it residual2, from the regression of residual on const and time, i.e.,

    residual2 = residual - const^ - beta^*time

rather than testing for heteroskedasticity in the original residual series itself. I hope this helps.
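To make the distinction concrete, here is a small sketch with simulated data. The data set reg, the variables x, z1, z2, and time, the residual variable res1, and the parameter names are all hypothetical. The first PROC MODEL step tests the residuals of the regression of y on x; the second step, run on residuals saved by PROC REG, tests the residuals of a further regression of res1 on time, which is a different series.

```sas
/* Hypothetical data: the error variance depends on z1 */
data reg;
   call streaminit(7);
   do time = 1 to 300;
      x  = rand('normal');
      z1 = rand('uniform');
      z2 = rand('uniform');
      y  = 2 + 1.5*x + sqrt(0.5 + z1) * rand('normal');
      output;
   end;
run;

/* Tests apply to the residuals of y = const + beta*x */
proc model data=reg;
   parms const beta;
   y = const + beta*x;
   fit y / white breusch=(1 z1 z2);
run; quit;

/* Save the residuals of the same regression as res1 */
proc reg data=reg noprint;
   model y = x;
   output out=r residual=res1;
run; quit;

/* Tests now apply to the residuals of res1 = c0 + c1*time, not to res1 itself */
proc model data=r;
   parms c0 c1;
   res1 = c0 + c1*time;
   fit res1 / white breusch=(1 time);
run; quit;
```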

SASCom1 · ‎03-10-2025

I would like to clarify the following:

1. All heteroskedasticity tests are valid tests for heteroskedasticity; they just test for different forms of heteroskedasticity. Engle's ARCH test is one test that specifically tests for ARCH-type heteroskedasticity. The ARCHTEST=(ALL) option in PROC AUTOREG produces a variety of ARCH-type tests listed here, including Engle's ARCH test:
https://go.documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/etsug/etsug_autoreg_details27.htm#etsug.autoreg.hetnnortsts
The BREUSCH=( ) option in PROC MODEL performs the Breusch-Pagan heteroskedasticity test. To do this test, you need to name the variables that are assumed to be in the conditional variance function.

2. When you do the ARCH tests in PROC AUTOREG, the procedure provides output for the tests for ARCH orders up to 12. The test results at all orders are helpful diagnostics that you can look at. The purpose of the test is to determine whether there is ARCH-type heteroskedasticity; if so, you may need to account for it in your model.

3. If the ARCH tests find a significant ARCH effect, you may want to account for the ARCH effect in your model. In practice, it may not be easy to determine the ARCH/GARCH order. Information criteria are one commonly used method to help you decide on the ARCH/GARCH order. You may do some research in the literature to see if other selection methods have been proposed.

I hope this helps. Good luck!
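As a rough illustration of points 1 to 3, the sketch below simulates a series with an ARCH(1) error, requests the ARCH-type tests, and then fits a few candidate ARCH/GARCH orders so that the information criteria reported in the fit summaries can be compared. The data set arch, the variable names, and the candidate orders are assumptions for this example only.

```sas
/* Hypothetical data: constant mean with ARCH(1) errors */
data arch;
   call streaminit(42);
   e_lag = 0;
   do t = 1 to 1000;
      h = 0.2 + 0.5*e_lag**2;        /* conditional variance */
      e = sqrt(h) * rand('normal');
      y = 1 + e;
      e_lag = e;
      output;
   end;
   keep t y;
run;

/* ARCH-type tests on the residuals of the mean model */
proc autoreg data=arch;
   model y = / archtest=(all);
run;

/* Candidate orders; compare AIC and SBC across the three fits */
proc autoreg data=arch;
   model y = / garch=(q=1);
   model y = / garch=(q=2);
   model y = / garch=(p=1, q=1);
run;
```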

SASCom1 · ‎03-07-2025

I am afraid I do not have an answer to that question. When you perform a Breusch-Pagan test, you are making an assumption and testing that assumption: that the error variance is not constant but changes with the variables z1, z2, etc. that you want to test, according to

    h_t = sigma^2*(alpha0 + alpha1*z1 + alpha2*z2 + ...)

Different data may have different variables, or may not have any variables, impacting the error variance. Similarly, the ARCH test also makes an assumption and tests it: that the conditional error variance depends on the past errors, according to

    h_t = alpha0 + alpha1*(epsilon_{t-1})^2 + alpha2*(epsilon_{t-2})^2 + ... + alpha_p*(epsilon_{t-p})^2

You might examine the behavior and pattern of the squared residuals and see if that gives you some indication. If you have reasons to suspect a particular form and make an assumption, you can test that assumption. Good luck!

SASCom1 · ‎03-06-2025

You are welcome, @sasalex2024 . z1 and z2 in the breusch=(1 z1 z2) option are the variables that you want to test in the Breusch-Pagan test, that is, whether the error variance depends on these variables. The variable(s) need to be available in the data set read in by PROC MODEL in order for you to specify them in the BREUSCH=( ) option. I hope this helps.
