<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Improving VAR Model Accuracy using VARX with Outlier Dummy Variables in SAS Forecasting and Econometrics</title>
    <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Improving-VAR-Model-Accuracy-using-VARX-with-Outlier-Dummy/m-p/987703#M5062</link>
    <description>&lt;P&gt;In my case, I also only use 2 regions, just like you mentioned.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Btw, I have a follow-up question regarding the implementation in SAS PROC VARMAX. I’m trying to model the dummy variables as exogenous variables. However, when I run the procedure using only my in-sample data (e.g., 100 observations), SAS does not produce any forecast output. I then tried to generate the forecasts manually using the estimated coefficients, but the forecast values gradually became smaller and even turned negative over time. I’m wondering whether I made a mistake in the data setup. Should the dataset already include the future time ID rows (with missing values for Y but fill the future dummy (X) values with 0s)?&lt;/P&gt;</description>
    <pubDate>Mon, 11 May 2026 11:12:08 GMT</pubDate>
    <dc:creator>tugasakhir</dc:creator>
    <dc:date>2026-05-11T11:12:08Z</dc:date>
    <item>
      <title>Improving VAR Model Accuracy using VARX with Outlier Dummy Variables</title>
      <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Improving-VAR-Model-Accuracy-using-VARX-with-Outlier-Dummy/m-p/987476#M5057</link>
      <description>&lt;P&gt;I’m currently working on a multivariate time series forecasting project. Initially, I developed a VAR (Vector Autoregression) model, but the forecast error was significantly high. Upon diagnostic checking, I noticed large residuals at specific time points, indicating the presence of outliers that the endogenous variables couldn't explain.&lt;/P&gt;&lt;P&gt;To address this, I’m shifting to a VARX model. My plan is to use dummy variables as exogenous inputs (X), where the dummy equals 1 at the specific time points where the residuals are outliers and 0 otherwise. My question is should these dummy variables be included in both the training and testing datasets?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any advice or best practices for handling "Outlier Dummies" would be greatly appreciated!&lt;/P&gt;</description>
      <pubDate>Thu, 07 May 2026 13:46:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Improving-VAR-Model-Accuracy-using-VARX-with-Outlier-Dummy/m-p/987476#M5057</guid>
      <dc:creator>tugasakhir</dc:creator>
      <dc:date>2026-05-07T13:46:35Z</dc:date>
    </item>
    <item>
      <title>Re: Improving VAR Model Accuracy using VARX with Outlier Dummy Variables</title>
      <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Improving-VAR-Model-Accuracy-using-VARX-with-Outlier-Dummy/m-p/987558#M5058</link>
      <description>&lt;P&gt;I guess you are using PROC VARMAX. Correct?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-processed="true" data-subtree="aimfl,mfl"&gt;The&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG class="Yjhzub" data-processed="true" data-sfc-cb="" data-sfc-root="c"&gt;PROC VARMAX&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;procedure in SAS is primarily used for multivariate time series analysis and modeling, but it does &lt;STRONG&gt;NOT&lt;/STRONG&gt; have a single dedicated and automated &lt;STRONG&gt;OUTLIER statement&lt;/STRONG&gt; like some univariate procedures (e.g., PROC ARIMA or PROC X13).&lt;/SPAN&gt;&lt;SPAN class="uJ19be notranslate" data-processed="true" data-sfc-cb="" data-wiz-uids="vAgI9e_i,vAgI9e_j" data-sfc-root="c"&gt;&lt;SPAN class="vKEkVd" data-processed="true" data-wiz-attrbind="class=vAgI9e_h/TKHnVd" data-animation-atomic=""&gt;&lt;SPAN data-processed="true" aria-hidden="true"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="uJ19be notranslate" data-processed="true" data-sfc-cb="" data-wiz-uids="vAgI9e_i,vAgI9e_j" data-sfc-root="c"&gt;&lt;SPAN class="vKEkVd" data-processed="true" data-wiz-attrbind="class=vAgI9e_h/TKHnVd" data-animation-atomic=""&gt;&lt;SPAN data-processed="true" aria-hidden="true"&gt;You could work with outlier dummies&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;but only in the training data and not in the testing data. Your testing data is historical data, but ultimately your model is used to forecast the future. And the outliers in the future are unknown and cannot be forecast (predicted). So, for an honest assessment of the forecast error after deployment of your model in production ... don't put the outlier dummies in your test data.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;It is important that (a few) outliers do not disrupt the training process so that you can perform pattern recognition optimally. But then your model is industrialized (deployment in production), and then you really have to hope that outliers no longer occur (a vain hope, of course).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Perhaps you will find some sort of forecast driver that underlies the outliers. If that is the case, you must include that forecast driver as an independent variable in the model. And perhaps you know the future of that independent variable?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;BR, Koen&lt;/P&gt;</description>
      <pubDate>Fri, 08 May 2026 10:17:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Improving-VAR-Model-Accuracy-using-VARX-with-Outlier-Dummy/m-p/987558#M5058</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2026-05-08T10:17:18Z</dc:date>
    </item>
    <item>
      <title>Re: Improving VAR Model Accuracy using VARX with Outlier Dummy Variables</title>
      <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Improving-VAR-Model-Accuracy-using-VARX-with-Outlier-Dummy/m-p/987651#M5059</link>
      <description>&lt;P&gt;thankyou&lt;/P&gt;</description>
      <pubDate>Sun, 10 May 2026 03:56:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Improving-VAR-Model-Accuracy-using-VARX-with-Outlier-Dummy/m-p/987651#M5059</guid>
      <dc:creator>tugasakhir</dc:creator>
      <dc:date>2026-05-10T03:56:42Z</dc:date>
    </item>
    <item>
      <title>Re: Improving VAR Model Accuracy using VARX with Outlier Dummy Variables</title>
      <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Improving-VAR-Model-Accuracy-using-VARX-with-Outlier-Dummy/m-p/987691#M5061</link>
      <description>&lt;P&gt;My pleasure.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;To be honest, I wasn't quite sure what you meant by "testing data". A hold-out set or an out-of-sample region?&lt;BR /&gt;But it doesn’t matter for my answer.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;There are 1, 2 or 3 regions when you want to model a univariate time series:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;The in-sample (training) region,&lt;/LI&gt;
&lt;LI&gt;the hold-out (validation) region and&lt;/LI&gt;
&lt;LI&gt;the out-of-sample (testing) region.&lt;BR /&gt;(I often drop the latter&amp;nbsp;to ensure there is enough data left for modelling)&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;In any case, it is better to smooth out outliers (additive, level shift) only in the in-sample region to prevent them from distorting parameter estimation and to allow for proper pattern recognition.&lt;/P&gt;
&lt;P&gt;And thus ... it is better to not remove shocks in the hold-out and out-of-sample regions for the reasons mentioned earlier. After all, the hold-out set is&amp;nbsp;meant to mimic "unseen" future data.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;BR, Koen&lt;/P&gt;</description>
      <pubDate>Mon, 11 May 2026 09:04:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Improving-VAR-Model-Accuracy-using-VARX-with-Outlier-Dummy/m-p/987691#M5061</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2026-05-11T09:04:12Z</dc:date>
    </item>
    <item>
      <title>Re: Improving VAR Model Accuracy using VARX with Outlier Dummy Variables</title>
      <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Improving-VAR-Model-Accuracy-using-VARX-with-Outlier-Dummy/m-p/987703#M5062</link>
      <description>&lt;P&gt;In my case, I also only use 2 regions, just like you mentioned.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Btw, I have a follow-up question regarding the implementation in SAS PROC VARMAX. I’m trying to model the dummy variables as exogenous variables. However, when I run the procedure using only my in-sample data (e.g., 100 observations), SAS does not produce any forecast output. I then tried to generate the forecasts manually using the estimated coefficients, but the forecast values gradually became smaller and even turned negative over time. I’m wondering whether I made a mistake in the data setup. Should the dataset already include the future time ID rows (with missing values for Y but fill the future dummy (X) values with 0s)?&lt;/P&gt;</description>
      <pubDate>Mon, 11 May 2026 11:12:08 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Improving-VAR-Model-Accuracy-using-VARX-with-Outlier-Dummy/m-p/987703#M5062</guid>
      <dc:creator>tugasakhir</dc:creator>
      <dc:date>2026-05-11T11:12:08Z</dc:date>
    </item>
    <item>
      <title>Re: Improving VAR Model Accuracy using VARX with Outlier Dummy Variables</title>
      <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Improving-VAR-Model-Accuracy-using-VARX-with-Outlier-Dummy/m-p/987704#M5063</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/481503"&gt;@tugasakhir&lt;/a&gt;&amp;nbsp;wrote:
&lt;P&gt;Should the dataset already include the future time ID rows (with missing values for Y but fill the future dummy (X) values with 0s)?&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Yes!&lt;BR /&gt;See&lt;BR /&gt;Problem Note&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;I&gt;37474:&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/I&gt;Incorrect or no forecasts are produced when future values of exogenous variables are not provided&lt;BR /&gt;&lt;A href="https://support.sas.com/kb/37/474.html" target="_blank"&gt;https://support.sas.com/kb/37/474.html&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;So, just put zeros as&amp;nbsp;&lt;SPAN&gt;future values for your outlier dummies.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;BR, Koen&lt;/P&gt;</description>
      <pubDate>Mon, 11 May 2026 11:34:57 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Improving-VAR-Model-Accuracy-using-VARX-with-Outlier-Dummy/m-p/987704#M5063</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2026-05-11T11:34:57Z</dc:date>
    </item>
    <item>
      <title>Re: Improving VAR Model Accuracy using VARX with Outlier Dummy Variables</title>
      <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Improving-VAR-Model-Accuracy-using-VARX-with-Outlier-Dummy/m-p/987706#M5064</link>
      <description>&lt;P&gt;ok, thankyou so much for your help.&lt;/P&gt;</description>
      <pubDate>Mon, 11 May 2026 11:51:08 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Improving-VAR-Model-Accuracy-using-VARX-with-Outlier-Dummy/m-p/987706#M5064</guid>
      <dc:creator>tugasakhir</dc:creator>
      <dc:date>2026-05-11T11:51:08Z</dc:date>
    </item>
  </channel>
</rss>

