<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Size of holdout sample in forecasting in SAS Forecasting and Econometrics</title>
    <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Size-of-holdout-sample-in-forecasting/m-p/113528#M620</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I am forecasting time series data with hpfdiagnose and I'm running into a problem with the size of my holdout sample. The code runs quickly (&amp;lt;20 sec) when my holdout sample is 5-9% of data, but the forecasting starts to take incredibly long when I increase the size of my holdout sample to higher values (e.g., 10-20%).&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;What do you think exaplains this non-linear relationship between sample size and running time? Is there something that I can do about it?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Best,&lt;/P&gt;&lt;P&gt;JMC&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Fri, 16 Aug 2013 18:21:53 GMT</pubDate>
    <dc:creator>JMC</dc:creator>
    <dc:date>2013-08-16T18:21:53Z</dc:date>
    <item>
      <title>Size of holdout sample in forecasting</title>
      <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Size-of-holdout-sample-in-forecasting/m-p/113528#M620</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I am forecasting time series data with hpfdiagnose and I'm running into a problem with the size of my holdout sample. The code runs quickly (&amp;lt;20 sec) when my holdout sample is 5-9% of data, but the forecasting starts to take incredibly long when I increase the size of my holdout sample to higher values (e.g., 10-20%).&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;What do you think exaplains this non-linear relationship between sample size and running time? Is there something that I can do about it?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Best,&lt;/P&gt;&lt;P&gt;JMC&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 16 Aug 2013 18:21:53 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Size-of-holdout-sample-in-forecasting/m-p/113528#M620</guid>
      <dc:creator>JMC</dc:creator>
      <dc:date>2013-08-16T18:21:53Z</dc:date>
    </item>
    <item>
      <title>Re: Size of holdout sample in forecasting</title>
      <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Size-of-holdout-sample-in-forecasting/m-p/113529#M621</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hello -&lt;/P&gt;&lt;P&gt;In a way I find your findings counterintuitive, as I would expect faster run times when increasing the holdout sample values.&lt;/P&gt;&lt;P&gt;Would you mind to share your code and some test data to replicate your findings?&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Udo&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 19 Aug 2013 14:54:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Size-of-holdout-sample-in-forecasting/m-p/113529#M621</guid>
      <dc:creator>udo_sas</dc:creator>
      <dc:date>2013-08-19T14:54:31Z</dc:date>
    </item>
    <item>
      <title>Re: Size of holdout sample in forecasting</title>
      <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Size-of-holdout-sample-in-forecasting/m-p/113530#M622</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hello Udo,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thank you for your response. Unfortunately, my data is proprietary so I cannot share it. However, I have found the following.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;My time series data is based on daily data. When seasonality=365, the size of the holdout sample has a large impact on how long the process takes. When seasonality=7, this no longer occurs. Why I had originally set seasonality =365 is because my data has this interesting pattern where every year there is an almost clockwork-like increase of values from the previous years. In other words, the pattern of data within a year are almost perfectly replicated the following year, but their absolute values are all higher relative to the previous year. I found that by setting seasonality=365, I was able to forecast this yearly step-wise function. With seasonality=7, it doesn't work.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Do you have any suggestions for how I could model seasonality=7 and still get the yearly jumps? I've tried adding a year regressor, but this doesn't seem to be doing the trick.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Best,&lt;/P&gt;&lt;P&gt;Juan Manuel&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 22 Aug 2013 15:40:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Size-of-holdout-sample-in-forecasting/m-p/113530#M622</guid>
      <dc:creator>JMC</dc:creator>
      <dc:date>2013-08-22T15:40:12Z</dc:date>
    </item>
    <item>
      <title>Re: Size of holdout sample in forecasting</title>
      <link>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Size-of-holdout-sample-in-forecasting/m-p/113531#M623</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hello Juan Manuel -&lt;/P&gt;&lt;P&gt;Since your data is on daily frequency, seasonality=7 seems to be the right choice.&lt;/P&gt;&lt;P&gt;Of course this does not address your question about modeling the second seasonality you have discovered in your data.&lt;/P&gt;&lt;P&gt;I certainly understand your concern about proprietary data, but without seeing the data my advise has to stick to conceptual ideas only. &lt;/P&gt;&lt;P&gt;When you say that "there is an almost clockwork-like increase of values from the previous years.", would you describe this pattern as a monthly cycle or a weekly cycle - or do you see level shifts across several years?&lt;/P&gt;&lt;P&gt;What I'm getting at is the fact the you should be able to define either discrete events such as &lt;SPAN style="mso-fareast-font-family: 'Times New Roman';"&gt;calendar events Jan-Dec or Week1-Week52 to model this effect. Alternatively you may want to introduce an adjustment variable which mimics the level shifts. Yet another approach might be to model your data in a 2 step manner: model on daily frequency, model on monthly frequency, and then reconcile both forecasts using the HPFTEMPRECONCILE procedure.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="mso-fareast-font-family: 'Times New Roman';"&gt;Hope this is useful.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="mso-fareast-font-family: 'Times New Roman';"&gt;Thanks,&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="mso-fareast-font-family: 'Times New Roman';"&gt;Udo&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="mso-fareast-font-family: 'Times New Roman';"&gt; &lt;/SPAN&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 23 Aug 2013 17:50:19 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Forecasting-and-Econometrics/Size-of-holdout-sample-in-forecasting/m-p/113531#M623</guid>
      <dc:creator>udo_sas</dc:creator>
      <dc:date>2013-08-23T17:50:19Z</dc:date>
    </item>
  </channel>
</rss>

