About rselukar

rselukar · ‎09-13-2016

I think for this type of problem PROC TIMESERIES (SAS/ETS: http://support.sas.com/documentation/cdl/en/etsug/68148/HTML/default/viewer.htm#etsug_timeseries_toc.htm) and DATA STEP might be a better match. PROC TIMSERIES can take different time series that are recorded at different time instances and put them on a uniform time grid of your choice. It has many options to "fill" the gaps (the ACCUMULATE option) with suitable values, e.g., for your sensor data zero (indicating absence) might be a possible choice. After a data set of all uniformly "filled" series is created, you can do a variety of operation on these columns (summing, and/or, ...) via DATA step. PROC SSM could also be used for the same data preparation step if you want to do a more model based interpolation/extrapolation. For your simpler setup, PROC TIMESERIES might be sufficient and will be whole lot faster.

rselukar · ‎09-10-2016

I was not aware of the traces package. Taking a quick look at the package link I realize that you are interested in more basic analysis of unevenly spaced timeseries. SSM procedure is designed for model based analysis of such timeseries and you can do rather sofisticated analysis of such data. Without going into too many details, I am going to explain how to use the SSM procedure to do basic interpolation/extrapolation of an unevenly spaced timeseries. Assume that your input data set, test, has two columns time and y. The the time column contains the times associated with the measurements y (the times need not be evenly spaced, in fact, you can even have multiple measurements at the same time point). It is also assumed that the data set test is sorted according time, and the time points at which you want interpolated/extrapolated values of y are included in the data set with corresponding y missing. So the first few rows of the data set might look something like this: time y 1.2 . 1.8 -2.6 2.0 . 8.3 1.3 Note y values at time points 1.2 and 2.0 are missing. You can obtain a smooth interpolation of y values as follows: proc ssm data=test; id time; trend scurve(ps(2)) checkbreak; irregular noise; model y = scurve noise / print=smooth; output out=for press; run; The interpolated/extrapolated values of y are printed. SSM procedure stores the estimate of the smoothed curve (called smoothed_scurve) in the output data set. You can plot it and see the fit as follows: proc sgplot data=for; series x=time y=smoothed_scurve; scatter x=time y=y; run; Note that you can specify many more interesting models (see the examples mentioned earlier), use predictor info if available, the CHECKBREAK option in the TREND statement identifies possible locations of abrupt changes in the smoothed curve, and lot more things.

rselukar · ‎08-18-2016

The syntax of PROC SSM is more complex than ARIMA and you might need some time to get used to it. If you need syntax help for converting any ARIMA spec (including transfer function type terms) contact me separately and I will help you.

rselukar · ‎08-18-2016

We cannot let you go back to R without some fight (just kidding)! You can use PROC SSM (as shown below) to get what you want. This procedure is for linear state space modeling (ARIMA models are state space models). See the doc at http://support.sas.com/documentation/cdl/en/etsug/68148/HTML/default/viewer.htm#etsug_ssm_syntax.htm In the code illustration below I have added an MA(1) term to the model since you said your models might have MA terms. Moreover, you can also specify the error variance from your calibrated model. data inputs; input x var1 var2 var3 var4 var5; datalines; 20 5 2 4 5 4 25 12 56 13 44 4 20 5 2 4 5 4 25 12 56 13 44 4 20 5 2 4 5 4 25 12 56 13 44 4 . 2 5 6 5 4 ; /* Without known AR error variance*/ proc ssm data=inputs; trend ar(arma(p=1 q=1)) ar=0.9 ma=0.3; cin = 0.1*var1 + 0.2*var2 + 0.3*var3 + 0.4*var4 + 0.4*var5; state tf(1) sinput=(cin); comp tfTerm = tf[1]; model x = tfTerm ar / print=smooth; output out=for pdv; run; /* With known AR error variance=10 say*/ proc ssm data=inputs; trend ar(arma(p=1 q=1)) ar=0.9 ma=0.3 variance=10; cin = 0.1*var1 + 0.2*var2 + 0.3*var3 + 0.4*var4 + 0.4*var5; state tf(1) sinput=(cin); comp tfTerm = tf[1]; model x = tfTerm ar / print=smooth; output out=for pdv; run;

rselukar · ‎08-17-2016

ARIMA does exit if the number of non-missing observations are less than or equal to the number of parameters in the model whether parameters are known or not. This behavior is known. This use case scenario is somewhat uncommon and ARIMA does not handle it. As you have said, you could add some rows (possibly artificial data) at the beginning to take care of this scenario. By the way, for these types of scenarios Forecast Server offers specialized scoring functionality for ARIMA models that could be of interest to you.

rselukar · ‎07-07-2016

The differencing of "It" must be done in the identify statement: identify var=y crosscorr=It(5); estimate input=( /(1) It ); run; ARIMA will estimate theta0, omega0, and delta1 for you.

rselukar · ‎06-06-2016

You can get the residuals by using the FORECAST statement as follows: forecast outfor=out; The "residual" column in the "out" data set contains the residuals (see the details about OUTFOR= data set at http://support.sas.com/documentation/cdl/en/etsug/68148/HTML/default/viewer.htm#etsug_ucm_details38.htm). By the way, I am not sure what you mean by the "residuals" for various components. It is true that the defining equations for different components do have "disturbance" terms. The estimates of these disturbance terms are NOT output anywhere in the UCM procedure. On the other hand, you do get the estimates of these components (e.g., level, slope, cycle, season, irregular, etc.) themselves, and the variance parameters of these disturbance terms. Please go over the UCM doc carefully as well as have some good book (see the reference section of the doc) handy.

rselukar · ‎03-17-2016

Slope is the rate of change of the level component. The level component tracks the series pattern. You say the series is decreasing so the level component will be decreasing and since it is decreasing the slope will be negative. Series can be decreasing without being negative so the level need not be negative.

rselukar · ‎03-09-2016

Unfortunately, PROC ARIMA will not be able to impose this restriction on thetas. I think your theta_0 can be absorbed in the variance parameter of eta. In ARIMA, theta1 and theta2 (scaled by theta0) satisfy the invertibility condition.

rselukar · ‎03-09-2016

Hello, There is no problem with using BY groups in PROC ARIMA. In order to obtain ML estimates of the MA parameters you must use method=ml option in the estimate statement. However, I do not understand your question very well. It is unclear what you mean by "desmoothing" using the estimated MA parameters. Assuming "returns" are your observed returns, you are fitting a returns = constant + MA(2) model. If you want, (returns - estimated constant) can surve as an estimate of the MA part. Is this what you want? By the way, your mathematical equation does not conform to your ARIMA model specification. Do you mean the unobserved R_t acts like the white noise part of the ARIMA model? ARIMA assumes theta_0 to be 1. There is another procedure, PROC UCM, in SAS/ETS that might be more appropriate for decomposing your returns into a slow moving smooth part and a rougher noise part. See an example of trend removal using HP filter: https://support.sas.com/documentation/cdl/en/etsug/68148/HTML/default/viewer.htm#etsug_ucm_examples05.htm You can also get a decomposition based on an MA(2) component (see the syntax for IRREGULAR component: https://support.sas.com/documentation/cdl/en/etsug/68148/HTML/default/viewer.htm#etsug_ucm_syntax11.htm). UCM is a very general purpose procedure to obtain such decompositions. It supports BY processing also. If you want even more customization, you can use the SSM procedure (https://support.sas.com/documentation/cdl/en/etsug/68148/HTML/default/viewer.htm#etsug_ssm_gettingstarted.htm). However, SSM procedure usually requires more coding. Hope this helps. Hope this helps. Rajesh

rselukar · ‎11-25-2015

Just include a level-shift variable that is zero before the event and 1 at and after the event in the input data set. Use this variable as a regressor. See the Nile level break detection example in the UCM doc.

rselukar · ‎11-19-2015

In this post I will try to address your UCM questions so far. First some general comments: Modeling and forecasting a time series is not easy without some understanding of the series being modeled. Very often several models can be proposed that appear to to fit the historical data reasonably well (his is true of both ARIMA models and UCMs). Model diagnostics (such as residual analysis) is useful but still requires context to decide which of the discovered features of the model are real and which might not be so. Cross-validation type methods, which are very effective in addressing overfitting in the ordinary regression modeling are not as effective in the time series setting. The policy about the handling of the outliers discovered during the exploratory stage is also not quite clear cut and (again) requires context info. In light of these, my personal preference is to try simple models that fit the data reasonably well and not to try to overfit the historical region. Outliers are left unhandled unless they distort the main features (such as trend) of the series. Without additional context, the model given at the end of this post seems adequate to me. Of course, whether the discovered cycle (of period 13 years) is "real" or not cannot be answered without domain info. Now answers to your specific questions: 1. Negative R-square: The R-square in usual ordinary regression is based on "regression residuals" (Y - X beta-hat). The UCM R-square is based on "one-step-ahead" residuals. One-step-ahead residual at a particular time is based on data prior to that time point. Therefore, UCM R-square is not guaranteed to be non-negative (this is mentioned in the UCM doc). Moreover, when the UCM model contains dummy regressors, very often only a few non-missing one-step-ahead residuals are available for residual analysis. This is because non-missing residuals are availble only after adequate number of observations are processed to initialize the diffuse components (which include regressors) in the model. All of your models suffer from this condition of inadequate number of non-missing residuals for residual analysis. 2. You can use the OUTFOR= option in the FORECAST statement to output series forecasts, residuals (their standard errors) and many other things. UCM provides rich graphical support for residual analysis (as you have noticed). If you want to compute some of the statistics you mention by hand, you can use the OUTFOR data set and use PROC IML or PROC UNIVARIATE. My suggested program: proc ucm data=metals; model ZI; irregular; level variance=0 noest checkbreak; slope; cycle plot=smooth; estimate plot=panel; forecast plot=decomp; run;

rselukar · ‎10-29-2015

A constant or trend is NOT included in a UCM model by default. Similarly, an error component (IRREGULAR) is also NOT included by default. This is different from other procedures such as PROC REG or PROC ARIMA. Most often, the modeling of trend is the main purpose so it did not seem reasonable to include a particular type of trend (even a constant) as default. Of course, this type of design decision is a matter of taste. At the time of UCM development, I decided that the simple REG case is rare.

rselukar · ‎09-16-2015

%let NumFactor1 = .5; %let NumFactor2 = -.6; data test_i; input y x date; dx = dif(x); ldx = lag(dx); lx = lag(x); ly = lag(y); cards; 1 14 1 1 13.1 2 2 12 3 2 11.5 4 4 10 5 4 9.9 6 4 8 7 5 7 8 6 6.2 9 . 5 10 . 4 11 . 3.3 12 . 2 13 . 1 14 ; run; data test_d; input y x date; dx = dif(x); ldx = lag(dx); lx = lag(x); ly = lag(y); cards; 6 14 1 6 13.1 2 5 12 3 4 11.5 4 3 10 5 2 9.9 6 2 8 7 1 7 8 1 6.2 9 . 5 10 . 4 11 . 3.3 12 . 2 13 . 1 14 ; run; 　 proc arima data=test_i plots=none; title "Increasing Y"; identify var=y(1) crosscorr=( x(1) ) noprint CLEAR;* CENTER; estimate input =( (1)x ) initval =( &NumFactor1.$(&NumFactor2.)x ) noest NOINT; forecast id=date BACK=0 lead=5 out=out_test_increase printall; run; quit; data test_i; set test_i; retain tmp 0; if _n_ <= 2 then ldx = -0.9; if _n_ <= 1 then dx = -0.9; tfInput = &NumFactor1.*dx - &NumFactor2.*ldx; if ly ^= . then forecast = ly + tfInput; else forecast = tmp + tfInput; tmp = forecast; run; proc print data=test_i; var y ly tfInput forecast; run; 　 proc arima data=test_d plots=none; title "Decreasing Y"; identify var=y(1) crosscorr=( x(1) ) CLEAR;* CENTER; estimate input =( (1)x ) initval =( &NumFactor1.$(&NumFactor2.)x ) noest NOINT; forecast id=date BACK=0 lead=5 out=out_test_decrease printall; run; quit; 　 data test_d; set test_d; retain tmp 0; if _n_ <= 2 then ldx = -0.9; if _n_ <= 1 then dx = -0.9; tfInput = &NumFactor1.*dx - &NumFactor2.*ldx; if ly ^= . then forecast = ly + tfInput; else forecast = tmp + tfInput; tmp = forecast; run; proc print data=test_d; var y ly tfInput forecast; run; I am not quite sure I understand your question but here is what I make of it: I am ignoring the CENTER option in your ARIMA code for simplicity. Your model spec is: identify var=y(1) crosscorr=( x(1) ); estimate input =( (1)x) initval =( &NumFactor1.$(&NumFactor2.)x) noest NOINT; The forecast function for this is: tfInput = NumFactor1*dif(x) - NumFactor2*lag(dif(x)). forecast = lag(y) + tfInput when lag(y) is available = lag(forecast) + tfInput. This does depend on y (and not just on x). *************Verification code attached***************;

rselukar · ‎09-11-2015

1. The combination of LEVEL and SLOPE statements can approximate almost any smooth data pattern (including quadratic). However, the generated forecast function (out-of-sample) is constant or linear. This is adequate in most situations. If you want the forecast function to be quadratic (or higher), you will have to use ARIMA type specification, which can be accomplished by a combination of DEPLAG (to specify differencing) and IRREGULAR (to set AR and MA orders) statements. See the last example in the UCM doc. Of course, you can also use PROC ARIMA. 2. You should not include CYCLE like components in your model without good reason. It is good to keep a few good time series books handy to get better idea about time series modeling.

Online Status	Offline
Date Last Visited	‎12-22-2023 04:12 PM

Re: proc ssm and sarimax

Re: proc ssm and sarimax

Re: proc ssm and sarimax

Re: ARIMA differences in R and SAS

Re: ARIMA differences in R and SAS

Re: Command for Dynamic Factor Analysis?

Re: Dynamic Factor Model for the Yield Curve with exogenous variable

Re: Dynamic Factor Model for the Yield Curve with exogenous variable

Re: Dynamic Factor Model for the Yield Curve with exogenous variable

Re: Proc SSM referencing variable limit based on by group

Re: Command for Dynamic Factor Analysis?

Re: Command for Dynamic Factor Analysis?

Re: proc ssm and sarimax

Re: ARIMA differences in R and SAS

Re: ARIMA differences in R and SAS

Re: Simple analyses on unevenly-spaced timeseries data without convert...

Re: Simple analyses on unevenly-spaced timeseries data without convert...

Re: Why does proc arima with NoEst throw 'There is not enough data to ...

Re: Why does proc arima with NoEst throw 'There is not enough data to ...

Re: Why does proc arima with NoEst throw 'There is not enough data to ...

Re: specifying transfer function in proc arima

Re: How do I retrieve UCM residuals?

Re: Unobserved Components Models and Trend Term

Re: MA(2)

Re: MA(2)

Re: Unobserved Components Model Model Diagnostic

Re: Unobserved Components Model Model Diagnostic

Re: Unobserved Components Model- Inclusion of Trend/Constant

Re: forecasting equation in PROC ARIMA

Re: Unobserved Components Model

SAS Global Forum 2017