The UCM procedure analyzes and forecasts equally-spaced univariate time series data by using an unobserved components model (UCM). The UCMs are also called structural time series models in the time series literature. A UCM decomposes the response series into components such as trend, seasonals, cycles, and the regression effects due to predictor series.
You can use the OUTFOR= option in the FORECAST statement to store the series and component forecasts produced by the procedure. The OUTFOR= dataset will contain many variables (corresponding to the components) with an ‘F_’ prefix and many variables with an ‘S_’ prefix where the ‘F’ is for ‘Filtered’ (or ‘Forecasted’) and the ‘S’ is for ‘Smoothed’. With ODS GRAPHICS you can also produce time series plots of the filtered component estimates, as well as time series plots of the smoothed component estimates.
Users of the UCM procedure will wonder what the difference is between the filtered and smoothed values of the components. That’s what this article is about!
What’s the difference between (Kalman) filtering and (Kalman) smoothing in the context of UCMs?
The UCMs considered in PROC UCM can be thought of as special cases of more general models, called (linear) Gaussian state space models (GSSM). A UCM formulated as a GSSM has essentially two equations. The first equation, called the observation equation, relates the response series y(t) to a state vector α(t) that is usually unobserved. The state vector contains the component estimates and their variance. The state vector is contained and updated in a second (recursive) equation, appropriately called the state transition equation. The state and other elements of the transition equation provide a rule for getting the model and predictions from one interval to the next. Thus, the transition equation describes the evolution of the state vector in time.
The state space formulation of a UCM has many computational advantages. In this formulation there is a convenient algorithm for estimating and forecasting the unobserved states / components (i.e. state vector α(t) at any time t) by using the observed series y(t), being the DKFS algorithm (DKFS = Diffuse Kalman Filtering and Smoothing).
Kalman filtering and Kalman smoothing: What is it, what's the difference and what is it used for?
Kalman filtering-smoothing (KFS) is a fundamental tool in statistical time-series analysis, especially in STATE SPACE MODELLING (SAS/ETS PROC SSM).
Remember, Unobserved Components Models (UCMs) can also be expressed as state space models. In SAS, UCMs (also known as Structural Time Series Models) are formulated as a state space model.
The UCM and SSM procedures use the same Kalman filtering/smoothing algorithms (despite different descriptions in their docs).
The biggest difference between Kalman filtering and Kalman smoothing is that in Kalman filtering the recursive state estimation moves forward through the data and in Kalman smoothing the recursive state estimation moves backward through the data (in the opposite direction of the time variable).
Kalman filtering is a forward pass through the data. Kalman smoothing ADDS a backward pass through the data. Smoothing is thus an add-on to filtering.
Kalman filtering uses all the data up to the current time point and can be done in real-time (given data so far). Kalman smoothing is offline post-processing and uses all the data.
Kalman filtering has two steps: prediction at the current observation using all prior observations followed by update/filtering using the actual current observation.
If your only goal would be forecasting of the response variable Y and the state vector, Kalman filtering is sufficient.
Good reasons for Kalman smoothing are:
For the response variable (Y), the smoothed estimates are identical to the actual values where they are non-missing.
In the PROC UCM output dataset, the component F_* and VF_* variables are coming from the filtering, the component S_* and VS_* variables are coming from the additional smoothing step. (The V is for Variance).
To learn more:
SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!
Data Literacy is for all, even absolute beginners. Jump on board with this free e-learning and boost your career prospects.