BookmarkSubscribeRSS Feed
samp945
Obsidian | Level 7

Hello all,

 

I have count data (deaths and several other outcomes) for a large US state over a 10-year period. I am using Proc UCM to create plots showing actual vs. predicted count outcomes over this time and how the introduction of the pandemic in late 2019/early 2020 affected these various outcomes over the very short term (months or weeks) and the long-term.

 

I am trying to choose between forecasting counts  vs. rates. The examples I've come across for similar research use counts without mentioning how such forecasts would be affected by a growing population (i.e., certain count outcomes, such as vehicle collisions and crime, would be affected by a growing population).

 

It seems easy enough to use population data to create rates. The problem I see with this is population data are collected only once every 10 years (census). I can get state-level intercensal estimates for each year. But for a monthly time-series analysis, I need monthly population estimates. I can create these by linear interpolation, but the assumption of a linear population increase across yearly estimates is not a good one.

 

So it seems to me that forecasting population counts vs rates both involve known error. Is there a best practice for this sort of analysis? I welcome any thoughts on how to justify one approach vs. the other. Thanks for your consideration.

sas-innovate-2026-white.png



April 27 – 30 | Gaylord Texan | Grapevine, Texas

Registration is open

Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and save with the early bird rate—just $795!

Register now

Discussion stats
  • 0 replies
  • 1597 views
  • 0 likes
  • 1 in conversation