BookmarkSubscribeRSS Feed
samp945
Obsidian | Level 7

Hello all,

 

I have count data (deaths and several other outcomes) for a large US state over a 10-year period. I am using Proc UCM to create plots showing actual vs. predicted count outcomes over this time and how the introduction of the pandemic in late 2019/early 2020 affected these various outcomes over the very short term (months or weeks) and the long-term.

 

I am trying to choose between forecasting counts  vs. rates. The examples I've come across for similar research use counts without mentioning how such forecasts would be affected by a growing population (i.e., certain count outcomes, such as vehicle collisions and crime, would be affected by a growing population).

 

It seems easy enough to use population data to create rates. The problem I see with this is population data are collected only once every 10 years (census). I can get state-level intercensal estimates for each year. But for a monthly time-series analysis, I need monthly population estimates. I can create these by linear interpolation, but the assumption of a linear population increase across yearly estimates is not a good one.

 

So it seems to me that forecasting population counts vs rates both involve known error. Is there a best practice for this sort of analysis? I welcome any thoughts on how to justify one approach vs. the other. Thanks for your consideration.

sas-innovate-white.png

Our biggest data and AI event of the year.

Don’t miss the livestream kicking off May 7. It’s free. It’s easy. And it’s the best seat in the house.

Join us virtually with our complimentary SAS Innovate Digital Pass. Watch live or on-demand in multiple languages, with translations available to help you get the most out of every session.

 

Register now!

Discussion stats
  • 0 replies
  • 1405 views
  • 0 likes
  • 1 in conversation