Below is a chart of proportions over time.
I’d like to add confidence intervals and a trend line through data such as this one. From articles I’ve seen, there seem to be two ways of doing this:
1. calculate the CIs for the observed proportions
2. run a model, and have the trend lines be predicted probabilities from the model and the 95% confidence intervals would go around the predicted probabilities. An example of this approach is in this article - "Columns indicate observed success rates. Points and error bars indicate model-predicted means and 95% confidence intervals, respectively." from https://www.biorxiv.org/content/10.1101/232868v2.full
My question: in a standard reports where there will be many of these charts shown, is one of these a "better" way? I ran a model because I also wanted to test if the trends were statistically significant, and the model also adjusted for clustering. Is this a good enough reason to take approach #2?
Any advice would be much appreciated.
The second plot is sometimes called a "dynamite plot." Data visualization experts generally agree that dynamite plots are not the best way to summarize a d...
You can read more about alternative visualizations in the article "Remaking a panel of dynamite plots."
I think it is a good idea to add CI's for the observed proportion. Be sure to specify on the graph what the intervals represent because there are several ways to visualize uncertainty in point estimates.
It is not clear to me that a trend line from a predictive model is the right way to get those CIs, but you know your data better than I do.
The second plot is sometimes called a "dynamite plot." Data visualization experts generally agree that dynamite plots are not the best way to summarize a d...
You can read more about alternative visualizations in the article "Remaking a panel of dynamite plots."
I think it is a good idea to add CI's for the observed proportion. Be sure to specify on the graph what the intervals represent because there are several ways to visualize uncertainty in point estimates.
It is not clear to me that a trend line from a predictive model is the right way to get those CIs, but you know your data better than I do.
Thank you, Rick! This is very helpful.
Registration is now open for SAS Innovate 2025 , our biggest and most exciting global event of the year! Join us in Orlando, FL, May 6-9.
Sign up by Dec. 31 to get the 2024 rate of just $495.
Register now!
ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.
Find more tutorials on the SAS Users YouTube channel.