Dear Colleagues,
I am running a logistic model where the dependent variable is the logit of the probability of a patient being diagnosed with COVID-19. The dataset comprises 4,908 patients, of whom 1,729 tested positive for COVID-19. The dataset includes demographic and clinical information about the patients and information on the predominant variant (original, Gamma, Delta, and Omicron) at the time these patients sought care. In the final model, we identified three statistically significant interactions between the symptoms (runny nose, sore throat, and loss of smell) and the variant type.
I would like to know how to calculate these interactions' power.
Thank you in advance.
Best regards,
Iuri
Dear Dave,
I submitted a paper, and one of the reviewers raised a concern about the lack of a reported power calculation for the sample size: “No formal power calculation/sample size is reported. Given the number of predictors and interactions, it would be helpful to know if the study was adequately powered.”
I would like to discuss the best approach to calculate the power. In fact, we did not select a sample; we used data from all patients who sought assistance at a Primary Care Unit.
Therefore, I would like to find the best way to solve this problem.
Regard
So you do apparently want statistical power. Power and sample size analysis can be done in PROC POWER for a planned study to be analyzed using a logistic model. See the example in the PROC POWER documentation. Note that PROC POWER is for prospective power analysis, not for a retrospective study that has already been designed and analyzed. Results from such a prospective analysis done in advance of you collecting data is apparently what your reviewer is asking for.
Here is a webpage on power analysis of regression models with interactions that I have found: Power Analyses for Interaction Effects in Cross-Sectional Regressions • InteractionPoweR.
By the way, the analysis of power is a rather tricky aspect of regression. Research on this field has generally be confined to very simple cases. For instance, despite the popularity of comparing three and more survival curves in survival analysis, the version of SAS I am using (9.4 M5) only supports calculation of power of the test of comparing of two survival curves. So it would be better if you first searched for the method suitable for your problem on the Internet and then posted them here for questions of the implementation of this method in SAS. Also, be prepared that maybe there isn't a solution to your problem yet.
Don’t miss the livestream kicking off May 7. It’s free. It’s easy. And it’s the best seat in the house.
Join us virtually with our complimentary SAS Innovate Digital Pass. Watch live or on-demand in multiple languages, with translations available to help you get the most out of every session.
ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.
Find more tutorials on the SAS Users YouTube channel.