Hello,
I am new to data analysis and need help with the following.
I have a survey data summary table where i have results from state and area of that state. I am interested in yes response
It looks like following for each question
Participants Yes C.I No C.I.
STATE 7450 88% 83%-93% 12% 10% -14%
Area 668 82 80%-84% 18% 16% -20%
How do I calculate statistical difference in above two and decide if its statistically significant?
I can do it by calculating yes and no response number from percentages given and do chi square test but it wont give a robust results.
Really appreciate input and help.
You don't need to perform any statistical calculations. As long as CIs don't overlap, you can conclude that the results are statistically different. In the example in question, the result is statistically significant as CIs don't overlap.That is how governments interpret data. Correct me if I am wrong
Chi Square is robust...and your N is large enough to obtain reliable estimates.
If you want to explore non-parametric methods, look at PROC NPAR1WAY.
@GARYV wrote:
Hello,
I am new to data analysis and need help with the following.
I have a survey data summary table where i have results from state and area of that state. I am interested in yes response
It looks like following for each question
Participants Yes C.I No C.I.
STATE 7450 88% 83%-93% 12% 10% -14%
Area 668 82 80%-84% 18% 16% -20%
How do I calculate statistical difference in above two and decide if its statistically significant?
I can do it by calculating yes and no response number from percentages given and do chi square test but it wont give a robust results.
Really appreciate input and help.
You don't need to perform any statistical calculations. As long as CIs don't overlap, you can conclude that the results are statistically different. In the example in question, the result is statistically significant as CIs don't overlap.That is how governments interpret data. Correct me if I am wrong
HI , Thank you for reply. I only have output table with percentages and C.I.. I don't have all the raw data set to use SURVEYFREQ procedure.
All you need is the counts or you could recreate the data in an approximate fashion if you have the 2x2 numbers.
@GARYV wrote:
HI , Thank you for reply. I only have output table with percentages and C.I.. I don't have all the raw data set to use SURVEYFREQ procedure.
Don’t miss the livestream kicking off May 7. It’s free. It’s easy. And it’s the best seat in the house.
Join us virtually with our complimentary SAS Innovate Digital Pass. Watch live or on-demand in multiple languages, with translations available to help you get the most out of every session.
ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.
Find more tutorials on the SAS Users YouTube channel.