Hello,
I am new to data analysis and need help with the following.
I have a survey data summary table where i have results from state and area of that state. I am interested in yes response
It looks like following for each question
Participants Yes C.I No C.I.
STATE 7450 88% 83%-93% 12% 10% -14%
Area 668 82 80%-84% 18% 16% -20%
How do I calculate statistical difference in above two and decide if its statistically significant?
I can do it by calculating yes and no response number from percentages given and do chi square test but it wont give a robust results.
Really appreciate input and help.
You don't need to perform any statistical calculations. As long as CIs don't overlap, you can conclude that the results are statistically different. In the example in question, the result is statistically significant as CIs don't overlap.That is how governments interpret data. Correct me if I am wrong
Chi Square is robust...and your N is large enough to obtain reliable estimates.
If you want to explore non-parametric methods, look at PROC NPAR1WAY.
@GARYV wrote:
Hello,
I am new to data analysis and need help with the following.
I have a survey data summary table where i have results from state and area of that state. I am interested in yes response
It looks like following for each question
Participants Yes C.I No C.I.
STATE 7450 88% 83%-93% 12% 10% -14%
Area 668 82 80%-84% 18% 16% -20%
How do I calculate statistical difference in above two and decide if its statistically significant?
I can do it by calculating yes and no response number from percentages given and do chi square test but it wont give a robust results.
Really appreciate input and help.
You don't need to perform any statistical calculations. As long as CIs don't overlap, you can conclude that the results are statistically different. In the example in question, the result is statistically significant as CIs don't overlap.That is how governments interpret data. Correct me if I am wrong
HI , Thank you for reply. I only have output table with percentages and C.I.. I don't have all the raw data set to use SURVEYFREQ procedure.
All you need is the counts or you could recreate the data in an approximate fashion if you have the 2x2 numbers.
@GARYV wrote:
HI , Thank you for reply. I only have output table with percentages and C.I.. I don't have all the raw data set to use SURVEYFREQ procedure.
Don't miss out on SAS Innovate - Register now for the FREE Livestream!
Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.
ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.
Find more tutorials on the SAS Users YouTube channel.