I want to test the equality of 2 proportions, but I want to do it for each level of a nominal, polychotomous variable. For example, gender (dichotomous) and race (polychotomous). I want to compare the proportion of males that are caucasian, for example, to the proportion of females that are caucasian, and so on. I want the same comparison for each level of the polychotomous variable race. If there are 5 levels of race, I want 5 comparisons w/ p-values, difference in proportion and CI of the difference.
I understand that if the 2 variables are dichotomous I can use PROC FREQ w/ the CHISQ and RISKDIFF options for the statistical significance, the difference between proportions and the CI for that difference, but is there a way to perform this analysis without having to dichotomize a polychotomous variable?
Thanks!!
Your BY variables shouldn't be in the TABLES statement
Include the polychotomous variable as a BY variable.
When I do that I get the following NOTES in the log:
NOTE: No statistics are computed for race_ethn * gender because all data are missing.
NOTE: The above message was for the following BY group:
race_ethn=.
NOTE: No statistics are computed for race_ethn * gender because race_ethn has less than 2 nonmissing levels.
NOTE: The above message was for the following BY group:
race_ethn=Caucasian
NOTE: No statistics are computed for race_ethn * gender because race_ethn has less than 2 nonmissing levels.
NOTE: The above message was for the following BY group:
race_ethn=Black/African American/Hispanic
NOTE: No statistics are computed for race_ethn * gender because race_ethn has less than 2 nonmissing levels.
NOTE: The above message was for the following BY group:
race_ethn=Native American/Asian/Pacific Islander
NOTE: No statistics are computed for race_ethn * gender because race_ethn has less than 2 nonmissing levels.
NOTE: The above message was for the following BY group:
race_ethn=Other
NOTE: There were 2774 observations read from the data set SURVEY.CSCSP_2016_POST_SORT.
Your BY variables shouldn't be in the TABLES statement
Possion regression might do this. proc genmod + offset= option
Don't miss out on SAS Innovate - Register now for the FREE Livestream!
Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.
ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.
Find more tutorials on the SAS Users YouTube channel.