I have a percentile score of how good a hospital is (could be continuous variable, but I categorized it into 5. 0-20%, 21-40%, 41-60%...) and a sensitivity result of how accurately a hospital reports an event (0 to 1, which can also be a percent if x100). I was told since it is not uniformly distributed, we have to use nonparametric analysis, which my graduate program did not go over much. What test would you do for above analysis (for if percentile score of how good a hospital is continuous and categorical)?
Can the Wilcoxon rank sum test be used when a variable has 5 categories and another variable continuous?
If yes, how can I interpret it? So far the internet is mainly showing 2x2.
Can Spearman be used with the above variables?
If you are only interested in testing for a linear association between the two continuous variable (on the percent or proportion scale), then yes, you could use the Pearson or Spearman correlation in PROC CORR to get that test. Do not categorize either variable as that just wastes information. If you think the association might be important, but not necessarily linear, then you could fit a fractional logit model as described in this note. With that model you could even use a spline on the predictor (accuracy measure) to allow for an indeterminate shape on the association. That is done by specifying the SPLINE effect in the EFFECT statement in PROC GLIMMIX or PROC LOGISTIC.
If you are only interested in testing for a linear association between the two continuous variable (on the percent or proportion scale), then yes, you could use the Pearson or Spearman correlation in PROC CORR to get that test. Do not categorize either variable as that just wastes information. If you think the association might be important, but not necessarily linear, then you could fit a fractional logit model as described in this note. With that model you could even use a spline on the predictor (accuracy measure) to allow for an indeterminate shape on the association. That is done by specifying the SPLINE effect in the EFFECT statement in PROC GLIMMIX or PROC LOGISTIC.
Registration is now open for SAS Innovate 2025 , our biggest and most exciting global event of the year! Join us in Orlando, FL, May 6-9.
Sign up by Dec. 31 to get the 2024 rate of just $495.
Register now!
ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.
Find more tutorials on the SAS Users YouTube channel.