Hello!
Sorry if my question seem basic but i am new to statistical analysis and would to ask which is the best test to determine association between variables and outcome if:
1. I have a small sample size ~15
2. 1 binary independent variable
3. A mixture of continuous and categorical dependent variables
TIA!
With a sample size of ~15, no test will have a lot of power.
For continuous dependent variables, I would use PROC GLM Analysis of Variance (assuming the errors are iid normal), and PROC NPAR1WAY otherwise.
For categorical variables, I would use a contingency table in PROC FREQ and do the Chi-Squared test.
One more choice, based on the small sample size, would be to use the EXACT option in PROC LOGISTIC. However (and this is critical), you need to look at the cross tabulation of your results versus the categorical variables. With this small sample size, you are almost certain to have what is called complete or partial separation. I would suggest removing from the analysis any variables that show complete separation - you know what the outcome will be once you have the value of the independent categorical variable. If you still run into partial separation issues, then you might consider using PROC GENMOD and the alternating logistic regression method outlined here0
SteveDenham
Good point @Ksharp .for the categorical variables. I guess I got caught up in the continuous side, where modeling makes some sense, even with a sample size as small as this.
SteveDenham
@duckypooh wrote:
Hello!
Sorry if my question seem basic but i am new to statistical analysis and would to ask which is the best test to determine association between variables and outcome if:
1. I have a small sample size ~15
2. 1 binary independent variable
3. A mixture of continuous and categorical dependent variables
TIA!
What type of research question is to be answered? Question will point towards choice of test(s).
Are your categorical variables possibly ordinal? That means an order could be determined "Bad neutral Good" as apposed to simple category " Fred Orange XYZZY"?
SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!
ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.
Find more tutorials on the SAS Users YouTube channel.