Hi,
Need clarification:
Clinical papers often quote p-values obtained from multivariate analysis (most would be by logistic regression). Which p-values do we report as multivariate analysis?
Take the stepwise selection method for example, say for 9 variables, there's p-values in the
1. analysis of effects eligible for entry, even at step 0
2. type 3 analysis of effects
3. summary of stepwise selection
Most of the time, the number of significant variables would have reduced in number when we reach the summary table.
Thank u in advance.
Saiful.
And be sure to include the summary of stepwise selection, as this brings out all of the multiple comparisons/multiple testing that went on, and unless adjusted for (LASSO or LAAR) renders the type 3 p values inaccurate.
Steve Denham
The problem with almost all variable selection methods, other than "expert knowledge", is that the estimates are biased, and that the predictive ability of the model is poor. See http://www.lexjansen.com/pnwsug/2008/DavidCassell-StoppingStepwise.pdf.
Currently, the best automated methods seem to be LASSO based--unless you go the neural net/machine learning route, and that leads to a question of interpretability.
Steve Denham
Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!
ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.
Find more tutorials on the SAS Users YouTube channel.