BookmarkSubscribeRSS Feed
Dcicantab5
Obsidian | Level 7

Hi,

Need clarification:

Clinical papers often quote p-values obtained from multivariate analysis (most would be by logistic regression). Which p-values do we report as multivariate analysis? 

 

Take the stepwise selection method for example, say for 9 variables, there's p-values in the

1. analysis of effects eligible for entry, even at step 0 

2. type 3 analysis of effects

3. summary of stepwise selection

 

Most of the time, the number of significant variables would have reduced in number when we reach the summary table.

 

Thank u in advance.

 

Saiful.

4 REPLIES 4
Ksharp
Super User
I guess it is "2. type 3 analysis of effects " .
SteveDenham
Jade | Level 19

And be sure to include the summary of stepwise selection, as this brings out all of the multiple comparisons/multiple testing that went on, and unless adjusted for (LASSO or LAAR) renders the type 3 p values inaccurate.

 

Steve Denham

Dcicantab5
Obsidian | Level 7
Hi Steve,
You mean run LASSO under PROC GLMSELECT and then run PROC LOGISTIC based on the selected variables?

To be honest I am not familiar with LASSO but it appears that depending on PROC LOGISTIC alone seems unwise when it comes to variable selection (one depends on intuition and study design and what is so far known about the topic studied)?
SteveDenham
Jade | Level 19

The problem with almost all variable selection methods, other than "expert knowledge", is that the estimates are biased, and that the predictive ability of the model is poor.  See http://www.lexjansen.com/pnwsug/2008/DavidCassell-StoppingStepwise.pdf.

 

Currently, the best automated methods seem to be LASSO based--unless you go the neural net/machine learning route, and that leads to a question of interpretability.

 

Steve Denham

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 4 replies
  • 2797 views
  • 3 likes
  • 3 in conversation