Statistical Procedures

Dcicantab5 · Posted 07-09-2016 01:26 PM

Hi,

Need clarification:

Clinical papers often quote p-values from multivariate analysis (mostly logistic regression). Which p-values should we report as the multivariate analysis results?

Take the stepwise selection method, for example. With, say, 9 variables, there are p-values in:

1. analysis of effects eligible for entry, even at step 0

2. type 3 analysis of effects

3. summary of stepwise selection

Most of the time, the number of significant variables has dropped by the time we reach the summary table.

Thank you in advance.

Saiful.

Ksharp · Posted 07-09-2016 10:12 PM

I guess it is "2. type 3 analysis of effects".

SteveDenham · Posted 07-11-2016 02:10 PM

And be sure to include the summary of stepwise selection, as it brings out all of the multiple comparisons/multiple testing that went on. Unless that is adjusted for (LASSO or LAR), it renders the type 3 p-values inaccurate.

Steve Denham
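A rough illustration of the multiple-testing point above, as a minimal Python sketch. It assumes, purely for simplicity, that the 9 candidate variables are all noise and that their tests are independent at alpha = 0.05; real stepwise tests are correlated, so treat the numbers as indicative only. The function names are made up for this example. Under those assumptions, the chance that at least one variable looks "significant" is 1 - 0.95^9, or about 0.37, not 0.05.

```python
import random


def familywise_error(alpha: float, k: int) -> float:
    """Chance that at least one of k independent null tests has p < alpha."""
    return 1.0 - (1.0 - alpha) ** k


def simulate_min_p(k: int, alpha: float, n_sims: int, seed: int = 0) -> float:
    """Fraction of simulated studies whose smallest of k null p-values is below alpha.

    Under the null hypothesis each p-value is uniform on [0, 1], so we draw
    k uniforms per study and check whether the best one passes the threshold.
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_sims):
        if min(rng.random() for _ in range(k)) < alpha:
            hits += 1
    return hits / n_sims


if __name__ == "__main__":
    # Closed-form rate for 9 noise variables tested at the 0.05 level.
    print(round(familywise_error(0.05, 9), 3))
    # Simulated rate; should land close to the closed-form value.
    print(round(simulate_min_p(9, 0.05, 20000), 3))
```

This is why the type 3 p-values from the final model read as more convincing than they are: they do not account for the search that produced the model.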

Dcicantab5 · Posted 07-12-2016 02:55 AM

Hi Steve,
Do you mean running LASSO under PROC GLMSELECT and then running PROC LOGISTIC on the selected variables?

To be honest, I am not familiar with LASSO, but it appears that relying on PROC LOGISTIC alone for variable selection is unwise, and that one should instead depend on intuition, study design, and what is already known about the topic?

SteveDenham · Posted 07-13-2016 01:25 PM

The problem with almost all variable selection methods, other than "expert knowledge", is that the estimates are biased, and that the predictive ability of the model is poor. See http://www.lexjansen.com/pnwsug/2008/DavidCassell-StoppingStepwise.pdf.

Currently, the best automated methods seem to be LASSO-based, unless you go the neural net/machine learning route, which raises the question of interpretability.

Steve Denham
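The bias mentioned above can be seen in a toy simulation. This is only a sketch under strong assumptions, not a model of what PROC LOGISTIC does: a single coefficient with a hypothetical true value of 0.2 and standard error 0.1, whose estimate is normally distributed and is "selected" only when |z| > 1.96. The function name and all numbers are invented for illustration.

```python
import random


def mean_estimate_after_selection(true_beta, se, n_sims, z_crit=1.96, seed=0):
    """Return (mean of the estimates that passed the significance filter,
    fraction of simulations that passed).

    Each simulated study draws one estimate from Normal(true_beta, se) and
    keeps it only if its z-statistic exceeds z_crit in absolute value.
    """
    rng = random.Random(seed)
    kept = []
    for _ in range(n_sims):
        estimate = rng.gauss(true_beta, se)
        if abs(estimate / se) > z_crit:
            kept.append(estimate)
    return sum(kept) / len(kept), len(kept) / n_sims


if __name__ == "__main__":
    mean_kept, share_kept = mean_estimate_after_selection(0.2, 0.1, 20000)
    print("true coefficient:        0.2")
    print("mean of kept estimates: ", round(mean_kept, 3))
    print("share of studies kept:  ", round(share_kept, 3))
```

The kept estimates average noticeably above the true 0.2, because only the draws that happened to come out large survive the filter. Any selection rule keyed to significance has this effect to some degree.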

Which p-values to report as part of multivariate analysis in proc logistic?
