BookmarkSubscribeRSS Feed
sassygrl
Calcite | Level 5
Using Proc Reg and Proc GLMselect, I am encountering an issue where the variables that were selected are no longer significant when I re-run the model using only those variables. Example code would be as follows:

proc glmselect data=indata;
model y = / selection=stepwise choose=validate;
run; quit;

proc glm data=indata;
model y = ;
run; quit;

I have tried varying the "selection" (stepwise, lasso, etc. ) and "choose" options, listing the variables in the order in which they were selected, ss1, ss3, all to no avail. Any insights would be greatly appreciated. Thank you!
1 REPLY 1
lvm
Rhodochrosite | Level 12 lvm
Rhodochrosite | Level 12
Stepwise selection procedures do not necessarily select just "significant" terms in a classical sense. In fact, tests of significance within the stepwise procedures are quite controversial, and p values may not have simple meanings (the extreme multiple testing causes all kinds of problems). Many stepwise selection methods are liberal, so that possible models are not discarded too readily. Thus, all the terms selected may not be significant when using them in a single model (separate procedure) (as if there was no stepwise selection). You can change the selection criteria in various procedures to make it more difficult to include terms.

SAS Innovate 2025: Save the Date

 SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!

Save the date!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 1366 views
  • 0 likes
  • 2 in conversation