Re: Example Data Set For Showing Superiority of LASSO Over Stepwise Se...

Posted 09-23-2016 06:09 PM

Hello Everyone,

I want to demonstrate that

a) LASSO regression is superior to stepwise selection for variable selection

b) LASSO regression is superior to linear regression for prediction

I would like to use PROC GLMSELECT in SAS 9.3 to illustrate this. Would anyone have a data set and some code to do so?

If you have just the data set but no code, that's fine. I would be glad to write the code myself.

If you have both the data and the code, that would be even better!

Thanks for your help.

6 REPLIES

Hi Ksharp,

Sorry - I could have phrased that second question better. Suppose I generate 2 different models:

a) one model is obtained from stepwise selection

b) one model is obtained from LASSO

I want to show that the predictive accuracy of Model B is higher than that of Model A.

As Wikipedia notes, LASSO enhances the predictive accuracy of a resulting statistical model.

https://en.wikipedia.org/wiki/Lasso_(statistics)

Would you have an example data set that I can use to demonstrate this?

Thanks.

Hello Ksharp,

Could you please tell me where you got these statistics? Did you apply those methods to a data set? If so, could you please tell me where that data set comes from?

Thanks.

The goodness-of-fit statistics I referred to are from the SAS documentation. There are many examples you can work with in the PROC GLMSELECT documentation.
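For the comparison asked about earlier in this thread (a stepwise model versus a LASSO model, judged on predictive accuracy), a minimal sketch might look like the following. It is not taken from the SAS documentation. The data set `sim`, its variables, the coefficients, and the train/validate/test split are all hypothetical and simulated purely for illustration. Both runs use the same held-out rows through a role variable, so the test-set average squared error (ASE) in the two fit-statistics tables can be compared directly.

```sas
/* Hypothetical simulated data: 300 rows, 50 candidate predictors,  */
/* of which only x1-x5 actually affect y. All values are assumed.   */
data sim;
   call streaminit(2016);
   array x{50} x1-x50;
   length role $8;
   do id = 1 to 300;
      do j = 1 to 50;
         x{j} = rand('NORMAL');
      end;
      y = 2*x1 - 1.5*x2 + x3 + 0.5*x4 - 0.5*x5 + rand('NORMAL', 0, 2);
      /* Fixed split so both procedures see identical partitions. */
      if id <= 150 then role = 'TRAIN';
      else if id <= 225 then role = 'VALIDATE';
      else role = 'TEST';
      output;
   end;
   drop j;
run;

/* Model A: stepwise selection based on significance levels;       */
/* the final model is the step with the lowest validation ASE.     */
proc glmselect data=sim;
   partition rolevar=role(train='TRAIN' validate='VALIDATE' test='TEST');
   model y = x1-x50 / selection=stepwise(select=sl choose=validate);
run;

/* Model B: LASSO; the final model is the step on the LASSO path   */
/* with the lowest validation ASE.                                 */
proc glmselect data=sim;
   partition rolevar=role(train='TRAIN' validate='VALIDATE' test='TEST');
   model y = x1-x50 / selection=lasso(choose=validate stop=none);
run;
```

Compare the test ASE reported for each selected model. Nothing in this setup guarantees that LASSO comes out ahead: the result depends on the sample size, the noise level, and how many of the candidate predictors are truly relevant, so it is worth rerunning with different seeds and settings before drawing a conclusion.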

Sorry for the very late reply, Ksharp. I forgot about this thread.

Thank you very much for your help!
