SAS Optimization, and SAS Simulation Studio

Auto-suggest helps you quickly narrow down your search results by suggesting possible matches as you type.

Showing results for

Find a Community

- RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page

- Mark as New
- Bookmark
- Subscribe
- RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

01-16-2008 11:25 AM

I’m conducting logistic regression using proc logistic on the sample consisting of approximately 150000 people described by 1500 variables. The analysis lasts for about 8 hours. Do you know if there is any methodical way to speed it up? Or is it rather a software/hardware problem?

Thanks a lot.

Regards

Iryna

Thanks a lot.

Regards

Iryna

- Mark as New
- Bookmark
- Subscribe
- RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Posted in reply to 8378

01-18-2008 04:26 AM

Hi Iryna.

I don't think you really need all these 1500 variables to be used in the model, do you ?

So I'd rather use both SELECTION=FORWARD and STOP=50 to see which variables are the (at most) fifty best-contributing to your model, and then rerun the model with them...

Regards

Olivier

I don't think you really need all these 1500 variables to be used in the model, do you ?

So I'd rather use both SELECTION=FORWARD and STOP=50 to see which variables are the (at most) fifty best-contributing to your model, and then rerun the model with them...

Regards

Olivier

- Mark as New
- Bookmark
- Subscribe
- RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Posted in reply to Olivier

01-18-2008 11:30 AM

Are any of these 1,500 variables highly correlated? If so, you might be able to select one among a group of highly correlated variables or use a small number of principal components (from a Principal Components Analysis) for your logistic regression.

- Mark as New
- Bookmark
- Subscribe
- RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Posted in reply to 8378

01-22-2008 06:20 AM

iryna, i think you don't need all 150000 records/observations either.

for example, if you are interested in variables that rate respondents' ratings of certain job attributes, you may want to use the data for employed respondents only.

for example, if you are interested in variables that rate respondents' ratings of certain job attributes, you may want to use the data for employed respondents only.

- Mark as New
- Bookmark
- Subscribe
- RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Posted in reply to 8378

01-22-2008 11:34 AM

Is this question related to Mathematical Optimization and Operations Research with SAS? If not, this is the wrong forum.