Statistical Procedures

Programming the statistical procedures from SAS
BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
lousam
Obsidian | Level 7

  Hello, 

 

I am a new SAS user and I need help with clarifying any of the following concepts:

 

1. When conducting multiple linear regression using mostly continuous predictors, and one categorical predictor that was coded as 0/1 (for instance Sex: 0= male and 1=female), can you use proc reg (since the indicator variable is already created) or do you have to use proc glm with a class statement?

 

Can I use a code like this:

proc reg data=new;

model BMI = Sex Age Height Weight;

run;

 

2. Is there a difference between stepwise selection and lasso selection when trying to identify the best fit model?

 

I appreciate any insight you can offer for any of the questions. 

1 ACCEPTED SOLUTION

Accepted Solutions
WarrenKuhfeld
Ammonite | Level 13

Yes you can use PROC REG with a 0/1 variable.

 

Lasso is one of many variable selection methods.  If you are interested in variable selection, see proc glmselect.

View solution in original post

2 REPLIES 2
WarrenKuhfeld
Ammonite | Level 13

Yes you can use PROC REG with a 0/1 variable.

 

Lasso is one of many variable selection methods.  If you are interested in variable selection, see proc glmselect.

KachiM
Rhodochrosite | Level 12

You may remember:

Since your dependent variable, BMI, is computed using Height and Weight, the inclusion of the latter as independent variables,

will be highly correlated.

sas-innovate-white.png

Our biggest data and AI event of the year.

Don’t miss the livestream kicking off May 7. It’s free. It’s easy. And it’s the best seat in the house.

Join us virtually with our complimentary SAS Innovate Digital Pass. Watch live or on-demand in multiple languages, with translations available to help you get the most out of every session.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 1835 views
  • 2 likes
  • 3 in conversation