I am trying to do a linear regression to computer whether body weight can predict blood pressure after controlling for age, healthstatus, and physical activity. Here, my DV (blood pressure) is continuous and my IV(body weight) is continuous. Also, I have several covariate IVs; age is continous, healthstatus is categorical (dummy coded 1-5), and physical activity is continous (# hours doing physical activity). To do this, should I use proc glm or proc reg?
I recommend proc glm. There you can use the class statement to dummy code the categorical variables.
You can use also proc reg, but then first you need to pre-process your data (creating dummy variables "manually").
Can you check multicollinearity using proc glm?
You can use STEPWISE BACKWISW ... option of MODEL to get rid of multicollinearity and get the best model .
"Best models" with categorical variables are messy, and those methods in PROC REG really won't help. The categorical dummy variables are by definition collinear.
I would approach this with PROC GLM or MIXED, and look at the chapter on Analysis of Covariance in SAS for Mixed Models, 2nd ed.
Steve Denham
Doc Steve,
Sorry to make you confused . My statistical background was not as good as you were .
Available on demand!
Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.
Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.
Find more tutorials on the SAS Users YouTube channel.