BookmarkSubscribeRSS Feed
Kanyange
Fluorite | Level 6

Hi All,

I have around 1500 variables, I would like to check the correlation between variabes before I start using the variables for modelling. is there any smarter way to do it that doesn't take a long time to run, basically I wan't to remove anthing with a high correlation  > 0.8 for example! Kind of macro but doesn't use do loop as I have millions of rows (8 Millioms) to process and our system is quite slow

Your help would be much appreciated..

Many Thanks


4 REPLIES 4
PaigeMiller
Diamond | Level 26

is there any smarter way to do it that doesn't take a long time to run,

I'm going to say the answer is NO.

The reason I say this is that, in my opinion, looking at correlations is not the smartest way to proceed here. A better way to proceed, in my opinion, is to fit a PLS model to the data and determine combinations of your original variables that are predictive of the response. PLS takes into account the correlations between your variables better than most modelling techniques. Your method of looking at correlations between variables ignores whether or not the variable is predictive of the response.

--
Paige Miller
Kanyange
Fluorite | Level 6

Thank you Paige...any documentation on PLS results interpretation?

Kanyange
Fluorite | Level 6

Thank you..so what do you think of Proc Reg to get the VIF?

Ready to join fellow brilliant minds for the SAS Hackathon?

Build your skills. Make connections. Enjoy creative freedom. Maybe change the world. Registration is now open through August 30th. Visit the SAS Hackathon homepage.

Register today!
What is Bayesian Analysis?

Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 4 replies
  • 1153 views
  • 6 likes
  • 2 in conversation