loredana_cornea
Obsidian | Level 7

I have a dilemma, and although I have searched the internet for answers, I'm still confused.

I have a dataset with about 400 variables. I would like to reduce their number by applying PROC VARCLUS and then retaining one variable per cluster, using the centroid method.

Now, if I understood correctly, this procedure is based on R-squared, which implies linearity. That is a strong assumption that I cannot test on all 400 variables.

My question is, does the procedure work for nonlinear relationships or not?

Are there any papers, as far as you know, that treat this subject (PROC VARCLUS and non-linearity) that I could read?

Thank you

2 REPLIES
gergely_batho
SAS Employee

Yes, proc varclus (like PCA and FACTOR) implies linearity (and also normality).

On the other hand, in a data mining context it usually works (unless you have extreme nonlinearities).
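To illustrate what "extreme nonlinearities" can mean for an R-squared based method, here is a minimal sketch. It is written in Python rather than SAS, uses made-up data, and does not reproduce PROC VARCLUS itself; the helper `pearson_r2` and the two example relationships are assumptions chosen only for illustration. It compares the squared Pearson correlation for a monotone nonlinear relationship with that for a symmetric one.

```python
import math

def pearson_r2(xs, ys):
    """Squared Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(xs, ys))
    sxx = sum((a - mx) ** 2 for a in xs)
    syy = sum((b - my) ** 2 for b in ys)
    return (sxy * sxy) / (sxx * syy)

# Hypothetical data: an evenly spaced grid on [-1, 1].
x = [i / 100 for i in range(-100, 101)]

# Monotone but nonlinear: y = exp(2x).
y_monotone = [math.exp(2 * v) for v in x]

# Symmetric (non-monotone): y = x^2, a perfect dependence with no linear trend.
y_symmetric = [v * v for v in x]

r2_monotone = pearson_r2(x, y_monotone)
r2_symmetric = pearson_r2(x, y_symmetric)

print(f"R^2, x vs exp(2x): {r2_monotone:.3f}")
print(f"R^2, x vs x^2:     {r2_symmetric:.3f}")
```

In this sketch the monotone relationship still yields a fairly high R-squared, while the symmetric one yields essentially zero even though y is fully determined by x. A correlation-based clustering would likely treat such a pair as unrelated, which is the case where a nonlinear method is worth considering.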

Search for "nonlinear PCA", "PROC PRINQUAL", or "PROC NEURAL" if you are worried about nonlinearity.

A paper on nonlinear PCA is here:

http://support.sas.com/resources/papers/proceedings14/SAS313-2014.pdf

Ksharp
Super User

I don't know whether this meets your needs.

Check the fifth example in the PROC CORR documentation:

Cronbach’s Coefficient Alpha
