BookmarkSubscribeRSS Feed
Aamirkhan
Calcite | Level 5
In PCA whether to extract the components of the independent variables or both dependent and independent variables.
4 REPLIES 4
hollandnumerics
Pyrite | Level 9
Aamirkhan,
There is a simple correlation you should understand when posting a code-related question:

The probability of an answer is proportional to the amount of background information, SAS code, log and output you post with your question!
Philip R Holland
Holland Numerics: Blog and Forums
http://blog.hollandnumerics.org.uk
PeterClemmensen
Tourmaline | Level 20

Please provide more information on your problem..

Rick_SAS
SAS Super FREQ

Strictly speaking, PCA does not involve a dependent variable. It is a way to choose linear combinations of variables that explain the most variance in the data.

 

You are probably talking about Principle Component Regression (PCR). Most people use PCR to mean "extract components from only the independent variables.  You can read about how to run a principal component regression in SAS.

 

Although I have answered the question you asked, I want to point out that there are statistical drawbacks with this definition of principal component regression,  The better approach, IMHO, is to use PROC PLS to perform partial least squares regression. PLS uses the independent AND the dependent variables to form the components that best explain the variation in BOTH the response and the explanatory variables.

PaigeMiller
Diamond | Level 26

It sure sounds to me like the original question leads to the answer of Partial Least Squares regression.

 

There's no need to perform PCA on the x-variables and a separate PCA on the y-variables, because these two PCAs might find results that are largely uncorrelated with one another. PLS overcomes this problem by finding dimensions of y that are well predicted (at least, as well as the data will allow) by dimensions of x.

--
Paige Miller

sas-innovate-2024.png

Available on demand!

Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 4 replies
  • 1400 views
  • 1 like
  • 5 in conversation