<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Does principal component analysis obliterates the possibility of variable selection? in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868866#M42995</link>
    <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/437457"&gt;@Season&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;Does&amp;nbsp;&lt;SPAN&gt;"predictive Partial Least Squares regression" differ from other kinds of PLS?&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;You mentioned robust PCA and Kernel PCA. I wonder their differences (e.g. what goals can these methods but not "ordinary PCA" reach) as compared with "ordinary PCA" as well as the differences between the two methods.&lt;/SPAN&gt;&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;DIV class="xisDoc-refProc"&gt;
&lt;DIV id="statug_pls000020" class="aa-section"&gt;
&lt;P class="xisDoc-paragraph"&gt;Note that the name "partial least squares" also applies to a more general statistical method that is &lt;STRONG&gt;&lt;EM&gt;not&lt;/EM&gt;&lt;/STRONG&gt; implemented in the procedures PLS , HPPLS and PLSMOD. The partial least squares method was originally developed in the 1960s by the econometrician Herman Wold (&lt;A href="http://127.0.0.1:57666/help/statug.hlp/statug_pls_references.htm#statug_plswold_h66" target="_blank"&gt;1966&lt;/A&gt;) for modeling "paths" of causal relation between any number of "blocks" of variables. However, the (HP)&lt;FONT&gt;PLS&lt;/FONT&gt;&amp;nbsp;and PLSMOD procedures fit only &lt;SPAN&gt;&lt;EM&gt;predictive&lt;/EM&gt;&lt;/SPAN&gt; partial least squares models, with one "block" of predictors and one "block" of responses. If you are interested in fitting more general path models, you should consider using the &lt;FONT&gt;CALIS&lt;/FONT&gt; procedure.&lt;BR /&gt;&lt;BR /&gt;The (R)(K)PCA question was already answered by&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13684"&gt;@Rick_SAS&lt;/a&gt;&amp;nbsp;.&lt;BR /&gt;&lt;BR /&gt;And although not applicable for your use case ... here are two blogs on k-PCA :&lt;/P&gt;
&lt;UL class="lia-list-style-type-square"&gt;
&lt;LI&gt;SAS® Fast-KPCA: An efficient and innovative nonlinear principal components method&lt;BR /&gt;By Kevin Scott on The SAS Data Science Blog November 9, 2022&lt;BR /&gt;&lt;A href="https://blogs.sas.com/content/subconsciousmusings/2022/11/09/sas-fast-kpca/" target="_blank"&gt;https://blogs.sas.com/content/subconsciousmusings/2022/11/09/sas-fast-kpca/&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Efficient and automated bandwidth selection in SAS® PROC KPCA &lt;BR /&gt;By Kevin Scott on The SAS Data Science Blog November 23, 2022&lt;BR /&gt;&lt;A href="https://blogs.sas.com/content/subconsciousmusings/2022/11/23/bandwidth-selection-sas-proc-kpca/" target="_blank"&gt;https://blogs.sas.com/content/subconsciousmusings/2022/11/23/bandwidth-selection-sas-proc-kpca/&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Koen&lt;/P&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;</description>
    <pubDate>Mon, 10 Apr 2023 14:42:12 GMT</pubDate>
    <dc:creator>sbxkoenk</dc:creator>
    <dc:date>2023-04-10T14:42:12Z</dc:date>
    <item>
      <title>Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868735#M42975</link>
      <description>&lt;P&gt;Hello, everyone. I am currently building a multivariate linear regression. I found that significant collinearity exists among several independent variables and the intercept, with the largest condition index reaches to a staggering number of 90. Yet using the COLLINOINT option in MODEL statement of PROC REG, no collinearity was found when the intercept was not considered.&lt;/P&gt;
&lt;P&gt;I have come to known that principal component analysis is one of the ways of tackling collinearity. However, after reading information about that method, I suddenly found out a possible limitation of that method:&amp;nbsp;&lt;STRONG&gt;since (1) in the variable selection process following principal component analysis, it is the principal components, not the original independent variables that are selected,&amp;nbsp;and (2) each and every principal component takes all the independent variables into account more or less, there is no way of "getting rid of" statistically insignificant variables in the variable selection process&lt;/STRONG&gt;. I &lt;U&gt;wonder if my notion were correct&lt;/U&gt;.&lt;/P&gt;
&lt;P&gt;A &lt;STRONG&gt;second question&lt;/STRONG&gt; that follows is: in the case of collinearities among several independent variables and the intercept (no collinearity among the independent variables themselves), &lt;STRONG&gt;is variable standardization&lt;/STRONG&gt; (i.e. transforming the independent variables into variables with the same standard deviation via modules like PROC STANDARD) &lt;STRONG&gt;still a feasible method&lt;/STRONG&gt;, as is the case in&amp;nbsp;&lt;A title="generalized linear models" href="https://support.sas.com/kb/32/471.html" target="_blank" rel="noopener"&gt;generalized linear models&lt;/A&gt;?&lt;/P&gt;
&lt;P&gt;Thank you very much!&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 09 Apr 2023 10:38:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868735#M42975</guid>
      <dc:creator>Season</dc:creator>
      <dc:date>2023-04-09T10:38:37Z</dc:date>
    </item>
    <item>
      <title>Re: Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868743#M42976</link>
      <description>&lt;P&gt;Use Partial Least Squares (PROC PLS) for regression, not PCA, when there is collinearity. PLS is relatively robust against the effects of multi-collinearity. Randy Tobias (of SAS Institute) gives &lt;A href="https://support.sas.com/rnd/app/stat/papers/pls.pdf" target="_self"&gt;an example&lt;/A&gt; of PLS creating a useful model with 1,000 highly correlated variables, and he does not have to go through the step of variable selection.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;PCA is the wrong tool for regression (even though there are many examples in the literature). It selects variables to have large loadings not based on whether or not they are good predictors, but based only upon the x-matrix. PLS selects variables to have large loadings based on whether or not they are good predictors, both the x-matrix and y-matrix are used.&lt;/P&gt;</description>
      <pubDate>Sun, 09 Apr 2023 10:11:45 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868743#M42976</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2023-04-09T10:11:45Z</dc:date>
    </item>
    <item>
      <title>Re: Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868757#M42977</link>
      <description>&lt;P&gt;Thank you for your advice! What about ridge regression and LASSO? What are their features (e.g. advantages and disadvantages) compared to partial least squares when it comes to dealing with collinearities in multivariate linear regression?&lt;/P&gt;</description>
      <pubDate>Sun, 09 Apr 2023 10:36:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868757#M42977</guid>
      <dc:creator>Season</dc:creator>
      <dc:date>2023-04-09T10:36:11Z</dc:date>
    </item>
    <item>
      <title>Re: Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868759#M42979</link>
      <description>&lt;P&gt;I really can't answer questions about the Lasso.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The paper by &lt;A href="https://www.jstor.org/stable/1269656" target="_self"&gt;Frank and Friedman&lt;/A&gt;&amp;nbsp;showed that PLS had lower mean square error of the predictions and lower mean square error of the regression coefficients (sometimes by an order of magnitude) compared to variable selection methods and compared to Ridge Regression and compared to Principal Components regression.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Many people have been trained that they MUST do variable selection (which they have also been trained to understand that this is time consuming and difficult to do), and that simply isn't true. There are probably thousands of published papers showing PLS creating useful models without variable selection; and the paper by Frank and Friedman shows that in most cases PLS produces a better model (as measured by the MSE of the predictions and MSE of the regression coefficients).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 09 Apr 2023 10:47:20 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868759#M42979</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2023-04-09T10:47:20Z</dc:date>
    </item>
    <item>
      <title>Re: Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868762#M42980</link>
      <description>&lt;P&gt;OK, thank you very much for your information, including the article you cited, advantages of PLS compared to other methods, and your opinion regarding variable selection!&lt;/P&gt;</description>
      <pubDate>Sun, 09 Apr 2023 11:53:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868762#M42980</guid>
      <dc:creator>Season</dc:creator>
      <dc:date>2023-04-09T11:53:12Z</dc:date>
    </item>
    <item>
      <title>Re: Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868853#M42985</link>
      <description>&lt;P&gt;To answer your question directly: principal component analysis does not eliminate any of the original variables. It reduces the dimension of the problem by keeping only a small number of linear combinations.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I think most people would refer to PCA as a &lt;EM&gt;dimension reduction&lt;/EM&gt; method, rather than a variable selection method. As you point out, a PCA model includes&amp;nbsp;&lt;EM&gt;all&lt;/EM&gt; of the original variables in the model. The model keeps the linear combinations that explain most of the variance in the model. The PCA regression&amp;nbsp;&lt;A href="https://blogs.sas.com/content/iml/2017/08/02/retain-principal-components.html" target="_self"&gt;uses some criterion for determining how many principal components to retain&lt;/A&gt;, then includes only a small number of PCs in the model.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 10 Apr 2023 13:05:52 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868853#M42985</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2023-04-10T13:05:52Z</dc:date>
    </item>
    <item>
      <title>Re: Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868858#M42987</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Personally, I am also a supporter of PLS, like suggested by&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/10892"&gt;@PaigeMiller&lt;/a&gt;&amp;nbsp;.&lt;/P&gt;
&lt;P&gt;I am talking about "predictive Partial Least Squares regression" here (with one input block containing predictors and one output block containing response variables).&amp;nbsp;&lt;BR /&gt;See PROC PLS and PROC PLSMOD.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;With regard to Principal Components Analysis (PCA) ...&lt;BR /&gt;Note that in VIYA two interesting procedures were added :&lt;/P&gt;
&lt;UL class="lia-list-style-type-square"&gt;
&lt;LI&gt;PROC RPCA (Robust PCA)&lt;/LI&gt;
&lt;LI&gt;PROC KPCA (Kernel PCA)&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Cheers,&lt;/P&gt;
&lt;P&gt;Koen&lt;/P&gt;</description>
      <pubDate>Mon, 10 Apr 2023 14:07:03 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868858#M42987</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2023-04-10T14:07:03Z</dc:date>
    </item>
    <item>
      <title>Re: Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868859#M42988</link>
      <description>&lt;P&gt;Thank you, Rick, for giving a direct answer to the question I raised in the first place! I come to acknowledge the fact rather than being a variable selection method, principal component analysis is a data-dimension lowering method. Yet I would like to point out the reason why I planned to choose the method in the first place: I wish to tackle collinearity in multivariate linear regression. I set up a higher goal, which is to t&lt;STRONG&gt;ackle collinearity while conducting variable selection simultaneously&lt;/STRONG&gt;. I want my model to be concise. But to the best of my knowledge, principal component analysis does not seem to be able to reach that goal, so I came here to see whether I was wrong. Now that my notion is correct, I am going to abandon principal component analysis and try other methods to reach my goal.&lt;/P&gt;
&lt;P&gt;I would also like to consult on other methods dealing with collinearity. I am interested in &lt;STRONG&gt;the features (e.g. advantages and disadvantages) of partial least squares, ridge regression and LASSO&lt;/STRONG&gt;. I wonder if you could give me a helping hand on that.&lt;/P&gt;</description>
      <pubDate>Mon, 10 Apr 2023 14:12:22 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868859#M42988</guid>
      <dc:creator>Season</dc:creator>
      <dc:date>2023-04-10T14:12:22Z</dc:date>
    </item>
    <item>
      <title>Re: Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868860#M42989</link>
      <description>&lt;P&gt;Thank you, Koen, for kindly offering me help!&lt;/P&gt;
&lt;P&gt;Your description on PLS is brief. Does&amp;nbsp;&lt;SPAN&gt;"predictive Partial Least Squares regression" differ from other kinds of PLS?&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;You mentioned robust PCA and Kernel PCA. I wonder their differences (e.g. what goals can these methods but not "ordinary PCA" reach) as compared with "ordinary PCA" as well as the differences between the two methods.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Thank you very much!&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 10 Apr 2023 14:19:58 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868860#M42989</guid>
      <dc:creator>Season</dc:creator>
      <dc:date>2023-04-10T14:19:58Z</dc:date>
    </item>
    <item>
      <title>Re: Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868861#M42990</link>
      <description>&lt;BLOCKQUOTE&gt;
&lt;P&gt;&lt;SPAN&gt;I set up a higher goal, which is to t&lt;/SPAN&gt;&lt;STRONG&gt;ackle collinearity while conducting variable selection simultaneously&lt;/STRONG&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;If you still insist on having a step for variable selection (as I mentioned, it is not a necessary step if you use PLS), then you can interpret PLS as handling collinearity while conducting variable selection simultaneously (the variables with loadings close to zero are not selected, the variable with loadings not close to zero are selected).&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 10 Apr 2023 14:24:13 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868861#M42990</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2023-04-10T14:24:13Z</dc:date>
    </item>
    <item>
      <title>Re: Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868863#M42992</link>
      <description>&lt;P&gt;Robust PCA uses robust estimates of the mean vector and covariance matrix, which means that the PCA is not unduly influenced by outliers in the data, which would otherwise bias the results.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Traditional PCA uses linear combinations of the original variables. Kernel PCA is a way to capture nonlinear combinations. It is mostly used for discriminant analysis and classification, which doesn't seem applicable to your situation.&lt;/P&gt;</description>
      <pubDate>Mon, 10 Apr 2023 14:28:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868863#M42992</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2023-04-10T14:28:31Z</dc:date>
    </item>
    <item>
      <title>Re: Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868864#M42993</link>
      <description>&lt;P&gt;Thank you for your reply. Well, actually, I am not "insisting" on variable selection... I just want to try different modeling strategies and pick up a model that works "best" eventually. I am not that concerned on whether variable selection is performed in the process. My top priority is the "goodness" of the model. Conciseness of the model is one of the priorities that follow. When it comes to these issues, models with less variables and similar "goodness" to the "full" model may be better. That is when variable selection could play a more major role.&lt;/P&gt;</description>
      <pubDate>Mon, 10 Apr 2023 15:34:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868864#M42993</guid>
      <dc:creator>Season</dc:creator>
      <dc:date>2023-04-10T15:34:54Z</dc:date>
    </item>
    <item>
      <title>Re: Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868865#M42994</link>
      <description>&lt;P&gt;&lt;span class="lia-unicode-emoji" title=":grinning_face:"&gt;😀&lt;/span&gt;Thank you for your explanation!&lt;span class="lia-unicode-emoji" title=":thumbs_up:"&gt;👍&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 10 Apr 2023 14:39:33 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868865#M42994</guid>
      <dc:creator>Season</dc:creator>
      <dc:date>2023-04-10T14:39:33Z</dc:date>
    </item>
    <item>
      <title>Re: Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868866#M42995</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/437457"&gt;@Season&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;Does&amp;nbsp;&lt;SPAN&gt;"predictive Partial Least Squares regression" differ from other kinds of PLS?&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;You mentioned robust PCA and Kernel PCA. I wonder their differences (e.g. what goals can these methods but not "ordinary PCA" reach) as compared with "ordinary PCA" as well as the differences between the two methods.&lt;/SPAN&gt;&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;DIV class="xisDoc-refProc"&gt;
&lt;DIV id="statug_pls000020" class="aa-section"&gt;
&lt;P class="xisDoc-paragraph"&gt;Note that the name "partial least squares" also applies to a more general statistical method that is &lt;STRONG&gt;&lt;EM&gt;not&lt;/EM&gt;&lt;/STRONG&gt; implemented in the procedures PLS , HPPLS and PLSMOD. The partial least squares method was originally developed in the 1960s by the econometrician Herman Wold (&lt;A href="http://127.0.0.1:57666/help/statug.hlp/statug_pls_references.htm#statug_plswold_h66" target="_blank"&gt;1966&lt;/A&gt;) for modeling "paths" of causal relation between any number of "blocks" of variables. However, the (HP)&lt;FONT&gt;PLS&lt;/FONT&gt;&amp;nbsp;and PLSMOD procedures fit only &lt;SPAN&gt;&lt;EM&gt;predictive&lt;/EM&gt;&lt;/SPAN&gt; partial least squares models, with one "block" of predictors and one "block" of responses. If you are interested in fitting more general path models, you should consider using the &lt;FONT&gt;CALIS&lt;/FONT&gt; procedure.&lt;BR /&gt;&lt;BR /&gt;The (R)(K)PCA question was already answered by&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13684"&gt;@Rick_SAS&lt;/a&gt;&amp;nbsp;.&lt;BR /&gt;&lt;BR /&gt;And although not applicable for your use case ... here are two blogs on k-PCA :&lt;/P&gt;
&lt;UL class="lia-list-style-type-square"&gt;
&lt;LI&gt;SAS® Fast-KPCA: An efficient and innovative nonlinear principal components method&lt;BR /&gt;By Kevin Scott on The SAS Data Science Blog November 9, 2022&lt;BR /&gt;&lt;A href="https://blogs.sas.com/content/subconsciousmusings/2022/11/09/sas-fast-kpca/" target="_blank"&gt;https://blogs.sas.com/content/subconsciousmusings/2022/11/09/sas-fast-kpca/&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Efficient and automated bandwidth selection in SAS® PROC KPCA &lt;BR /&gt;By Kevin Scott on The SAS Data Science Blog November 23, 2022&lt;BR /&gt;&lt;A href="https://blogs.sas.com/content/subconsciousmusings/2022/11/23/bandwidth-selection-sas-proc-kpca/" target="_blank"&gt;https://blogs.sas.com/content/subconsciousmusings/2022/11/23/bandwidth-selection-sas-proc-kpca/&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Koen&lt;/P&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;</description>
      <pubDate>Mon, 10 Apr 2023 14:42:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868866#M42995</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2023-04-10T14:42:12Z</dc:date>
    </item>
    <item>
      <title>Re: Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868868#M42996</link>
      <description>&lt;P&gt;And regarding introductory articles about robust PCA, I wrote about RPCA back in 2010, way before Viya or PROC RPCA were implemented. See &lt;A href="https://support.sas.com/resources/papers/proceedings10/329-2010.pdf" target="_self"&gt;p. 9-10 of Wicklin (2010)&lt;/A&gt;&amp;nbsp;or the blog post&amp;nbsp; from 2017, &lt;A href="https://blogs.sas.com/content/iml/2017/08/09/robust-principal-components-sas.html" target="_self"&gt;"Robust principal component analysis in SAS."&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 10 Apr 2023 15:03:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868868#M42996</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2023-04-10T15:03:35Z</dc:date>
    </item>
    <item>
      <title>Re: Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868869#M42997</link>
      <description>&lt;P&gt;Thank you very, very much for your kind help, including a more detailed description of PLS and its history! I really appreciate your blending humanities and statistics, with introductions on the history of statistical methods, including those of PLS and joint model for longitudinal and time-to-event data (which you had pointed out previously) attached. For me, scientific history is not just a record of what happened in the past, but also a record of the trajectories of the development of sciences, from which I can summarize the pattern of existing scientific knowledge and disciplines, raise questions or even propose theories. I firmly believe that questions are key elements of the improvement of science, as is the case of the development of&amp;nbsp;elliptic integrals while astronomers tried to calculate the&amp;nbsp;circumference of orbits. In a word (two words, to be exact&lt;span class="lia-unicode-emoji" title=":face_with_tears_of_joy:"&gt;😂&lt;/span&gt;), thank you!&lt;/P&gt;
&lt;P&gt;Still, I am still a green hand on PLS, so I hardly know anything about this method apart from its name. Your interpretation of PLS using "paths and blocks" somehow illuminates the method, but I am still not that clear about it. I am going to read more about PLS. I would like to raise a brief question for the sake of selecting a possible "shortcut": &lt;STRONG&gt;do you think that in the situation I encounter, only&amp;nbsp;&lt;/STRONG&gt;&lt;SPAN&gt;&lt;STRONG&gt;"predictive Partial Least Squares regression", but not other kinds of PLS, is suitable for reaching the goal I previously mentioned (tackle collinearity and conducting variable selection at the same time in a multivariate linear regression)&lt;/STRONG&gt;?&lt;/SPAN&gt;&amp;nbsp;If so, maybe I do not need to know every kind of PLS to reach my goal.&lt;/P&gt;</description>
      <pubDate>Mon, 10 Apr 2023 15:14:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868869#M42997</guid>
      <dc:creator>Season</dc:creator>
      <dc:date>2023-04-10T15:14:29Z</dc:date>
    </item>
    <item>
      <title>Re: Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868870#M42998</link>
      <description>&lt;P&gt;OK, thank you for your previous work and kindly offering them to me!&lt;/P&gt;</description>
      <pubDate>Mon, 10 Apr 2023 15:05:44 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868870#M42998</guid>
      <dc:creator>Season</dc:creator>
      <dc:date>2023-04-10T15:05:44Z</dc:date>
    </item>
    <item>
      <title>Re: Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868879#M42999</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;I would like to raise a brief question for the sake of selecting a possible "shortcut": &lt;STRONG&gt;do you think that in the situation I encounter, only&amp;nbsp;&lt;/STRONG&gt;&lt;SPAN&gt;&lt;STRONG&gt;"predictive Partial Least Squares regression", but not other kinds of PLS, is suitable for reaching the goal I previously mentioned (tackle collinearity and conducting variable selection at the same time in a multivariate linear regression)&lt;/STRONG&gt;?&lt;/SPAN&gt;&amp;nbsp;If so, maybe I do not need to know every kind of PLS to reach my goal.&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Forget about path modeling / path analysis in PROC CALIS.&lt;BR /&gt;What you need is the kind of PLS as fit by PLS / HPPLS / PLSMOD procedures (predictive PLS regression with an input block and an output block). If I am right your output block has only one response variable, so you are doing&amp;nbsp;&lt;EM&gt;multiple regression&lt;/EM&gt;&lt;SPAN&gt;&amp;nbsp;analysis and NOT multivariate regression!&lt;BR /&gt;PLS&amp;nbsp;method is well suited to tackle problems of &lt;EM&gt;multicollinearity&lt;/EM&gt;&amp;nbsp;.&lt;BR /&gt;You can choose a proper PLS model using Cross-Validation or&amp;nbsp;by Test Set Validation.&lt;BR /&gt;&lt;BR /&gt;If the curse-of-dimensionality is "hitting hard" , you can consider to run the&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;VARREDUCE procedure before running PLS.&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;DIV class="xisDoc-refProc"&gt;
&lt;DIV id="casstat.varreduce.details" class="aa-section"&gt;
&lt;P class="xisDoc-paragraph"&gt;The VARREDUCE procedure performs both supervised and unsupervised variable selection. It selects variables by identifying a set of variables that can jointly explain the maximum amount of data variance.&lt;BR /&gt;&lt;BR /&gt;Cheers,&lt;BR /&gt;Koen&lt;/P&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;</description>
      <pubDate>Mon, 10 Apr 2023 15:31:46 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868879#M42999</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2023-04-10T15:31:46Z</dc:date>
    </item>
    <item>
      <title>Re: Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868883#M43000</link>
      <description>&lt;P&gt;Thank you very much, Koen, for your more detailed explanation!&lt;/P&gt;
&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/60547"&gt;@sbxkoenk&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;BLOCKQUOTE&gt;&lt;HR /&gt;I would like to raise a brief question for the sake of selecting a possible "shortcut": &lt;STRONG&gt;do you think that in the situation I encounter, only&amp;nbsp;&lt;/STRONG&gt;&lt;SPAN&gt;&lt;STRONG&gt;"predictive Partial Least Squares regression", but not other kinds of PLS, is suitable for reaching the goal I previously mentioned (tackle collinearity and conducting variable selection at the same time in a multivariate linear regression)&lt;/STRONG&gt;?&lt;/SPAN&gt;&amp;nbsp;If so, maybe I do not need to know every kind of PLS to reach my goal.&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;If I am right your output block has only one response variable, so you are doing&amp;nbsp;&lt;EM&gt;multiple regression&lt;/EM&gt;&lt;SPAN&gt;&amp;nbsp;analysis and NOT multivariate regression!&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;I am not sure what the noun "output block" means, but the model I am attempting to build&amp;nbsp;&lt;EM&gt;does&lt;/EM&gt; contain merely one dependent variable. I know that Bayesian neural network and&amp;nbsp;structural equation model can be used to deal with situations in which more than one dependent variable is modeled. These situations are far more complicated than the one I am encountering. I am building a regression model with only &lt;STRONG&gt;one &lt;/STRONG&gt;dependent variable.&lt;/P&gt;
&lt;P&gt;I wonder whether a composite (more than one) dependent variable, instead of solely one dependent variable is involved in the modeling equals "output block has more than one response variable". But anyway, I appreciate your pointing out the subtle yet maybe significant difference of my nomenclature of the analysis I am attempting to perform.&lt;/P&gt;</description>
      <pubDate>Tue, 11 Apr 2023 00:00:05 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868883#M43000</guid>
      <dc:creator>Season</dc:creator>
      <dc:date>2023-04-11T00:00:05Z</dc:date>
    </item>
    <item>
      <title>Re: Does principal component analysis obliterates the possibility of variable selection?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868889#M43001</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/437457"&gt;@Season&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;I am not sure what the noun "output block" means, but the model I am attempting to build&amp;nbsp;&lt;EM&gt;does&lt;/EM&gt; contain merely one dependent variable. I know that Bayesian neural network and&amp;nbsp;structural equation model can be used to deal with situations in which more than one dependent variable is modeled. These situations are far more complicated than the one I am encountering. I am building a regression model with only &lt;STRONG&gt;one &lt;/STRONG&gt;dependent variable.&lt;/P&gt;
&lt;P&gt;I wonder whether a composite (more than one) dependent variable, instead of solely one dependent variable is involved in the modeling equals "output block has only one response variable". But anyway, I appreciate your pointing out the subtle yet maybe significant difference of my nomenclature of the analysis I am attempting to perform.&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;PROC PLS , HPPLS and PLSMOD can also deal with more than one dependent variable.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Partial Least Squares (PLS) is very popular in process manufacturing as an efficient approach for multivariate statistical process monitoring. &lt;BR /&gt;Multivariate Analysis (MVA) can mean two things:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;there are only input variables (correspondence analysis, PCA, Factor analysis, &lt;SPAN&gt;Multidimensional scaling (MDS)&amp;nbsp;&lt;/SPAN&gt;...)&lt;/LI&gt;
&lt;LI&gt;if there is an output block, then Multivariate Analysis (MVA) means "solving problems where more than one dependent variable is analyzed" (all dependent variables are simultaneously explained / predicted in just one model).&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;STRONG&gt;But PROC PLS works perfectly fine as well if your output block has only one variable!&lt;/STRONG&gt;&lt;BR /&gt;Partial Least Squares was for example available in Enterprise Miner 15.2 (but it is no longer available in VIYA Visual Data Mining and Machine Learning with Model Studio).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Good luck,&lt;BR /&gt;Koen&lt;/P&gt;</description>
      <pubDate>Mon, 10 Apr 2023 16:08:14 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Does-principal-component-analysis-obliterates-the-possibility-of/m-p/868889#M43001</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2023-04-10T16:08:14Z</dc:date>
    </item>
  </channel>
</rss>

