<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: PROC PLS root mean PRESS calculation in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/403261#M21036</link>
    <description>&lt;P&gt;Thanks for your replies.&lt;/P&gt;&lt;P&gt;Rick_SAS, I applied your code with the output from&amp;nbsp;my proc PLS model. The root mean PRESS from the (resid[i]/1-h)**2 formula gave the same output for each observation&amp;nbsp;as using the PRESS ouput option from the PROC PLS model and squaring that value. When I take the root mean PRESS (sqrt(sum(press)/count(press))&amp;nbsp;from these manually calculated values and&amp;nbsp;plot them against the RMPRESS table output by PROC PLS by default, it&amp;nbsp;would appear&amp;nbsp;values are&amp;nbsp;close to some&amp;nbsp;logarithmic transformation.&lt;span class="lia-inline-image-display-wrapper lia-image-align-center" image-alt="PRESS comparison.jpg" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/15789iD86216B8460011A5/image-size/large?v=v2&amp;amp;px=999" role="button" title="PRESS comparison.jpg" alt="PRESS comparison.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 11 Oct 2017 17:07:47 GMT</pubDate>
    <dc:creator>jroberts1992</dc:creator>
    <dc:date>2017-10-11T17:07:47Z</dc:date>
    <item>
      <title>PROC PLS root mean PRESS calculation</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/402966#M21020</link>
      <description>&lt;P&gt;Dear SAS community,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am running PLS on spectra data and I am trying to clarify how root mean press is calculated in full leave-one-out cross validation. I have the following model:&lt;/P&gt;&lt;P&gt;PROC PLS DATA=df cv=one;&lt;/P&gt;&lt;P&gt;MODEL depvar=nm400-nm1900;&lt;/P&gt;&lt;P&gt;OUTPUT out=outputdata&amp;nbsp;PRESS=press;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;As I understand it, the OUTPUT PRESS option gives the residual value between the observed and predicted for each observation when&amp;nbsp;that observation is&amp;nbsp;"held out" from the data set. First, is this correct?&lt;/P&gt;&lt;P&gt;Now if I calculate the root mean press for the data set from the output dataset PRESS values, the value is different than the root mean press for the number of factors which minimizes PRESS. From the analyses I've seen, the root mean PRESS output by SAS seem to vary between 0 and 1 (or at least near 1), so my guess is there is some sort of normalization occuring in these calculations. I can't find any source detail if or&amp;nbsp;how SAS is normalizing the root mean&amp;nbsp;PRESS values. I've also tried a multitude of transformations to try and replicate the root mean PRESS output by the model to no avail.&lt;/P&gt;&lt;P&gt;Can anyone shed any light on this?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you,&lt;/P&gt;&lt;P&gt;Jordan&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 10 Oct 2017 22:08:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/402966#M21020</guid>
      <dc:creator>jroberts1992</dc:creator>
      <dc:date>2017-10-10T22:08:18Z</dc:date>
    </item>
    <item>
      <title>Re: PROC PLS root mean PRESS calculation</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/402972#M21021</link>
      <description>&lt;P&gt;Last time I checked, the SAS documentation on how PRESS is computed was pretty weak. Now maybe it has improved, but one thing I will state as a definite fact, is that I am too lazy to check. I will point out that as I was going to check, I came across this horrible mis-documentation at &lt;A href="http://support.sas.com/documentation/onlinedoc/stat/indexproc.html#stat143" target="_blank"&gt;http://support.sas.com/documentation/onlinedoc/stat/indexproc.html#stat143&lt;/A&gt; and I threw my hands up and stopped right there.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="2017-10-10 18_16_20-SAS_STAT Procedures.png" style="width: 499px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/15760iDE66ABB11A2C4689/image-size/large?v=v2&amp;amp;px=999" role="button" title="2017-10-10 18_16_20-SAS_STAT Procedures.png" alt="2017-10-10 18_16_20-SAS_STAT Procedures.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Hey&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/4"&gt;@ChrisHemedinger&lt;/a&gt;, to whom should we complain so this can be fixed?&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;As I understand it, the OUTPUT PRESS option gives the residual value between the observed and predicted for each observation when&amp;nbsp;that observation is&amp;nbsp;"held out" from the data set. First, is this correct?&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;As I understand what you're saying, I believe it to be correct. I would make a minor wording change to say "...when that observation is 'held out' from the PLS model".&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;From the analyses I've seen, the root mean PRESS output by SAS seem to vary between 0 and 1 (or at least near 1), so my guess is there is some sort of normalization occuring in these calculations.&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am not aware of this statistic being normalized somehow; even the name "Predicted Residual Sum-of-Squares" indicates it is a sum (of squares) and not some normalized quantity.&lt;/P&gt;</description>
      <pubDate>Wed, 11 Oct 2017 11:56:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/402972#M21021</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2017-10-11T11:56:10Z</dc:date>
    </item>
    <item>
      <title>Re: PROC PLS root mean PRESS calculation</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/403111#M21025</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/10892"&gt;@PaigeMiller&lt;/a&gt;&amp;nbsp;- I've passed your comment on to the person in R&amp;amp;D who maintains this page.&lt;/P&gt;</description>
      <pubDate>Wed, 11 Oct 2017 11:56:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/403111#M21025</guid>
      <dc:creator>ChrisHemedinger</dc:creator>
      <dc:date>2017-10-11T11:56:21Z</dc:date>
    </item>
    <item>
      <title>Re: PROC PLS root mean PRESS calculation</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/403112#M21026</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/4"&gt;@ChrisHemedinger&lt;/a&gt;&amp;nbsp;As they say here in Buffalo, NY,&amp;nbsp;&lt;EM&gt;muchas gracias&lt;/EM&gt;.&lt;/P&gt;</description>
      <pubDate>Wed, 11 Oct 2017 11:59:16 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/403112#M21026</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2017-10-11T11:59:16Z</dc:date>
    </item>
    <item>
      <title>Re: PROC PLS root mean PRESS calculation</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/403158#M21027</link>
      <description>&lt;P&gt;Speaking as a Buffalo native,&amp;nbsp;&lt;EM&gt;fuggedaboutit.&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 11 Oct 2017 13:27:27 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/403158#M21027</guid>
      <dc:creator>ChrisHemedinger</dc:creator>
      <dc:date>2017-10-11T13:27:27Z</dc:date>
    </item>
    <item>
      <title>Re: PROC PLS root mean PRESS calculation</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/403166#M21028</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/10892"&gt;@PaigeMiller&lt;/a&gt;&amp;nbsp;- doc page is now fixed!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="pls.png" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/15778iF41FE03ECFA19FBE/image-size/large?v=v2&amp;amp;px=999" role="button" title="pls.png" alt="pls.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 11 Oct 2017 13:50:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/403166#M21028</guid>
      <dc:creator>ChrisHemedinger</dc:creator>
      <dc:date>2017-10-11T13:50:12Z</dc:date>
    </item>
    <item>
      <title>Re: PROC PLS root mean PRESS calculation</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/403214#M21029</link>
      <description>&lt;P&gt;The PRESS statistic is documented among the residual and influence statistics. PROC REG has a little bit, but the main reference is the &lt;A href="http://go.documentation.sas.com/?docsetId=statug&amp;amp;docsetTarget=statug_introreg_sect036.htm&amp;amp;docsetVersion=14.3&amp;amp;locale=en" target="_self"&gt;"Predicted and Residual Values" section &lt;/A&gt;of the "Introduction to Regression Procedures" chapter. The doc says&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;"The predicted residual for observation i is defined as the residual for the ith observation that results from dropping the ith observation [and refitting the&amp;nbsp;model]. The sum of squares of predicted residual errors is called the PRESS statistic:"&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In practice, there is no need to refit the model since you can efficiently compute the predicted residual by the formula&lt;/P&gt;
&lt;P&gt;PResid[i] = Resid[i] / (1-h[i])&lt;/P&gt;
&lt;P&gt;where h[i] is the ith leverage value, sometimes called the "hat diagonal".&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Could this be the missing "normalization factor" you are looking for?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Here is a program that uses PROC REG to output the PRESS statistic, then uses a DATA step to accumulate the sum of the squares of the quantities Residual / (1 - h[i]). The DATA step validates the PRESS statistic in the OUTEST= data set.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;proc reg data=sashelp.class outest=est plots=none;
model weight = height age / press influence;
ods output  OutputStatistics=out;
quit;

proc print data=est; var _PRESS_; run;

data PRESS;
set out end=eof;
PRESS + (Residual / (1 - hatDiagonal))**2;
if eof then output;
keep PRESS;
run;

proc print data=PRESS; run;
&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 11 Oct 2017 15:43:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/403214#M21029</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2017-10-11T15:43:29Z</dc:date>
    </item>
    <item>
      <title>Re: PROC PLS root mean PRESS calculation</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/403240#M21033</link>
      <description>&lt;BLOCKQUOTE&gt;
&lt;P&gt;&lt;SPAN&gt;The PRESS statistic is documented among the residual and influence statistics. PROC REG has a little bit, but the main reference is the&amp;nbsp;&lt;/SPAN&gt;&lt;A href="http://go.documentation.sas.com/?docsetId=statug&amp;amp;docsetTarget=statug_introreg_sect036.htm&amp;amp;docsetVersion=14.3&amp;amp;locale=en" target="_self" rel="nofollow noopener noreferrer"&gt;"Predicted and Residual Values" section&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN&gt;of the "Introduction to Regression Procedures" chapter.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Aha! This is why I remember not being satisfied with the PROC PLS documentation ... the information isn't in PROC PLS documentation, it is elsewhere in the SAS documentation. There&amp;nbsp;ought to be a hyperlink in the PROC PLS documentation to this "Predicted and Residual Values" section. A documentation deficiency!! Can we get this fixed,&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13684"&gt;@Rick_SAS&lt;/a&gt;&amp;nbsp;and &lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/4"&gt;@ChrisHemedinger&lt;/a&gt;?&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;In practice, there is no need to refit the model since you can efficiently compute the predicted residual by the formula&lt;/P&gt;
&lt;P&gt;PResid[i] = Resid[i] / (1-h[i])&lt;/P&gt;
&lt;P&gt;where h[i] is the ith leverage value, sometimes called the "hat diagonal".&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;While this is certainly true for Ordinary Least Squares Regression, I am not convinced it is true for a bilinear method such as Partial Least Squares Regression. I haven't worked through the math. But even if it is true, dividing by the value (1-h[i]) is not a normalization of any sort that forces the resulting value to be between 0 and 1, as the original questioner asked.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 11 Oct 2017 16:22:53 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/403240#M21033</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2017-10-11T16:22:53Z</dc:date>
    </item>
    <item>
      <title>Re: PROC PLS root mean PRESS calculation</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/403245#M21035</link>
      <description>&lt;P&gt;Upon reading the original message more carefully, the question was about root mean PRESS being between 0 and 1 and not PRESS being between 0 and 1. Nevertheless, the answer is the same, the root mean press does not have to be between 0 and 1, as the example at&amp;nbsp;&lt;A href="http://documentation.sas.com/?cdcId=pgmmvacdc&amp;amp;cdcVersion=9.4&amp;amp;docsetId=statug&amp;amp;docsetTarget=statug_pls_examples01.htm&amp;amp;locale=en" target="_blank"&gt;http://documentation.sas.com/?cdcId=pgmmvacdc&amp;amp;cdcVersion=9.4&amp;amp;docsetId=statug&amp;amp;docsetTarget=statug_pls_examples01.htm&amp;amp;locale=en&lt;/A&gt; shows (where I ahve added CV=ONE into the PROC PLS statement). So there is no normalization going on.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I believe that the zero to one interpretation of the Root Mean PRESS is to give you a cutoff showing that the PLS Model is fitting better than simply fitting the mean. A root mean PRESS of 1 indicates the model is&amp;nbsp;just as good a predictor as&amp;nbsp;fitting the mean, a number &amp;gt; 1 indicates the model is predicting worse than fitting the mean, and a number &amp;lt; 1 indicates that the model is predicting better than fitting the mean (which is what you want). A zero root mean PRESS indicates the model fits exactly.&lt;/P&gt;</description>
      <pubDate>Wed, 11 Oct 2017 16:45:38 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/403245#M21035</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2017-10-11T16:45:38Z</dc:date>
    </item>
    <item>
      <title>Re: PROC PLS root mean PRESS calculation</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/403261#M21036</link>
      <description>&lt;P&gt;Thanks for your replies.&lt;/P&gt;&lt;P&gt;Rick_SAS, I applied your code with the output from&amp;nbsp;my proc PLS model. The root mean PRESS from the (resid[i]/1-h)**2 formula gave the same output for each observation&amp;nbsp;as using the PRESS ouput option from the PROC PLS model and squaring that value. When I take the root mean PRESS (sqrt(sum(press)/count(press))&amp;nbsp;from these manually calculated values and&amp;nbsp;plot them against the RMPRESS table output by PROC PLS by default, it&amp;nbsp;would appear&amp;nbsp;values are&amp;nbsp;close to some&amp;nbsp;logarithmic transformation.&lt;span class="lia-inline-image-display-wrapper lia-image-align-center" image-alt="PRESS comparison.jpg" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/15789iD86216B8460011A5/image-size/large?v=v2&amp;amp;px=999" role="button" title="PRESS comparison.jpg" alt="PRESS comparison.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 11 Oct 2017 17:07:47 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/403261#M21036</guid>
      <dc:creator>jroberts1992</dc:creator>
      <dc:date>2017-10-11T17:07:47Z</dc:date>
    </item>
    <item>
      <title>Re: PROC PLS root mean PRESS calculation</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/403268#M21037</link>
      <description>&lt;P&gt;Perhaps ... because the formula given by Rick works for OLS (PROC REG), but not for PLS (PROC PLS). If I had the time, I would dig into this more.&lt;/P&gt;</description>
      <pubDate>Wed, 11 Oct 2017 17:27:26 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/403268#M21037</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2017-10-11T17:27:26Z</dc:date>
    </item>
    <item>
      <title>Re: PROC PLS root mean PRESS calculation</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/810860#M39937</link>
      <description>additional&lt;BR /&gt;&lt;BR /&gt;&lt;A href="https://support.sas.com/documentation/onlinedoc/stat/141/glmselect.pdf" target="_blank"&gt;https://support.sas.com/documentation/onlinedoc/stat/141/glmselect.pdf&lt;/A&gt; p.3892</description>
      <pubDate>Sun, 01 May 2022 02:56:46 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/PROC-PLS-root-mean-PRESS-calculation/m-p/810860#M39937</guid>
      <dc:creator>TET_34</dc:creator>
      <dc:date>2022-05-01T02:56:46Z</dc:date>
    </item>
  </channel>
</rss>

