<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Advantages of higher or lower K1 in PROC ROBUSTREG in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Advantages-of-higher-or-lower-K1-in-PROC-ROBUSTREG/m-p/612512#M29628</link>
    <description>&lt;P&gt;1. K1 does not affect the efficiency of the &lt;STRONG&gt;procedure&lt;/STRONG&gt;, it affects &lt;A href="https://en.wikipedia.org/wiki/Efficiency_(statistics)" target="_self"&gt;the efficiency of the &lt;STRONG&gt;estimator,&lt;/STRONG&gt;&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;We know that under the usual assumptions of linear regression that the least squares estimates of the betas are BLUE. The ROBUSTREG doc seems to be saying that the efficiency of the M estimator is a certain percentage of the OLS estimates when the scaling parameter k1 is properly chosen.&amp;nbsp; In other words, the M estimates for the betas have more variance (they have to), but not too much more.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;2. The k1 parameter simply scales the function used to penalize large residuals. For OLS, the penalty function is the quadratic function and we try to minimize the sum of the SQUARES of the residuals. For M estimation, we replace the quadratic function with a different function that caps the weights given to extreme residuals. The Tukey and Yohai functions are two choices. You minimize the sum of the "Tukey function" (or "Yohai function") of the residuals. The following graph compares the Tukey and Yohai functions to the quadratic function. For large residuals (large values of s), the penalty from Tukey or Yohai is much less than for the quadratic function that OLS uses.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data Rho;
b0 = 1.792; b1 = -0.972; b2 = 0.432; b3 = -0.052; b4 = 0.002;
do s = -5 to 5 by 0.1;
   k1 = 3.440;
   t = s / k1;
   if abs(s) &amp;lt;= k1 then
      Tukey = 3*t**2 - 3*t**4 + t**6;
   else Tukey=1;

   k1 = 0.868;
   t = s / k1;
   if abs(s) &amp;lt;= 2*k1 then
      Yohai = s**2/2;
   else if 2*k1 &amp;lt; abs(s) and abs(s) &amp;lt;= 3*k1 then
      Yohai = k1**2 * (b0+b1*t**2 + b2*t**4 + b3*t**6 + b4*t**8);
   else Yohai = 3.25*k1**2;

   Quadratic = s**2;
   if Quadratic &amp;gt; 3 then Quadratic=.;  /* cap the height of the quadratic function */
   output;
end;
run;

proc sgplot data=rho;
series x=s y=Tukey / curvelabel;
series x=s y=Yohai / curvelabel;
series x=s y=Quadratic / curvelabel;
xaxis label="Size of Residual";
yaxis label="Weight Given to Penalty Function";
run;&lt;/CODE&gt;&lt;/PRE&gt;</description>
    <pubDate>Tue, 17 Dec 2019 19:03:45 GMT</pubDate>
    <dc:creator>Rick_SAS</dc:creator>
    <dc:date>2019-12-17T19:03:45Z</dc:date>
    <item>
      <title>Advantages of higher or lower K1 in PROC ROBUSTREG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Advantages-of-higher-or-lower-K1-in-PROC-ROBUSTREG/m-p/612477#M29625</link>
      <description>&lt;P&gt;In SAS PROC ROBUSTREG you can set K1, which affects the efficiency of the procedure. But I didn't see anything in the documentation about exactly what "efficiency" means nor about the advantages of changing K1 from its default value.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any insights would be appreciated.&lt;/P&gt;</description>
      <pubDate>Tue, 17 Dec 2019 17:50:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Advantages-of-higher-or-lower-K1-in-PROC-ROBUSTREG/m-p/612477#M29625</guid>
      <dc:creator>plf515</dc:creator>
      <dc:date>2019-12-17T17:50:21Z</dc:date>
    </item>
    <item>
      <title>Re: Advantages of higher or lower K1 in PROC ROBUSTREG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Advantages-of-higher-or-lower-K1-in-PROC-ROBUSTREG/m-p/612512#M29628</link>
      <description>&lt;P&gt;1. K1 does not affect the efficiency of the &lt;STRONG&gt;procedure&lt;/STRONG&gt;, it affects &lt;A href="https://en.wikipedia.org/wiki/Efficiency_(statistics)" target="_self"&gt;the efficiency of the &lt;STRONG&gt;estimator,&lt;/STRONG&gt;&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;We know that under the usual assumptions of linear regression that the least squares estimates of the betas are BLUE. The ROBUSTREG doc seems to be saying that the efficiency of the M estimator is a certain percentage of the OLS estimates when the scaling parameter k1 is properly chosen.&amp;nbsp; In other words, the M estimates for the betas have more variance (they have to), but not too much more.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;2. The k1 parameter simply scales the function used to penalize large residuals. For OLS, the penalty function is the quadratic function and we try to minimize the sum of the SQUARES of the residuals. For M estimation, we replace the quadratic function with a different function that caps the weights given to extreme residuals. The Tukey and Yohai functions are two choices. You minimize the sum of the "Tukey function" (or "Yohai function") of the residuals. The following graph compares the Tukey and Yohai functions to the quadratic function. For large residuals (large values of s), the penalty from Tukey or Yohai is much less than for the quadratic function that OLS uses.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data Rho;
b0 = 1.792; b1 = -0.972; b2 = 0.432; b3 = -0.052; b4 = 0.002;
do s = -5 to 5 by 0.1;
   k1 = 3.440;
   t = s / k1;
   if abs(s) &amp;lt;= k1 then
      Tukey = 3*t**2 - 3*t**4 + t**6;
   else Tukey=1;

   k1 = 0.868;
   t = s / k1;
   if abs(s) &amp;lt;= 2*k1 then
      Yohai = s**2/2;
   else if 2*k1 &amp;lt; abs(s) and abs(s) &amp;lt;= 3*k1 then
      Yohai = k1**2 * (b0+b1*t**2 + b2*t**4 + b3*t**6 + b4*t**8);
   else Yohai = 3.25*k1**2;

   Quadratic = s**2;
   if Quadratic &amp;gt; 3 then Quadratic=.;  /* cap the height of the quadratic function */
   output;
end;
run;

proc sgplot data=rho;
series x=s y=Tukey / curvelabel;
series x=s y=Yohai / curvelabel;
series x=s y=Quadratic / curvelabel;
xaxis label="Size of Residual";
yaxis label="Weight Given to Penalty Function";
run;&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Tue, 17 Dec 2019 19:03:45 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Advantages-of-higher-or-lower-K1-in-PROC-ROBUSTREG/m-p/612512#M29628</guid>
      <dc:creator>Rick_SAS</dc:creator>
      <dc:date>2019-12-17T19:03:45Z</dc:date>
    </item>
  </channel>
</rss>

