<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Accounting for regression to the mean in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Accounting-for-regression-to-the-mean/m-p/126456#M6656</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Apologies if this is duplicate question, but I can't find anything similar using search. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have a data set containing a continuous variable, isotope GFR (which should remain stable) measured at time 0 and at 4 months.&amp;nbsp; There are three categorical outcome states assigned at four months (stable, improved, deteriorated).&amp;nbsp; I'm trying to consider differences in baseline variable between outcome groups. &lt;/P&gt;&lt;P&gt;Using scatter and Galton plots it appears there is a degree of regression to the mean in the measured variable.&amp;nbsp; My thought on accounting for this is to use ANCOVA as below.&amp;nbsp; However, my stats background is limited and I'd be very grateful for any comments as to the appropriateness of this method and / or advice on how to better handle this. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Many thanks&lt;/P&gt;&lt;P&gt;Jime&lt;/P&gt;&lt;P style="font-size: 10px; font-family: 'Courier New';"&gt;&lt;/P&gt;&lt;P style="font-size: 10px; font-family: 'Courier New'; color: #011993;"&gt;&lt;STRONG&gt;PROC&lt;/STRONG&gt;&lt;SPAN style="color: #000000;"&gt; &lt;/SPAN&gt;&lt;STRONG&gt;GLM&lt;/STRONG&gt;&lt;SPAN style="color: #000000;"&gt; &lt;/SPAN&gt;&lt;SPAN style="color: #0433ff;"&gt;DATA&lt;/SPAN&gt;&lt;SPAN style="color: #000000;"&gt;=Date;&lt;/SPAN&gt;&lt;/P&gt;&lt;P style="font-size: 10px; font-family: 'Courier New';"&gt;&amp;nbsp; &lt;SPAN style="color: #0433ff;"&gt;CLASS&lt;/SPAN&gt; RESPONSE;&lt;/P&gt;&lt;P style="font-size: 10px; font-family: 'Courier New';"&gt;&amp;nbsp; &lt;SPAN style="color: #0433ff;"&gt;MODEL&lt;/SPAN&gt; GFR_0 =RESPONSE (GFR_4-GFR_0)&amp;nbsp;&amp;nbsp; ;&lt;/P&gt;&lt;P style="font-size: 10px; font-family: 'Courier New';"&gt;&amp;nbsp; &lt;SPAN style="color: #0433ff;"&gt;LSMEANS&lt;/SPAN&gt; RESPONSE / &lt;SPAN style="color: #0433ff;"&gt;ADJUST&lt;/SPAN&gt;=TUKEY &lt;SPAN style="color: #0433ff;"&gt;PDIFF&lt;/SPAN&gt; ;&lt;/P&gt;&lt;P style="font-size: 10px; font-family: 'Courier New'; color: #011993;"&gt;&lt;STRONG&gt;RUN&lt;/STRONG&gt;&lt;SPAN style="color: #000000;"&gt;;&lt;/SPAN&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Wed, 01 May 2013 12:46:51 GMT</pubDate>
    <dc:creator>Jimbo</dc:creator>
    <dc:date>2013-05-01T12:46:51Z</dc:date>
    <item>
      <title>Accounting for regression to the mean</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Accounting-for-regression-to-the-mean/m-p/126456#M6656</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Apologies if this is duplicate question, but I can't find anything similar using search. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have a data set containing a continuous variable, isotope GFR (which should remain stable) measured at time 0 and at 4 months.&amp;nbsp; There are three categorical outcome states assigned at four months (stable, improved, deteriorated).&amp;nbsp; I'm trying to consider differences in baseline variable between outcome groups. &lt;/P&gt;&lt;P&gt;Using scatter and Galton plots it appears there is a degree of regression to the mean in the measured variable.&amp;nbsp; My thought on accounting for this is to use ANCOVA as below.&amp;nbsp; However, my stats background is limited and I'd be very grateful for any comments as to the appropriateness of this method and / or advice on how to better handle this. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Many thanks&lt;/P&gt;&lt;P&gt;Jime&lt;/P&gt;&lt;P style="font-size: 10px; font-family: 'Courier New';"&gt;&lt;/P&gt;&lt;P style="font-size: 10px; font-family: 'Courier New'; color: #011993;"&gt;&lt;STRONG&gt;PROC&lt;/STRONG&gt;&lt;SPAN style="color: #000000;"&gt; &lt;/SPAN&gt;&lt;STRONG&gt;GLM&lt;/STRONG&gt;&lt;SPAN style="color: #000000;"&gt; &lt;/SPAN&gt;&lt;SPAN style="color: #0433ff;"&gt;DATA&lt;/SPAN&gt;&lt;SPAN style="color: #000000;"&gt;=Date;&lt;/SPAN&gt;&lt;/P&gt;&lt;P style="font-size: 10px; font-family: 'Courier New';"&gt;&amp;nbsp; &lt;SPAN style="color: #0433ff;"&gt;CLASS&lt;/SPAN&gt; RESPONSE;&lt;/P&gt;&lt;P style="font-size: 10px; font-family: 'Courier New';"&gt;&amp;nbsp; &lt;SPAN style="color: #0433ff;"&gt;MODEL&lt;/SPAN&gt; GFR_0 =RESPONSE (GFR_4-GFR_0)&amp;nbsp;&amp;nbsp; ;&lt;/P&gt;&lt;P style="font-size: 10px; font-family: 'Courier New';"&gt;&amp;nbsp; &lt;SPAN style="color: #0433ff;"&gt;LSMEANS&lt;/SPAN&gt; RESPONSE / &lt;SPAN style="color: #0433ff;"&gt;ADJUST&lt;/SPAN&gt;=TUKEY &lt;SPAN style="color: #0433ff;"&gt;PDIFF&lt;/SPAN&gt; ;&lt;/P&gt;&lt;P style="font-size: 10px; font-family: 'Courier New'; color: #011993;"&gt;&lt;STRONG&gt;RUN&lt;/STRONG&gt;&lt;SPAN style="color: #000000;"&gt;;&lt;/SPAN&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 01 May 2013 12:46:51 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Accounting-for-regression-to-the-mean/m-p/126456#M6656</guid>
      <dc:creator>Jimbo</dc:creator>
      <dc:date>2013-05-01T12:46:51Z</dc:date>
    </item>
    <item>
      <title>Re: Accounting for regression to the mean</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Accounting-for-regression-to-the-mean/m-p/126457#M6657</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Try changing the model statement to:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;MODEL GFR_4=RESPONSE GFR_0;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;This will give the four month value as adjusted for the time 0 value.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;You should probably have a preliminary step, to check the homogeneity of slopes across the RESPONSE categories (see Milliken and Johnson's &lt;EM&gt;Analysis of Messy Data III: Analysis of Covariance&lt;/EM&gt;).&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;So first fit:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;MODEL GFR_4=RESPONSE GFR_0 GFR_0*RESPONSE;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;and check the significance of the interaction term.&amp;nbsp; If it is non-significant, then the MODEL statement I gave at first is appropriate.&amp;nbsp; If it is significant, then the differences need to be calculated at a minimum of three time zero values (low, median, high) using multiple LSMEANS statements and the AT= option (check the documentation on how to do this).&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Steve Denham&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 01 May 2013 13:01:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Accounting-for-regression-to-the-mean/m-p/126457#M6657</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2013-05-01T13:01:11Z</dc:date>
    </item>
    <item>
      <title>Re: Accounting for regression to the mean</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Accounting-for-regression-to-the-mean/m-p/126458#M6658</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Many thanks Steve!&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 01 May 2013 13:15:00 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Accounting-for-regression-to-the-mean/m-p/126458#M6658</guid>
      <dc:creator>Jimbo</dc:creator>
      <dc:date>2013-05-01T13:15:00Z</dc:date>
    </item>
  </channel>
</rss>

