<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Score test and Wald test show widely discrepant results with sandwich estimator in PHREG in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Score-test-and-Wald-test-show-widely-discrepant-results-with/m-p/214448#M11592</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks.&amp;nbsp; Now I definitely vote for "problematic patterns in the data that could be responsible for the widely divergent test statistics above."&amp;nbsp; It could be that the partial likelihood for some clusters is such that the martingale residual under TIES=EFRON is quite large.&amp;nbsp; I don't really have a good work around--the first is to look at the values under TIES=BRESLOW, but I bet they show the same pattern.&amp;nbsp; You may have to really dig into the responses in each cluster and the metadata for the clusters to see whether clusters can be consolidated (or removed, although that seems extreme).&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If there are structural reasons that not all categories are present in all clusters, what about separating into 2 (or maybe more analyses) by "super-clusters" that have common categories?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Steve Denham&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Fri, 31 Jul 2015 14:00:09 GMT</pubDate>
    <dc:creator>SteveDenham</dc:creator>
    <dc:date>2015-07-31T14:00:09Z</dc:date>
    <item>
      <title>Score test and Wald test show widely discrepant results with sandwich estimator in PHREG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Score-test-and-Wald-test-show-widely-discrepant-results-with/m-p/214445#M11589</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I have time to event data with clustered observations, so I am using proc phreg like so:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;proc phreg data = xxx covs(aggregate);&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; by byvar;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; class cluster category;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; model month * status(0) = pred cluster category / ties = efron;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; id cluster;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;My problem is that when I run this model, I get this output:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="margin-bottom: .0001pt;"&gt;&lt;SPAN style="font-size: 10pt; font-family: terminal, monaco;"&gt;Testing Global Null Hypothesis: BETA=0&lt;/SPAN&gt;&lt;/P&gt;&lt;P style="margin-bottom: .0001pt;"&gt;&lt;SPAN style="font-size: 10pt; font-family: terminal, monaco;"&gt;Test&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Chi-Square&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; DF Pr &amp;gt; ChiSq&lt;/SPAN&gt;&lt;/P&gt;&lt;P style="margin-bottom: .0001pt;"&gt;&lt;SPAN style="font-size: 10pt; font-family: terminal, monaco;"&gt; Likelihood Ratio&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 214.2638&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 32&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;lt;.0001&lt;/SPAN&gt;&lt;/P&gt;&lt;P style="margin-bottom: .0001pt;"&gt;&lt;SPAN style="font-size: 10pt; font-family: terminal, monaco;"&gt;Score (Model-Based)&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 375.3325&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 32&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;lt;.0001&lt;/SPAN&gt;&lt;/P&gt;&lt;P style="margin-bottom: .0001pt;"&gt;&lt;SPAN style="font-size: 10pt; font-family: terminal, monaco;"&gt;Score (Sandwich)&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 21.0000&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 21&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.4589&lt;/SPAN&gt;&lt;/P&gt;&lt;P style="margin-bottom: .0001pt;"&gt;&lt;SPAN style="font-size: 10pt; font-family: terminal, monaco;"&gt;Wald (Model-Based)&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 231.4213&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 32&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;lt;.0001&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-size: 10pt; font-family: terminal, monaco;"&gt;Wald (Sandwich)&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 1.13909E11&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 21&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;lt;.0001&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The Wald(Sandwich) Chi-Square is huge and significant; the Score(Sandwich) is small and not anywhere near significant. Is it possible there's something wrong with the Score(Sandwich)? Or the Wald(Sandwich)? &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Can anybody help with interpretation here?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;By the way, I initially had problems with this model getting a divide-by-zero error for one of the two bygroups when I used "/ ties = exact". I switched to "/ ties = efron", which does not give me problems. Still, I wonder if this means I have problematic patterns in the data that could be responsible for the widely divergent test statistics above.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Also, FWIW, I was wondering if this discrepancy had anything to do with the &lt;SPAN style="font-size: 10pt; line-height: 1.5em;"&gt;inclusion of the cluster variable (which has roughly n = 20 categories) in the model. Indeed, removing the variable from the model statement substantially reduces the size of the Wald(Sandwich) chi-square (which remains significant) while cutting the p-value of the Score(Sandwich) by about 75% (which leaves it still non-significant).&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Answers, suggestions, and questions all welcome.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 30 Jul 2015 18:01:28 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Score-test-and-Wald-test-show-widely-discrepant-results-with/m-p/214445#M11589</guid>
      <dc:creator>MichaelLichter</dc:creator>
      <dc:date>2015-07-30T18:01:28Z</dc:date>
    </item>
    <item>
      <title>Re: Score test and Wald test show widely discrepant results with sandwich estimator in PHREG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Score-test-and-Wald-test-show-widely-discrepant-results-with/m-p/214446#M11590</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;By any chance, do some values of category not appear in all clusters?&amp;nbsp; That would at least explain what is going on when you drop the cluster variable from the model.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Steve Denham&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 30 Jul 2015 19:48:27 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Score-test-and-Wald-test-show-widely-discrepant-results-with/m-p/214446#M11590</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2015-07-30T19:48:27Z</dc:date>
    </item>
    <item>
      <title>Re: Score test and Wald test show widely discrepant results with sandwich estimator in PHREG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Score-test-and-Wald-test-show-widely-discrepant-results-with/m-p/214447#M11591</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Steve, that is correct. For legitimate reasons, not all categories were present in all clusters.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 30 Jul 2015 20:13:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Score-test-and-Wald-test-show-widely-discrepant-results-with/m-p/214447#M11591</guid>
      <dc:creator>MichaelLichter</dc:creator>
      <dc:date>2015-07-30T20:13:35Z</dc:date>
    </item>
    <item>
      <title>Re: Score test and Wald test show widely discrepant results with sandwich estimator in PHREG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Score-test-and-Wald-test-show-widely-discrepant-results-with/m-p/214448#M11592</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks.&amp;nbsp; Now I definitely vote for "problematic patterns in the data that could be responsible for the widely divergent test statistics above."&amp;nbsp; It could be that the partial likelihood for some clusters is such that the martingale residual under TIES=EFRON is quite large.&amp;nbsp; I don't really have a good work around--the first is to look at the values under TIES=BRESLOW, but I bet they show the same pattern.&amp;nbsp; You may have to really dig into the responses in each cluster and the metadata for the clusters to see whether clusters can be consolidated (or removed, although that seems extreme).&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If there are structural reasons that not all categories are present in all clusters, what about separating into 2 (or maybe more analyses) by "super-clusters" that have common categories?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Steve Denham&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 31 Jul 2015 14:00:09 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Score-test-and-Wald-test-show-widely-discrepant-results-with/m-p/214448#M11592</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2015-07-31T14:00:09Z</dc:date>
    </item>
    <item>
      <title>Re: Score test and Wald test show widely discrepant results with sandwich estimator in PHREG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Score-test-and-Wald-test-show-widely-discrepant-results-with/m-p/214449#M11593</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks, Steve. TIES=BRESLOW produces the same results. I haven't yet had time to look at consolidating similar clusters, but that is worth looking into.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 03 Aug 2015 21:20:25 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Score-test-and-Wald-test-show-widely-discrepant-results-with/m-p/214449#M11593</guid>
      <dc:creator>MichaelLichter</dc:creator>
      <dc:date>2015-08-03T21:20:25Z</dc:date>
    </item>
    <item>
      <title>Re: Score test and Wald test show widely discrepant results with sandwich estimator in PHREG</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Score-test-and-Wald-test-show-widely-discrepant-results-with/m-p/214450#M11594</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;The key will be to consolidate based on metadata, not on the design or response data.&amp;nbsp; Otherwise, you just end up with fewer but larger clusters with the same problem.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Steve Denham&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 04 Aug 2015 12:47:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Score-test-and-Wald-test-show-widely-discrepant-results-with/m-p/214450#M11594</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2015-08-04T12:47:54Z</dc:date>
    </item>
  </channel>
</rss>

