<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Huge Sample Size and Wilcoxon 2 Sampled Test in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Huge-Sample-Size-and-Wilcoxon-2-Sampled-Test/m-p/68612#M3339</link>
    <description>What you are describing is the difference between "statistical significance" and "practical significance".  The computation of a p-value is based on a formula that includes a function of the sample size in the denominator and a function of the "effect size" (median difference, here) in the numerator.  Therefore, for any given effect size, you get a smaller and smaller p-value just by increasing the sample size.&lt;BR /&gt;
&lt;BR /&gt;
When the sample size is really large (like yours), you have to pretty much ignore the p-value as an "index of importance" and just look at the effect size itself.&lt;BR /&gt;
&lt;BR /&gt;
BTW, the Wilcoxon is a rather poor test if the dependent variable is binary, it loses sensitivity when there are lots of ties.  You are probably better with a chi-squared test of the difference in attrition proportion across market segments.</description>
    <pubDate>Tue, 01 Sep 2009 18:43:25 GMT</pubDate>
    <dc:creator>Doc_Duke</dc:creator>
    <dc:date>2009-09-01T18:43:25Z</dc:date>
    <item>
      <title>Huge Sample Size and Wilcoxon 2 Sampled Test</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Huge-Sample-Size-and-Wilcoxon-2-Sampled-Test/m-p/68610#M3337</link>
      <description>Hello,&lt;BR /&gt;
&lt;BR /&gt;
I am looking at attrition differences across market segments (attrition is a binary variable, 0 they left, 1 they didn't). All my my results are highly significant - even when I look at differences between two market segments that I know, from past analysis, have minimal differences in attrition. I am wondering if my huge sample size (8,000,000) is overwhelming the test. When I limit the data to a sample of 10,000, the results are more in line with what I have seen in the past, but I only want to limit the sample size if it is more accurate, not because it is giving me the results I want.&lt;BR /&gt;
&lt;BR /&gt;
&lt;BR /&gt;
My code:&lt;BR /&gt;
&lt;BR /&gt;
PROC NPAR1WAY WILCOXON DATA = WORK.ATTR_TEMP_SS_T2 ;&lt;BR /&gt;
CLASS segment1;&lt;BR /&gt;
VAR att_60day;&lt;BR /&gt;
RUN;</description>
      <pubDate>Tue, 01 Sep 2009 15:16:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Huge-Sample-Size-and-Wilcoxon-2-Sampled-Test/m-p/68610#M3337</guid>
      <dc:creator>deleted_user</dc:creator>
      <dc:date>2009-09-01T15:16:18Z</dc:date>
    </item>
    <item>
      <title>Re: Huge Sample Size and Wilcoxon 2 Sampled Test</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Huge-Sample-Size-and-Wilcoxon-2-Sampled-Test/m-p/68611#M3338</link>
      <description>I'm not a statistician, just a user of statistics, but:&lt;BR /&gt;
1. Are the proportions very different with the smaller and larger samples?&lt;BR /&gt;
2. Is this an inference issue rather than a statistics issue per se? I recently was reading Nakagawa and Cuthill (below); they make the point (and I've seen other versions of this illustration) that, if one could measure the lengths of wings of all the birds in a population, the difference in mean wing length between sexes would almost certainly be statistically significant. Whether this would mean something to biologists is another question.&lt;BR /&gt;
&lt;BR /&gt;
&lt;BR /&gt;
Nakagawa, S., and I. C. Cuthill. 2007. Effect size, confidence interval and statistical significance: A practical guide for biologists. Biological Reviews 82: 591-605.</description>
      <pubDate>Tue, 01 Sep 2009 18:40:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Huge-Sample-Size-and-Wilcoxon-2-Sampled-Test/m-p/68611#M3338</guid>
      <dc:creator>ChuckBV</dc:creator>
      <dc:date>2009-09-01T18:40:55Z</dc:date>
    </item>
    <item>
      <title>Re: Huge Sample Size and Wilcoxon 2 Sampled Test</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Huge-Sample-Size-and-Wilcoxon-2-Sampled-Test/m-p/68612#M3339</link>
      <description>What you are describing is the difference between "statistical significance" and "practical significance".  The computation of a p-value is based on a formula that includes a function of the sample size in the denominator and a function of the "effect size" (median difference, here) in the numerator.  Therefore, for any given effect size, you get a smaller and smaller p-value just by increasing the sample size.&lt;BR /&gt;
&lt;BR /&gt;
When the sample size is really large (like yours), you have to pretty much ignore the p-value as an "index of importance" and just look at the effect size itself.&lt;BR /&gt;
&lt;BR /&gt;
BTW, the Wilcoxon is a rather poor test if the dependent variable is binary, it loses sensitivity when there are lots of ties.  You are probably better with a chi-squared test of the difference in attrition proportion across market segments.</description>
      <pubDate>Tue, 01 Sep 2009 18:43:25 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Huge-Sample-Size-and-Wilcoxon-2-Sampled-Test/m-p/68612#M3339</guid>
      <dc:creator>Doc_Duke</dc:creator>
      <dc:date>2009-09-01T18:43:25Z</dc:date>
    </item>
  </channel>
</rss>

