<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Proc univariate normality test and Shapiro-Wilk test in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Proc-univariate-normality-test-and-Shapiro-Wilk-test/m-p/678746#M204909</link>
    <description>&lt;P&gt;If you read this link &lt;A href="https://en.wikipedia.org/wiki/Shapiro%E2%80%93Wilk_test" target="_blank"&gt;https://en.wikipedia.org/wiki/Shapiro%E2%80%93Wilk_test&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;you will find that some folks have extended the test to 5000 records but apparently that has not been added to SAS yet. I suspect a big issue is determining the critical points of the test statistic. The phrase from that link&amp;nbsp; &lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;The cutoff values for the statistics are calculated through Monte-Carlo simulations.&lt;SUP id="cite_ref-mit_2-1" class="reference"&gt;&lt;/SUP&gt;&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;is an indicator to me that it is NOT a trivial exercise to expand sample sizes.&lt;/P&gt;
&lt;P&gt;You do not indicate how big your sample might be. If it is greater than 5000, it looks like you would get your name attached if you work this out.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Short answer: Ain't gonna happen in SAS Proc Univariate.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Sun, 23 Aug 2020 13:01:16 GMT</pubDate>
    <dc:creator>ballardw</dc:creator>
    <dc:date>2020-08-23T13:01:16Z</dc:date>
    <item>
      <title>Proc univariate normality test and Shapiro-Wilk test</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Proc-univariate-normality-test-and-Shapiro-Wilk-test/m-p/678742#M204907</link>
      <description>&lt;P&gt;Dear all&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am having trouble with obtaining a normality test result using the Shapiro-Wilk (SW) test.&lt;/P&gt;
&lt;P&gt;Usually, I have used the &lt;EM&gt;Univariate&lt;/EM&gt; procedure with &lt;EM&gt;normal&lt;/EM&gt; or &lt;EM&gt;normaltest&lt;/EM&gt; options and was able to easily get normality test results for all four tests.&lt;/P&gt;
&lt;P&gt;e.g.)&lt;/P&gt;
&lt;P&gt;Proc univariate data=work.have normal;&lt;/P&gt;
&lt;P&gt;var series;&lt;/P&gt;
&lt;P&gt;run;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Now, I'm dealing with a larger dataset that apparently has more than 2000 observations, and SAS no longer shows the SW test results.&lt;/P&gt;
&lt;P&gt;All other test statistics are fine; only the SW result is omitted.&lt;/P&gt;
&lt;P&gt;I read that SAS automatically omits the SW test when the sample size is greater than 2000.&lt;/P&gt;
&lt;P&gt;But I still want to see the SW test results.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;So my question is: is there any other way to request the SW test for normality on a dataset that contains more than 2000 observations?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;My dataset is shown as below.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;data work.have;
   input date: yymmddn8. series;
   format date yymmddn8.;
   /* data lines take no trailing semicolons; a lone semicolon ends the block */
   datalines;
   20170101 501
   20170102 500.5
   20170103 505
   ...
   20171231 512
;
run;&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I'd really appreciate it if anyone could help me get this test result.&lt;/P&gt;
&lt;P&gt;I hope my description was clear enough to understand.&lt;/P&gt;
&lt;P&gt;If it was not, please leave a note and ask.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks in advance.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 23 Aug 2020 12:23:28 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Proc-univariate-normality-test-and-Shapiro-Wilk-test/m-p/678742#M204907</guid>
      <dc:creator>LzEr23</dc:creator>
      <dc:date>2020-08-23T12:23:28Z</dc:date>
    </item>
    <item>
      <title>Re: Proc univariate normality test and Shapiro-Wilk test</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Proc-univariate-normality-test-and-Shapiro-Wilk-test/m-p/678744#M204908</link>
      <description>&lt;P&gt;The &lt;A href="https://documentation.sas.com/?docsetId=procstat&amp;amp;docsetVersion=9.4&amp;amp;docsetTarget=procstat_univariate_details53.htm&amp;amp;locale=en" target="_self"&gt;documentation&lt;/A&gt; is pretty clear&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;DIV class="xisDoc-title"&gt;
&lt;DIV&gt;
&lt;DIV&gt;
&lt;BLOCKQUOTE&gt;
&lt;H4 class="xisDoc-title"&gt;Shapiro-Wilk Statistic&lt;/H4&gt;
&lt;/BLOCKQUOTE&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;BLOCKQUOTE&gt;
&lt;P class="xisDoc-paragraph"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="xisDoc-paragraph"&gt;If the sample size is less than or equal to 2000 and you specify the NORMAL option, PROC UNIVARIATE computes the Shapiro-Wilk statistic, &lt;SPAN class=" aa-mathtext"&gt;W&lt;/SPAN&gt; (also denoted as &lt;SPAN class=" aa-mathtext"&gt;W&lt;SUB&gt;n&lt;/SUB&gt;&lt;/SPAN&gt; to emphasize its dependence on the sample size &lt;SPAN class=" aa-mathtext"&gt;n&lt;/SPAN&gt;).&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P class="xisDoc-paragraph"&gt;&amp;nbsp;&lt;/P&gt;
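&lt;P&gt;A rough sketch of that limit in practice, assuming simulated data (the data set work.sim and its 3000 values are made up for illustration, not taken from the question): with the NORMAL option and more than 2000 observations, the tests-for-normality table should list the other tests without a Shapiro-Wilk row.&lt;/P&gt;
&lt;LI-CODE lang="sas"&gt;/* Hypothetical test data: 3000 simulated normal values (more than 2000) */
data work.sim;
   call streaminit(2020);
   do i = 1 to 3000;
      series = rand('Normal', 500, 5);
      output;
   end;
   drop i;
run;

/* Print only the normality-test table from PROC UNIVARIATE */
ods select TestsForNormality;
proc univariate data=work.sim normal;
   var series;
run;&lt;/LI-CODE&gt;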
&lt;P class="xisDoc-paragraph"&gt;So you can't get this statistic calculated for sample sizes above 2000. Other tests of normality should be used with sample sizes above 2000.&lt;/P&gt;</description>
      <pubDate>Sun, 23 Aug 2020 12:58:16 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Proc-univariate-normality-test-and-Shapiro-Wilk-test/m-p/678744#M204908</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2020-08-23T12:58:16Z</dc:date>
    </item>
    <item>
      <title>Re: Proc univariate normality test and Shapiro-Wilk test</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Proc-univariate-normality-test-and-Shapiro-Wilk-test/m-p/678746#M204909</link>
      <description>&lt;P&gt;If you read this link &lt;A href="https://en.wikipedia.org/wiki/Shapiro%E2%80%93Wilk_test" target="_blank"&gt;https://en.wikipedia.org/wiki/Shapiro%E2%80%93Wilk_test&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;you will find that some folks have extended the test to 5000 records but apparently that has not been added to SAS yet. I suspect a big issue is determining the critical points of the test statistic. The phrase from that link&amp;nbsp; &lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;The cutoff values for the statistics are calculated through Monte-Carlo simulations.&lt;SUP id="cite_ref-mit_2-1" class="reference"&gt;&lt;/SUP&gt;&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;is an indicator to me that it is NOT a trivial exercise to expand sample sizes.&lt;/P&gt;
&lt;P&gt;You do not indicate how big your sample might be. If it is greater than 5000, it looks like you would get your name attached if you work this out.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Short answer: Ain't gonna happen in SAS Proc Univariate.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 23 Aug 2020 13:01:16 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Proc-univariate-normality-test-and-Shapiro-Wilk-test/m-p/678746#M204909</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2020-08-23T13:01:16Z</dc:date>
    </item>
    <item>
      <title>Re: Proc univariate normality test and Shapiro-Wilk test</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Proc-univariate-normality-test-and-Shapiro-Wilk-test/m-p/678857#M204969</link>
      <description>&lt;P&gt;Maybe&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13684"&gt;@Rick_SAS&lt;/a&gt;&amp;nbsp;might write a blog post (with IML code) on this question?&lt;/P&gt;</description>
      <pubDate>Mon, 24 Aug 2020 11:54:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Proc-univariate-normality-test-and-Shapiro-Wilk-test/m-p/678857#M204969</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2020-08-24T11:54:18Z</dc:date>
    </item>
  </channel>
</rss>

