BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
Omenski
Calcite | Level 5

How do I interpret test for normality results using SAS? I have ran the univariate for my data and obtained the results attached. From the probability which is the criteria for accepting or rejecting the normality test? Which of the three methods is best to use? Thanks

 

1 ACCEPTED SOLUTION

Accepted Solutions
PGStats
Opal | Level 21

There is no best method. Each test looks at different aspects of distributions. Thus the minimum p-value among the tests is the one to consider. It tells you about some aspect in witch your distribution differs the most from the normal distribution.

 

But please consider the pitfalls of normality testing explained here. Most specifically:

 

"If you want to test the normality assumptions for analysis of variance methods, beware of using a statistical test for normality alone. A test’s ability to reject the null hypothesis (known as the power of the test) increases with the sample size. As the sample size becomes larger, increasingly smaller departures from normality can be detected. Because small deviations from normality do not severely affect the validity of analysis of variance tests, it is important to examine other statistics and plots to make a final assessment of normality. The skewness and kurtosis measures and the plots that are provided by the PLOTS option, the HISTOGRAM statement, the PROBPLOT statement, and the QQPLOT statement can be very helpful. For small sample sizes, power is low for detecting larger departures from normality that might be important. To increase the test’s ability to detect such deviations, you might want to declare significance at higher levels, such as 0.15 or 0.20, rather than the often-used 0.05 level. Again, consulting plots and additional statistics can help you assess the severity of the deviations from normality."

PG

View solution in original post

5 REPLIES 5
PaigeMiller
Diamond | Level 26

Most of us will not download Microsoft Office documents, as they are a security threat. Please paste your results right into your reply.

--
Paige Miller
PGStats
Opal | Level 21

There is no best method. Each test looks at different aspects of distributions. Thus the minimum p-value among the tests is the one to consider. It tells you about some aspect in witch your distribution differs the most from the normal distribution.

 

But please consider the pitfalls of normality testing explained here. Most specifically:

 

"If you want to test the normality assumptions for analysis of variance methods, beware of using a statistical test for normality alone. A test’s ability to reject the null hypothesis (known as the power of the test) increases with the sample size. As the sample size becomes larger, increasingly smaller departures from normality can be detected. Because small deviations from normality do not severely affect the validity of analysis of variance tests, it is important to examine other statistics and plots to make a final assessment of normality. The skewness and kurtosis measures and the plots that are provided by the PLOTS option, the HISTOGRAM statement, the PROBPLOT statement, and the QQPLOT statement can be very helpful. For small sample sizes, power is low for detecting larger departures from normality that might be important. To increase the test’s ability to detect such deviations, you might want to declare significance at higher levels, such as 0.15 or 0.20, rather than the often-used 0.05 level. Again, consulting plots and additional statistics can help you assess the severity of the deviations from normality."

PG
Omenski
Calcite | Level 5

Thanks for the information highly appreciated

Rick_SAS
SAS Super FREQ

For almost all statistical tests, you should REJECT the null hypothesis when the p-value is smaller than your significance criterion (typically 0.05 or 0.01). The null hypothesis for these tests is that the observed data comes from a normal distribution (with an unknown mean and variance). 

 

So "small p-value" ==> the evidence does not support the hypothesis that the data are normal.

"Large p-value" ==> we cannot discount the hypothesis that data are normal.

 

As to which is better, that question has been asked in many papers and books. Often they give similar results. The Wikipedia article for these tests states some of the advantages/disadvantages of the tests. For example, start with the Anderson-Darling test.

 

 

Omenski
Calcite | Level 5

Thanks guidance highly appreciated

sas-innovate-2024.png

Available on demand!

Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.

 

Register now!

What is Bayesian Analysis?

Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 5 replies
  • 1147 views
  • 0 likes
  • 4 in conversation