haoduonge
Quartz | Level 8

Hi all,

I have a very basic analysis (t-test) and need your comments on it.

One assumption of a t-test is that "the dependent variable should be normally distributed for each category of the independent variable". But also, "it is quite 'robust' to violations of normality, meaning that this assumption can be a little violated and still provide valid results".

Therefore, I decided to use the rank-sum test (non-parametric) only when both groups (of the independent variable) are not normally distributed. However, when one group has a small N, that group usually looks normally distributed.

For example, with two groups of 100, both are not normally distributed.

But with one group of 180 and one group of 20, the group of 20 looks normally distributed for almost all of the variables examined.

I really appreciate your advice on this basic issue.

Thanks!

Hao

 

https://statistics.laerd.com/stata-tutorials/independent-t-test-using-stata.php

 

1 ACCEPTED SOLUTION

PaigeMiller
Diamond | Level 26

Using the Central Limit Theorem, as your N increases, the distribution of the mean approaches a normal distribution. So, in my work, we usually have large N values, and so I don't worry about it. I go ahead and use the t-test.
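To see the Central Limit Theorem at work, here is a minimal simulation sketch. The exponential distribution, the sample size of 50, the 2000 replicates, the seed, and the data set names are all assumptions chosen for illustration, not anything from the original question.

```sas
/* Draw 2000 samples of size 50 from a skewed (exponential) distribution. */
data work.sim;
  call streaminit(12345);          /* arbitrary seed, for reproducibility */
  do rep = 1 to 2000;
    do i = 1 to 50;
      x = rand('EXPONENTIAL');
      output;
    end;
  end;
run;

/* Compute the mean of each sample (data are already ordered by rep). */
proc means data=work.sim noprint;
  by rep;
  var x;
  output out=work.sample_means mean=xbar;
run;

/* Inspect the distribution of the sample means against a normal curve. */
proc univariate data=work.sample_means;
  var xbar;
  histogram xbar / normal;
  qqplot xbar / normal(mu=est sigma=est);
run;
```

Changing the inner loop bound (for example, to 5 or 20) shows how much skewness remains in the distribution of the mean at smaller N.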

 

Naturally, if your N is low (let's say < 50), you might want to use a non-parametric test such as those found in PROC NPAR1WAY or PROC UNIVARIATE.
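As a rough sketch of what that could look like, the code below uses SASHELP.CLASS (19 observations, grouped by SEX) purely as a stand-in for a small data set; your own data set, CLASS variable, and analysis variable would replace it.

```sas
/* Per-group normality tests and Q-Q plots. With small groups these    */
/* tests have little power, so a non-significant result is weak        */
/* evidence of normality.                                              */
proc univariate data=sashelp.class normal;
  class sex;
  var height;
  qqplot height / normal(mu=est sigma=est);
run;

/* Wilcoxon rank-sum test; EXACT requests an exact p-value, which is   */
/* feasible here because the sample is small.                          */
proc npar1way data=sashelp.class wilcoxon;
  class sex;
  var height;
  exact wilcoxon;
run;
```

The EXACT statement can be slow for larger samples, in which case the default asymptotic p-value is the usual choice.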

--
Paige Miller


4 REPLIES
PaigeMiller
Diamond | Level 26

Many people, including me, will not click on links to "unknown" web sites. I suggest whatever it is you want us to see, copy and paste it into a message here.

 

Also, it's not clear to me what your question is, and it doesn't really appear that you have actually asked a question.

--
Paige Miller
haoduonge
Quartz | Level 8

You don't need to click on that link; it is just basic background on the normality assumption for a t-test.

Even though the t-test assumes normally distributed data, it is quite "robust" to violations of the normality assumption.

My question is:

Do you use a non-parametric test (e.g., the rank-sum test) whenever the assumption of normality is violated, or are you lenient about it? If so, at what level?

Hao

 

Ksharp
Super User

"Do you use non-parametric test (eg, rank-sum test) in any case the assumption of normality is violated"

Yes. I would also try the Wilcoxon test and compare the two results to see if they agree. If not, I would rather trust the Wilcoxon test.
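A minimal sketch of running both tests on the same data, again assuming SASHELP.CLASS as placeholder data (SEX as the grouping variable, HEIGHT as the response):

```sas
/* Two-sample t-test: the output includes both the pooled and the      */
/* Satterthwaite (unequal variance) versions.                          */
proc ttest data=sashelp.class;
  class sex;
  var height;
run;

/* Wilcoxon rank-sum test on the same groups and variable. */
proc npar1way data=sashelp.class wilcoxon;
  class sex;
  var height;
run;
```

If the two p-values lead to the same conclusion, the choice of test matters little for that variable; if they disagree, that is a signal to look more closely at skewness or outliers.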

Or maybe @Rick_SAS would like to comment.
