dustychair
Pyrite | Level 9

Hi all,

I have a question. I have a data set like the one below. This is just an example; my actual sample is large. The first group has 150,000 people and the second group has 15,000 people. I tested the variance of the scores of the two groups with Levene's test and found that the variances are different. I would like to select a sample from each group such that the variances of the scores are similar. With those samples, I would also like to keep the ratio of sample sizes similar to the original. For example, the first group would have 12,000 people and the other 1,200: the sample sizes differ from the original, but their ratio is the same. The point is to keep a similar sample size ratio. I have no idea how to do that. Any help is appreciated.

student_id group score
110 1 430
111 1 622
112 2 530
113 2 532
114 1 410

4 REPLIES
PaigeMiller
Diamond | Level 26

I have a question and I have a comment.

The question is: what statistic(s) do you want to compute from this sample?

My comment is with regard to your desire to have a sample of 12000 from one group and 1200 from the other group. I don't know why you insist on this, but depending on your answer to my question above, samples of 12000 are rarely worth the effort from a statistical point of view. The extra information you get is not linear in the sample size; there is a point of diminishing returns where you get little benefit from (for example) going beyond a sample size of 2000.
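
To illustrate the diminishing returns, here is a minimal sketch that assumes the statistic of interest is a simple mean, whose standard error shrinks as 1/sqrt(n). The standard deviation of 100 is a made-up value for illustration and is not taken from the data in this thread; other statistics may scale differently.

```python
import math

def standard_error(sd, n):
    """Standard error of a sample mean, given standard deviation sd and sample size n."""
    return sd / math.sqrt(n)

ASSUMED_SD = 100.0  # hypothetical score standard deviation, for illustration only

# Compare the standard error at a few sample sizes.
for n in (500, 2000, 12000):
    print(n, round(standard_error(ASSUMED_SD, n), 2))
```

Under these assumptions, going from 2000 to 12000 multiplies the sample size by six but shrinks the standard error by a factor of only sqrt(6), roughly 2.4.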

--
Paige Miller
dustychair
Pyrite | Level 9

Hi @PaigeMiller,

I am going to do a DIF (Differential Item Functioning) analysis once I have two groups with similar score variances. Sample size is not a big deal. It can be 2000, but it shouldn't be less than 1000, since some DIF methods require a large sample size. My point is that the ratio of the two sample sizes should be similar to the original one.
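
The ratio requirement on its own can be sketched as proportional stratified sampling. This is only a minimal illustration: the function name `proportional_sample` and the stand-in data are hypothetical, and simple random sampling preserves each group's variance in expectation, so it covers the ratio requirement but does not by itself make the two variances similar.

```python
import random

def proportional_sample(groups, total_n, seed=0):
    """Draw a simple random sample from each group, keeping the original size ratio.

    groups:  dict mapping group label -> list of records
    total_n: desired combined sample size
    """
    rng = random.Random(seed)
    population = sum(len(rows) for rows in groups.values())
    sample = {}
    for label, rows in groups.items():
        # Allocate the sample in proportion to the group's share of the population.
        k = round(total_n * len(rows) / population)
        sample[label] = rng.sample(rows, k)
    return sample

# Hypothetical stand-in records with the 150,000 : 15,000 split from the question.
groups = {1: list(range(150_000)), 2: list(range(15_000))}
sample = proportional_sample(groups, total_n=13_200)
print({label: len(rows) for label, rows in sample.items()})
```

With a combined sample size of 13,200, this allocation gives 12,000 records from group 1 and 1,200 from group 2, matching the 10:1 ratio of the original groups.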

 

Thank you

PaigeMiller
Diamond | Level 26

Okay, thank you. I am not familiar with DIF so I can't comment further.

--
Paige Miller
