BookmarkSubscribeRSS Feed
131_04
Calcite | Level 5

Given two identical set of data A and B with the largest value of data set B being 3 times greater than the largest value of data set A. How are the median, means, standard deviation and box-and-whisker plots of the two data set compare?

2 REPLIES 2
PaigeMiller
Diamond | Level 26

Since this sounds like a homework assignment, perhaps you could try answering these questions (and give your reasons) and then let us know, we can help you if your answers aren't correct.

--
Paige Miller
ballardw
Super User

Your basic phrasing of the question is internally inconsistent: two identical set of data A and B means that there are no differences. If you mean similar data structure, number, name and types of variables then state so.

 

By "largest value" do you mean a single difference for exactly one variable in one set with all other values for all variables the same? Then the answer about median can't be answered without exact data. If the single value replaced by the largest is the largest (not stated, you only say the largest is 3 times greater not that it replaced the other) for the corresponding variable in the data there would not be any change in median (order of the value is important), if some other value is replaced then there could be a difference in medians based on your specific rules for tie breaking.

 

Replacing a single value would effect none of the other variables at all.

 

If your data set is "large", which might depend on the actual range of values in the sets, the difference in mean or standard deviation might not be easily detectable.

 

Suggestion: Take a data set and modify a copy then summarize the two sets. Repeat with data sets with many more records.

SAS Innovate 2025: Call for Content

Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!

Submit your idea!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 460 views
  • 2 likes
  • 3 in conversation