Comparing Two Populations: Center, Variability, and Informal Inference
Show all work for MAD and IQR computations. When writing inference statements, include center values, variability measures, and hedging language.
Recall / Warm-Up
A data set is: 4, 5, 6, 6, 7, 7, 8, 9, 42. Which measure of center best represents a typical value in this data set?
You decide to use the median to describe the center of a data set. Which measure of variability is the best match for your choice?
Compute the IQR of this data set: 3, 4, 5, 6, 7, 8, 9.
Fluency
Compute the mean of the following data set: 3, 4, 4, 5, 5, 5, 6, 6, 7, 8.
Data set: 4, 5, 6, 6, 7, 7, 8, 9, 10, 13. The mean of this data set is 7.5. Compute the MAD.
A data set has one extremely large outlier. Which measure of variability is more resistant to the outlier's effect?
Compute the IQR of the following data set: 4, 5, 6, 6, 7, 7, 8, 9, 10, 13.
A random sample of word lengths from two science textbooks produced these results: 4th-grade — median = 5 letters, IQR = 2 letters; 7th-grade — median = 7 letters, IQR = 3 letters. Which statement best compares the two samples?
Varied Practice
A researcher compares two tutoring programs. Program A: mean score = 71. Program B: mean score = 83. A student writes: 'Program B is better because it has a higher mean.' What is the most important element missing from this comparison?
The dot plots show two versions of the same data set — one with an outlier included and one with the outlier removed. A triangle (▼) marks the mean and a vertical line (|) marks the median in each plot. What do the dot plots show about how the mean and median respond to the outlier?
Data set: 5, 6, 7, 8, 8, 9, 10. The median is ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ and the IQR is ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ .
Two data sets are compared: Group A has IQR = 9 and Group B has IQR = 2. A student concludes: 'Group B's data is more reliable because it has a smaller IQR.' What error is the student making?
Word Problems
A reading researcher randomly samples 10 words from a science chapter in a 4th-grade textbook and 10 words from a science chapter in a 7th-grade textbook. The word lengths (in letters) are:
4th-grade: 3, 4, 4, 5, 5, 5, 6, 6, 7, 8
7th-grade median = 7.0 letters, IQR = 3 letters.
Compute the median and IQR for the 4th-grade sample. The median is ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ and the IQR is ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ .
Using the 4th-grade statistics you computed and the 7th-grade statistics given (median = 7 letters, IQR = 3 letters), write a complete informal comparative inference statement. Your statement must include: (1) the names of both populations, (2) the direction of the difference, (3) actual center values, (4) reference to variability, and (5) hedging language.
A student records the daily study hours of 8 classmates: 2, 3, 3, 4, 4, 5, 5, 6. Compute the MAD.
A survey sampled 10 students from each of two after-school programs and asked how many hours per day they spend on screens. Results:
- Program A: mean daily screen time = 3 hours, MAD = 0.8 hours
- Program B: mean daily screen time = 5 hours, MAD = 0.9 hours
Write a complete informal comparative inference statement. Include hedging language and reference both center and variability.
A student writes: '7th-grade science books ALWAYS use longer words than 4th-grade science books — this is a fact proven by the data.' Which revision makes the statement statistically appropriate?
Error Analysis
A student records quiz scores for Team B: 7, 8, 8, 9, 9, 10, 42.
The student computes:
Sum
Mean
The student writes: "Team B's typical quiz score is about 13.3 points."
What error did the student make? What should the student do instead, and what value would better represent a typical score?
A student wants to compare word lengths in 7th-grade books vs. 4th-grade books.
The student randomly samples 10 words from a 7th-grade poetry anthology and 10 words from a 4th-grade science textbook. The student computes:
- 7th-grade poetry sample: median = 4 letters, IQR = 2
- 4th-grade science sample: median = 6 letters, IQR = 3
The student concludes: "4th-grade books use longer words than 7th-grade books, on average."
Identify the flaw in the student's reasoning. Why might the conclusion be misleading?
Challenge
Two groups of students recorded how many hours they slept last night.
Group M: 5, 6, 7, 8, 9, 10
Group N: 9, 10, 11, 12, 13, 14
(a) Compute the mean, median, MAD, and IQR for each group.
(b) Choose the most appropriate measure of center and measure of variability for this comparison, and explain your choice.
(c) Write a complete informal comparative inference statement using those measures.
Two data sets each have a mean of 10. Data Set X has MAD = 0.5. Data Set Y has MAD = 4.0. (a) Describe what each data set's distribution looks like based on these statistics. (b) Create a specific example data set of 6 values for each that is consistent with these statistics (values do not need to produce an exact MAD, but should be reasonable). (c) If you drew dot plots of X and Y on the same scale, describe how they would look different from each other.