Median and IQR for 7th-Grade Words
Ordered: 4, 5, 6, 6, 7, 7, 8, 9, 10, 13
- Median =
letters - Lower half → Q1 = 6; Upper half → Q3 = 9
- IQR =
letters
Mean and MAD for 4th-Grade Words
Mean =
Absolute deviations from 5.3:
2.3, 1.3, 1.3, 0.3, 0.3, 0.3, 0.7, 0.7, 1.7, 2.7
Sum = 11.6 → MAD =
Mean and MAD for 7th-Grade Words
Mean =
| 4 | 3.5 | 7 | 0.5 |
| 5 | 2.5 | 8 | 0.5 |
| 6 | 1.5 | 9 | 1.5 |
| 6 | 1.5 | 10 | 2.5 |
| 7 | 0.5 | 13 | 5.5 |
Sum = 20.0 → MAD =
Comparison Table: All Four Statistics
7th-grade is consistently higher — longer words and more variable.
Interpreting the Comparison Table Statistics
- Median difference: 7.0 − 5.0 = 2 letters
- 4th-grade IQR = 2; 7th-grade IQR = 3
- A 2-letter gap equals one full 4th-grade IQR — a meaningful difference
What inference can we draw about these two science texts?
Five Components of a Complete Inference
A complete informal inference statement must include:
- Name both populations being compared
- State the direction of the difference
- Give the actual center values and the difference
- Reference variability to contextualize the difference
- Use hedging language — "tends to be," "on average," "based on this sample"
Inference Steps 1 and 2: Populations and Choice
Step 1 — Name the populations:
"We are comparing word lengths in a typical 7th-grade science chapter and a typical 4th-grade science chapter."
Step 2 — Choose measures and justify:
The 7th-grade data is skewed right (value of 13), so we use median + IQR.
Inference Steps 3 and 4: Numbers Matter
Step 3 — State the difference:
"Median: 7.0 letters (7th grade) vs. 5.0 letters (4th grade) — a gap of 2 letters."
Step 4 — Reference variability:
"4th-grade IQR = 2 letters, so a 2-letter gap equals one full IQR — a meaningful difference."
Inference Step 5: Hedging and Full Statement
Step 5 — State the inference with hedging:
"Based on these samples, words in 7th-grade science books tend to be longer than words in 4th-grade science books. This pattern likely holds more broadly, though a larger sample would give us greater confidence."
Complete inference: steps 1 through 5 combined.
Calibrating Confidence in Inference Statements
| Language | Verdict |
|---|---|
| "7th-grade always uses longer words." | |
| "We can't say — only 10 words." | |
| "7th-grade tends to use longer words." | ✓ Appropriate |
Hedging is accuracy, not weakness.
Your Turn: Comparing Daily Screen Time
Group A (jobs): 1, 2, 2, 3, 3, 4, 4, 5, 5, 6 hrs
Group B (no jobs): 3, 4, 4, 5, 5, 5, 6, 7, 7, 9 hrs
- Compute median + IQR for each group
- Choose the measure pair; justify
- Write a complete five-component inference
Screen Time Statistics: Answers Revealed
| Group A | Group B | |
|---|---|---|
| Mean | 3.5 | 5.5 |
| Median | 3.5 | 5.5 |
| MAD | 1.3 | 1.3 |
| IQR | ≈ 2 | ≈ 2 |
Symmetric distributions — either measure pair valid.
Check-In: Evaluate This Inference Statement
"Students with after-school jobs use screens less. Group A's mean is 3.5 and Group B's is 5.5. The data proves this is always true."
What is missing or wrong? Use the five-component checklist.
Identify problems before advancing
Rate This Inference: Problems Found
- ✓ Populations partially named (Group B missing)
- ✓ Direction stated
- ✓ Values given
No variability — missing component 4
Over-confident — "proves," "always true"
Fix: add IQR comparison; replace "proves" with "tends to"
How Reliable Is a 10-Word Inference?
What if we sampled only 3 words from each chapter?
- 7th-grade: 3, 5, 12 → Median 5, Mean 6.7
- 4th-grade: 4, 4, 11 → Median 4, Mean 6.3
One long word equalized the means — small samples are noisy.
Larger Samples Produce More Stable Estimates
As
Three Phrases: Shows, Suggests, Proves
| Phrase | When to use |
|---|---|
| "The data shows…" | Describing your sample |
| "The data suggests…" | Inferring about the population |
| "The data proves…" | Almost never — samples carry uncertainty |
"Informal" means no probability calculations, not careless reasoning.
Check-In: Which Language Fits Here?
Choose: "shows," "suggests," or "proves"
- You sampled 4 words; 7th-grade median was higher.
- You sampled 200 words from 50 chapters; results consistent.
- You are reporting the mean of your 10 measured words.
Decide before advancing
Which Language Fits: Answers Revealed
- 4 words sampled → "suggests" (tentative; very small sample)
- 200 words, 50 chapters → "suggests" (stronger, but still a sample)
- Reporting measured mean → "shows" (describing your data, not inferring)
"Proves" requires every word from every book — that's a census, not a sample.
Key Takeaways from Today's Lesson
- ✓ Choose measures first: symmetric → mean + MAD; skewed → median + IQR
- ✓ Complete inferences need: populations, direction, values, variability, hedging
- ✓ Gap compared to spread shows whether difference is meaningful
- ✓ Larger samples are more reliable — always hedge
Watch Out: Six Common Inference Errors
- No variability — inference without spread is incomplete
- Sample = population — "proves" overstates a sample
- Default to mean — check shape first
- High IQR = worse — variability describes the data
- Too certain — "always" and "proves" rarely fit
- Unfair comparison — use random comparable samples
Next: Applying Inference to Real Populations
In the next lesson, we'll apply informal comparative inference to real-world population data — income distributions, age distributions across countries, and survey results.
Coming up:
- Larger data sets with more complex distributions
- Choosing measures when data has multiple features to consider
- Connecting statistical inference to decision-making
Click to begin the narrated lesson
Use measures of center and measures of variability to draw informal comparative inferences about two populations