Chuyển tới nội dung chính

Statistics Basics — Mean, Median, Mode, and Data Analysis

Grade: 6-8 | Topic: Statistics

What You Will Learn

By the end of this guide you will know how to calculate the mean, median, mode, and range of a data set. You will understand when to use each measure of central tendency and how to interpret data to draw meaningful conclusions. These foundational statistics skills appear in every standardized math test from grade 6 onward and are essential for real-world data analysis.

Theory

Measures of Central Tendency

Statistics is the branch of math that deals with collecting, organizing, and interpreting data. The most important starting point is understanding measures of central tendency — single numbers that summarize a whole data set by describing its "center."

There are three main measures:

MeasureWhat It Tells YouBest Used When
MeanThe average valueData is evenly spread, no extreme outliers
MedianThe middle valueData is skewed or has outliers
ModeThe most frequent valueYou want the most common category or result

Mean (Average)

The mean is the sum of all values divided by how many values there are:

Mean=Sum of all valuesNumber of values=xn\text{Mean} = \frac{\text{Sum of all values}}{\text{Number of values}} = \frac{\sum x}{n}

Where x\sum x is the sum of every data value and nn is the total count.

For example, if five students scored 70, 80, 85, 90, and 95 on a test:

Mean=70+80+85+90+955=4205=84\text{Mean} = \frac{70 + 80 + 85 + 90 + 95}{5} = \frac{420}{5} = 84

The mean is sensitive to outliers — one very high or very low value can pull the mean away from the typical value.

Median (Middle Value)

The median is the middle value when all data points are arranged in order from least to greatest.

  • If nn is odd, the median is the value at position n+12\frac{n+1}{2}.
  • If nn is even, the median is the average of the two middle values:

Median (even n)=xn/2+x(n/2)+12\text{Median (even } n\text{)} = \frac{x_{n/2} + x_{(n/2)+1}}{2}

The median is not affected by outliers, making it a more reliable measure for skewed data.

Mode (Most Frequent Value)

The mode is the value that appears most often. A data set can have:

  • No mode — all values appear the same number of times
  • One mode (unimodal) — one value appears more than any other
  • Two modes (bimodal) — two values tie for the highest frequency
  • Multiple modes (multimodal) — three or more values tie

Range (Spread of Data)

The range measures how spread out the data is:

Range=Maximum valueMinimum value\text{Range} = \text{Maximum value} - \text{Minimum value}

A small range means the data points are close together; a large range means they are spread out.

Data Representation

Once you have calculated summary statistics, you can visualize data using charts and graphs:

  • Bar charts — compare quantities across categories
  • Pie charts — show parts of a whole as percentages
  • Line graphs — display trends over time
  • Dot plots / frequency tables — show how often each value occurs

Choosing the right representation makes patterns and outliers easy to spot at a glance.

Worked Examples

Example 1: Finding Mean, Median, Mode, and Range (Basic)

Problem: A student received the following scores on six quizzes: 78, 85, 92, 85, 90, 88. Find the mean, median, mode, and range.

Step 1 — Mean: Add all values and divide by 6.

Mean=78+85+92+85+90+886=518686.3\text{Mean} = \frac{78 + 85 + 92 + 85 + 90 + 88}{6} = \frac{518}{6} \approx 86.3

Step 2 — Median: Arrange in order: 78, 85, 85, 88, 90, 92. Since n=6n = 6 (even), average the 3rd and 4th values.

Median=85+882=1732=86.5\text{Median} = \frac{85 + 88}{2} = \frac{173}{2} = 86.5

Step 3 — Mode: The value 85 appears twice; all others appear once.

Mode = 85

Step 4 — Range:

Range=9278=14\text{Range} = 92 - 78 = 14

Answer: Mean 86.3\approx 86.3, Median =86.5= 86.5, Mode =85= 85, Range =14= 14.

Example 2: Mean with a Missing Value (Medium)

Problem: The mean of five test scores is 82. Four of the scores are 75, 80, 88, and 90. What is the fifth score?

Step 1: Use the mean formula in reverse. If the mean is 82 and there are 5 scores, the total sum must be:

Sum=Mean×n=82×5=410\text{Sum} = \text{Mean} \times n = 82 \times 5 = 410

Step 2: Add the four known scores:

75+80+88+90=33375 + 80 + 88 + 90 = 333

Step 3: Subtract to find the missing score:

410333=77410 - 333 = 77

Answer: The fifth score is 77.

Example 3: Median with an Even Number of Values (Medium)

Problem: A class of 8 students recorded their heights in cm: 152, 148, 160, 155, 148, 163, 157, 150. Find the median height.

Step 1: Arrange in ascending order:

148,  148,  150,  152,  155,  157,  160,  163148, \; 148, \; 150, \; 152, \; 155, \; 157, \; 160, \; 163

Step 2: Since n=8n = 8 (even), find the 4th and 5th values: 152 and 155.

Median=152+1552=3072=153.5 cm\text{Median} = \frac{152 + 155}{2} = \frac{307}{2} = 153.5 \text{ cm}

Answer: The median height is 153.5 cm.

Example 4: Choosing the Best Measure (Challenging)

Problem: A small company has 6 employees with the following monthly salaries (in dollars): 3,000; 3,200; 3,400; 3,500; 3,600; 15,000. Which measure of central tendency best represents the typical salary?

Step 1 — Mean:

Mean=3,000+3,200+3,400+3,500+3,600+15,0006=31,70065,283\text{Mean} = \frac{3{,}000 + 3{,}200 + 3{,}400 + 3{,}500 + 3{,}600 + 15{,}000}{6} = \frac{31{,}700}{6} \approx 5{,}283

Step 2 — Median: Data is already ordered. Average the 3rd and 4th values:

Median=3,400+3,5002=3,450\text{Median} = \frac{3{,}400 + 3{,}500}{2} = 3{,}450

Step 3 — Mode: No value repeats, so there is no mode.

Step 4 — Analysis: The salary of $15,000 is an outlier. The mean ($5,283) is pulled upward and does not reflect what a typical employee earns. The median ($3,450) is much closer to what most employees actually make.

Answer: The median ($3,450) best represents the typical salary because the data contains an outlier that skews the mean.

Example 5: Frequency Table and Mode (Challenging)

Problem: A survey asked 20 students how many books they read last month. The results:

Books read012345
Frequency257321

Find the mean, median, and mode.

Step 1 — Mean: Multiply each value by its frequency, sum, and divide by total:

Sum=(0×2)+(1×5)+(2×7)+(3×3)+(4×2)+(5×1)\text{Sum} = (0 \times 2) + (1 \times 5) + (2 \times 7) + (3 \times 3) + (4 \times 2) + (5 \times 1)

=0+5+14+9+8+5=41= 0 + 5 + 14 + 9 + 8 + 5 = 41

Mean=4120=2.05\text{Mean} = \frac{41}{20} = 2.05

Step 2 — Median: With 20 values (even), find the 10th and 11th values. Count cumulatively: positions 1-2 are 0, positions 3-7 are 1, positions 8-14 are 2. Both the 10th and 11th values are 2.

Median=2+22=2\text{Median} = \frac{2 + 2}{2} = 2

Step 3 — Mode: The value 2 has the highest frequency (7).

Mode = 2

Answer: Mean =2.05= 2.05, Median =2= 2, Mode =2= 2.

Common Mistakes

Mistake 1: Forgetting to sort the data before finding the median

❌ Data: 85, 70, 92, 78, 90 — picking the middle value directly gives 92.

✅ Sort first: 70, 78, 85, 90, 92 — the correct median is 85.

Why this matters: The median requires ordered data. Picking the middle position of unsorted data gives a random value, not the true center.

Mistake 2: Dividing by the wrong number when calculating the mean

❌ Scores: 80, 90, 100. Mean =80+90+1002=135= \frac{80 + 90 + 100}{2} = 135 (divided by 2 instead of 3).

✅ Mean =80+90+1003=2703=90= \frac{80 + 90 + 100}{3} = \frac{270}{3} = 90.

Why this matters: The denominator must equal the total number of values, not the number of distinct values or any other count.

Mistake 3: Saying "no mode" when there are repeated values

❌ Data: 4, 4, 7, 7, 9. Student says "no mode" because two values tie.

✅ This data set is bimodal — it has two modes: 4 and 7.

Why this matters: "No mode" only applies when every value appears the same number of times. When two or more values tie for the highest frequency, they are all modes.

Practice Problems

Try these on your own before checking the answers:

  1. Find the mean, median, mode, and range of: 12, 15, 18, 15, 20, 10, 15.
  2. The mean of four numbers is 25. Three of the numbers are 20, 30, and 22. What is the fourth number?
  3. A teacher records the following test scores for 10 students: 65, 72, 78, 80, 80, 85, 88, 90, 92, 95. Would the mean or the median better represent the class performance? Explain your reasoning.
  4. A data set has values: 3, 7, 7, 10, 12, 12, 15. Is it unimodal, bimodal, or multimodal? What is the range?
Click to see answers
  1. Sorted: 10, 12, 15, 15, 15, 18, 20. Mean =105÷7=15= 105 \div 7 = 15. Median =15= 15 (4th value). Mode =15= 15 (appears 3 times). Range =2010=10= 20 - 10 = 10.
  2. Sum needed =25×4=100= 25 \times 4 = 100. Known sum =20+30+22=72= 20 + 30 + 22 = 72. Fourth number =10072=28= 100 - 72 = 28.
  3. Both the mean (82.582.5) and median (82.582.5) happen to be equal here. Since the data is roughly symmetric with no extreme outliers, either measure works well. In symmetric data, mean and median converge.
  4. Bimodal — both 7 and 12 appear twice. Range =153=12= 15 - 3 = 12.

Summary

  • The mean (average) is the sum of all values divided by the count — best for symmetric data without outliers.
  • The median is the middle value of sorted data — best when data is skewed or has extreme values.
  • The mode is the most frequent value — useful for categorical data or finding the most common result.
  • The range measures spread: maximum minus minimum.
  • Always sort your data before finding the median, and always count carefully when computing the mean.
  • Choosing the right measure of central tendency depends on the shape of the data and whether outliers are present.

Need help with statistics problems?

Take a photo of your math problem and MathPal will solve it step by step.

Open MathPal