GCSE Maths Foundation · AQA · Averages & spread
Computing mean and median: why you must sort first, use total = mean × n, and divide by Σf
Averages questions trip students who apply the recipe to the numbers as they appear on the page. For the median, they pick the physically middle entry of the list as written — without sorting. For a reverse-mean question, they average the values they can see — because the mean recipe “only runs forwards.” For a frequency table, they divide by the number of rows rather than by the total number of data points.
The thirty-second fix: the median requires a sorted list; the mean always links to a total via total = mean × n; and in a frequency table, n is Σf — the sum of the frequencies, never the number of rows.
Already know this is your gap? Skip the diagnostic and jump straight into the targeted lesson.
How to spot it in your own work
- You gave as the median of 7, 2, 9, 4, 5 — because 9 is in position 3 of the list as written — without sorting first.
- You found a missing value by averaging the numbers you could see, instead of computing total = mean × n and then subtracting.
- You divided by the number of rows in a frequency table instead of by .
- You picked one of the two middle values for an even-sized data set instead of averaging the two middle values.
An exam question that triggers it
Here is the canonical AQA Foundation trigger (Nov24 P3 Q12 shape):
The values 7, 2, 9, 4, 5 are given in this order. What is the median?
The misconception answer is — the value in position 3 of the list as written. But the median is the middle of the sorted list.
Sort: . Five values — the middle is position 3. Median = .
Why students fall for this
The word “median” activates the idea of “middle,” and students read “middle” as “the one in the middle of the page as written.” The sorting step is invisible — it is a structural requirement the recipe never announces.
The reverse-mean trap has the same root: students know the mean recipe runs , but when a value is missing they reach for the visible inputs and run the recipe on what they can see — averaging the three or four known values. The step “multiply mean × n to get the total” is not in their toolkit, because they have only ever run the recipe forwards.
In a frequency table the denominator confusion is structural: the table has rows, and dividing by the number of rows feels natural. But n in the mean formula refers to the total number of data points — which is , not the number of categories.
The fix: Sort first; total = mean × n; divide by Σf
Median: sort the data first, then pick the middle. For : sorted list is . Five values — middle is position 3. Median = . For an even count, average the two middle values.
Reverse mean: total = mean × n, then subtract the known sum. Four numbers, mean 10, three known values 5, 8, 9: total = 10 × 4 = 40; known sum = 5 + 8 + 9 = 22; fourth value = 40 − 22 = 18. Never average the values you can see.
Frequency table mean: divide by . is the total number of data points, not the number of rows. For x = 0, 1, 2, 3 with frequencies 2, 5, 8, 5: , , mean = 36 ÷ 20 = 1.8.
Worked example
Find the median of .
- Sort smallest to largest.
- Count the values. There are 5 values, so the middle is position 3.
- Pick the middle value.
The trap answer comes from reading position 3 of the original list. The original order is irrelevant — only the sorted order defines the median.
Find out if this is costing you marks
The 10-minute diagnostic checks for this pattern (and four others) using AQA-style GCSE Higher items. Free, no signup, anonymous.
Common questions
- How do I find the median of an even number of values?
Sort the values, then average the two middle ones. For six values 3, 5, 7, 9, 12, 15: the two middle values are position 3 and position 4, which are and . Median = .
- Why is it wrong to average the values I can see when finding the missing value in a reverse-mean question?
Averaging the known values gives the mean of those values — not the value that is missing. You need the grand total first: total = mean × n. Then subtract the known sum. For four numbers with mean 10 and known values 5, 8, 9: total = 40; known sum = 22; missing value = 18. Averaging 5, 8, 9 gives 7.33 — a completely different number.
- What is Σf and why do I divide by it?
is the sum of all the frequencies — the total number of data points in the study. When you calculate the mean you always divide by n, the total count. In a frequency table, n = Σf. The number of rows (categories) is irrelevant. For x = 0, 1, 2, 3 with frequencies 2, 5, 8, 5: , not 4.
- Does it matter what order I write the values in for the median?
The order you write them in has no effect on the median — only the sorted order matters. Whether the data is given as 7, 2, 9, 4, 5 or as 9, 7, 5, 4, 2, the sorted list is the same (2, 4, 5, 7, 9) and the median is the same (5).
Related misconceptions
- Inverse operations: why solving an equation means undoing each stepThe same forwards-only thinking that traps students on reverse-mean questions also traps them when solving equations — both need you to run an operation in reverse.
- Fraction of an amount: why 2/5 of 1020 is 408, not 40A related recipe-application error: copying a label instead of performing the operation the recipe demands.
- Combining unlike terms: why 9x + y − 6x + y is 3x + 2y, not 5xyAnother case where the structure of the operation (sort by type, collect within each type) must be applied before the arithmetic — the numbers alone are not enough.