Spread
Given the mean and standard deviation of a set of marks, what is
the greatest number of candidates who could have scored 100%?
Problem
Fifty candidates take a test in which they can get integer marks
from zero to 100. If the mean mark is 60 find the maximum number of
candidates who could have scored 100, the scores of the other
candidates in this case and the standard deviation of the marks.
Show that if the mean is 60 and the standard deviation is 8 then it is impossible for 2 candidates or more to have scored 100 and if the standard deviation is 8.2 then at most 2 candidates could have scored 100. Find a set of marks for which 2 candidates get 100 and the mean is 60 and find the standard deviation in this case.
Show that if the standard deviation is $4\sqrt 2$ or less then it is impossible for anyone to have scored 100 marks.
Show that if the mean is 60 and the standard deviation is 8 then it is impossible for 2 candidates or more to have scored 100 and if the standard deviation is 8.2 then at most 2 candidates could have scored 100. Find a set of marks for which 2 candidates get 100 and the mean is 60 and find the standard deviation in this case.
Show that if the standard deviation is $4\sqrt 2$ or less then it is impossible for anyone to have scored 100 marks.
Getting Started
If the number of candidates who scored 100 was as high as possible, given that the mean is 60, then the standard deviation will be high.
If you reduce the standard deviation then you cut down the number who could have scored 100. Work with the formula for the standard deviation and consider the contribution to that calculation from the top scores of 100 but don't forget the other candidates.
Student Solutions
Most of the following solution was submitted by Farrukh Jawed of Beaconhouse, Lahore.
The number of candidates $n$ is 50 and the mean score $\overline x$ is 60. For there to be the maximum number of students who gained 100 marks the rest must score 0. Let $y$ be the number who gained 100 and z be the number who gained 0 marks. Then $(100y + 0z )/50 = 60$ so $y = 30$. Therefore, 30 students scored 100 and 20 students scored 0.
The standard deviation is given by $$\sigma^2 = {\sum(x_i-\overline x ^2)\over n} = {1\over n}\sum x_i^2 - \overline x^2 = 2400$$ so the standard deviation $\sigma = 20\sqrt 6 = 49.0$ to 3 s.f.
For a lower standard deviation it is not possible for so many students to score 100. Now suppose the standard deviation $\sigma = 8$. If $y$ candidates score 100 then, as the marks of the remaining 48 candidates contribute to the standard deviation, we know $$\sigma^2 > {40^2y\over 50},$$ so we know $40^2 y < 3200$ so $y< 2$.
If the standard deviation is a little higher, say 8.2. Then $40^2y< 50\times 8.2^2$ so $y< 2.10125$. As $y$ is a whole number it could be 2 (but not more than 2).
If 2 students got 100, and the mean is 60, the sum of all the marks is 3000 and the sum of the scores of the remaining 48 students is 2800. Then 2800/48 = 58.3333.... so one set of results might be 47 students getting 58 marks, the 48th getting 74 marks, and 49th and 50th getting 100 marks. In this case the standard deviation is : $$\sigma ^2 = (2\times 100^2 + 74^2 + 47\times 58^2)/50 - 60^2 $$ which gives $\sigma = 8.4664$.
Another set of possible scores with a lower standard deviation, around 8.2 or less, requires all candidates who don't score 100 to get scores as near to 60 as possible. Suppose $q$ candidates score 59 and ($48-q)$ score 58 then $$59q + 58(48-q) =2800$$ so $q=16$. In this case 2 candidates score 100, 16 score 59 and 32 score 58. The standard deviation is given by $$\sigma^2 = {2\times 40^2 + 16\times 1^2 + 32\times 2^2 \over 50} = 66.88.$$ So the standard deviation is 8.178 to 3 d.p.
If the standard deviation $\sigma = 4\sqrt 2$, as we know that the number of candidates who score 100 is always less than $$50\sigma^2\over 40^2,$$ the number who could have scored 100 is less than $${50 \times 32\over 40^2} = 1$$ so it is impossible for any of the candidates to have scored 100.
Teachers' Resources
This type of problem is sometimes called an 'inverse' problem. Rather than being given some data and having to find a solution based on the data, in an inverse problem you are given some information about an outcome or solution and have to make deductions about the data.
Inverse problems can really exercise your problem solving skills because they require more understanding of the mathematical concepts involved than the more 'direct' type of problem.
Being able to solve problems of this type is important as they arise in real life situations where mathematics can be used to model the real life situation and so help in getting answers to questions about that situation. In such circumstances you may have to be creative in finding a method because there may be no known (standard) method for solving the problem.