Date | May 2022 | Marks available | 2 | Reference code | 22M.2.SL.TZ1.3 |
Level | Standard Level | Paper | Paper 2 | Time zone | Time zone 1 |
Command term | Find | Question number | 3 | Adapted from | N/A |
Question
The scores of the eight highest scoring countries in the 2019 Eurovision song contest are shown in the following table.
For this data, find
Chester is investigating the relationship between the highest-scoring countries’ Eurovision score and their population size to determine whether population size can reasonably be used to predict a country’s score.
The populations of the countries, to the nearest million, are shown in the table.
Chester finds that, for this data, the Pearson’s product moment correlation coefficient is r = 0.249.
Chester then decides to find the Spearman’s rank correlation coefficient for this data, and creates a table of ranks.
Write down the value of:
the upper quartile.
the interquartile range.
Determine if the Netherlands’ score is an outlier for this data. Justify your answer.
State whether it would be appropriate for Chester to use the equation of a regression line for y on x to predict a country’s Eurovision score. Justify your answer.
a.
b.
c.
Find the value of the Spearman’s rank correlation coefficient r_s.
Interpret the value obtained for r_s.
When calculating the ranks, Chester incorrectly read the Netherlands’ score as 478. Explain why the value of the Spearman’s rank correlation r_s does not change despite this error.
Markscheme
(370 + 472)/2 (M1)
Note: This (M1) can also be awarded for either a correct Q3 or a correct Q1 in part (a)(ii).
Q3 = 421 A1
[2 marks]
their part (a)(i) − their Q1 (clearly stated) (M1)
IQR = (421 − 318 =) 103 A1
[2 marks]
(Q3 + 1.5(IQR) =) 421 + (1.5 × 103) (M1)
= 575.5
since 498 < 575.5 R1
Netherlands is not an outlier A1
Note: The R1 is dependent on the (M1). Do not award R0A1.
[3 marks]
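The outlier test in part (b) can be checked numerically. The following is a minimal sketch in Python, using only the values quoted in the markscheme (Q1 = 318, Q3 = 421, Netherlands' score 498); the variable names are illustrative and not part of the original question.

```python
# Values taken from the markscheme above.
q1 = 318
q3 = 421
netherlands_score = 498

iqr = q3 - q1                    # interquartile range
upper_fence = q3 + 1.5 * iqr     # values above this count as outliers

# The score is an outlier only if it lies beyond the upper fence.
is_outlier = netherlands_score > upper_fence

print(iqr, upper_fence, is_outlier)
```

Because 498 lies below the fence of 575.5, the comparison comes out false, which matches the markscheme's conclusion.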
not appropriate (“no” is sufficient) A1
as r is too close to zero / too weak a correlation R1
[2 marks]
6 A1
[1 mark]
4.5 A1
[1 mark]
4.5 A1
[1 mark]
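Two equal ranks of 4.5 are what tied values receive when they share positions 4 and 5: each takes the mean of the positions it occupies. The sketch below illustrates that convention in Python. The helper `average_ranks` and the population figures are hypothetical, chosen only to produce a tie; they are not the data from the exam table.

```python
def average_ranks(values):
    """Rank from largest (rank 1) to smallest; tied values share the mean rank."""
    ordered = sorted(values, reverse=True)
    ranks = []
    for v in values:
        first = ordered.index(v) + 1             # first position occupied by v
        count = ordered.count(v)                 # number of values tied at v
        ranks.append(first + (count - 1) / 2)    # mean of the tied positions
    return ranks

# Hypothetical populations in millions (NOT the exam data); 12 appears twice.
populations = [30, 50, 90, 8, 12, 4, 2, 12]
ranks = average_ranks(populations)
print(ranks)
```

The two entries equal to 12 would occupy positions 4 and 5, so each is assigned (4 + 5)/2 = 4.5.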
r_s = 0.683 (0.682646…) A2
[2 marks]
EITHER
there is a (positive) association between the population size and the score A1
OR
there is a (positive) linear correlation between the ranks of the population size and the ranks of the scores (when compared with the PMCC of 0.249). A1
[1 mark]
lowering the top score by 20 does not change its rank so r_s is unchanged R1
Note: Accept “this would not alter the rank” or “Netherlands still top rank” or similar. Condone any statement that clearly implies the ranks have not changed, for example: “The Netherlands still has the highest score.”
[1 mark]
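The reasoning in part (f) can be illustrated with a short Python sketch. It uses only the three highest scores that appear in the working above (498 for the Netherlands, and 472 and 370 from the Q3 calculation); the remaining scores are omitted because they do not affect the top rank. The helper `ranks_descending` is hypothetical and assumes all values are distinct.

```python
def ranks_descending(values):
    """Rank distinct values from largest (rank 1) to smallest."""
    ordered = sorted(values, reverse=True)
    return [ordered.index(v) + 1 for v in values]

# Top three scores quoted in the markscheme working; the first is the Netherlands.
correct_scores = [498, 472, 370]
misread_scores = [478, 472, 370]   # Netherlands misread as 478

correct_ranks = ranks_descending(correct_scores)
misread_ranks = ranks_descending(misread_scores)

# Spearman's coefficient is computed from ranks only, so identical ranks
# give an identical coefficient.
print(correct_ranks, misread_ranks, correct_ranks == misread_ranks)
```

Since 478 is still greater than 472, the Netherlands keeps rank 1 and every other rank is untouched.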
Examiners report
In part (a), many candidates could use their GDC to find the upper quartile, but many forgot how to find the interquartile range.
In part (b), very few candidates knew how to show if a score is an outlier. Many candidates did not know that there is a mathematical definition to “outlier” and simply wrote sentences explaining why or why not a value was an outlier.
In part (c), candidates were able to assess the validity of a regression line. The justifications for their conclusion revealed a partial or imprecise understanding of the topic. Examples of this include “no correlation”, “weak value of r”, “low relationship”, “not close to 1”.
In part (d), about half of the candidates managed to find the correct values missing from the table.
In part (e), many candidates knew how to use their GDC to find Spearman’s rank correlation coefficient. Some mistakenly wrote down the value for r² instead of r. Very few candidates could correctly interpret the value for r, as they were confused by the distinction that “linear correlation” must refer to the ranks, whereas a statement about the variables themselves must refer to “association”. They could either have said “there is an association between population size and score” or “there is a linear correlation between the rank order of the population size and the ranks of the scores”.
In part (f), most candidates were able to work out that, even if the score changed, the rank remained the same.