User interface language: English | Español

Date May 2022 Marks available 1 Reference code 22M.2.SL.TZ1.3
Level Standard Level Paper Paper 2 Time zone Time zone 1
Command term Interpret Question number 3 Adapted from N/A

Question

The scores of the eight highest scoring countries in the 2019 Eurovision song contest are shown in the following table.

For this data, find

Chester is investigating the relationship between the highest-scoring countries’ Eurovision score and their population size to determine whether population size can reasonably be used to predict a country’s score.

The populations of the countries, to the nearest million, are shown in the table.

Chester finds that, for this data, the Pearson’s product moment correlation coefficient is r=0.249.

Chester then decides to find the Spearman’s rank correlation coefficient for this data, and creates a table of ranks.

Write down the value of:

the upper quartile.

[2]
a.i.

the interquartile range.

[2]
a.ii.

Determine if the Netherlands’ score is an outlier for this data. Justify your answer.

[3]
b.

State whether it would be appropriate for Chester to use the equation of a regression line for y on x to predict a country’s Eurovision score. Justify your answer.

[2]
c.

a.

[1]
d.i.

b.

[1]
d.ii.

c.

[1]
d.iii.

Find the value of the Spearman’s rank correlation coefficient rs.

[2]
e.i.

Interpret the value obtained for rs.

[1]
e.ii.

When calculating the ranks, Chester incorrectly read the Netherlands’ score as 478. Explain why the value of the Spearman’s rank correlation rs does not change despite this error.

[1]
f.

Markscheme

370+4722         (M1)


Note: This (M1) can also be awarded for either a correct Q3 or a correct Q1 in part (a)(ii).


Q3=421         A1

 

[2 marks]

a.i.

their part (a)(i) – their Q1   (clearly stated)        (M1)

IQR =421-318= 103         A1

 

[2 marks]

a.ii.

(Q3+1.5(IQR) =421+1.5×103        (M1)

=575.5

since 498<575.5         R1

Netherlands is not an outlier         A1


Note: The R1 is dependent on the (M1). Do not award R0A1.

 

[3 marks]

b.

not appropriate (“no” is sufficient)          A1

as r is too close to zero / too weak a correlation          R1

 

[2 marks]

c.

6          A1

 

[1 mark]

d.i.

4.5          A1

 

[1 mark]

d.ii.

4.5          A1

 

[1 mark]

d.iii.

rs=0.683   0.682646          A2

 

[2 marks]

e.i.

EITHER

there is a (positive) association between the population size and the score        A1


OR

there is a (positive) linear correlation between the ranks of the population size and the ranks of the scores (when compared with the PMCC of 0.249).        A1

 

[1 mark]

e.ii.

lowering the top score by 20 does not change its rank so rs is unchanged       R1


Note: Accept “this would not alter the rank” or “Netherlands still top rank” or similar. Condone any statement that clearly implies the ranks have not changed, for example: “The Netherlands still has the highest score.”

 

[1 mark]

f.

Examiners report

In part (a), many candidates could use their GDC to find the upper quartile, but many forgot how to find the inter-quartile range.

In part (b), very few candidates knew how to show if a score is an outlier. Many candidates did not know that there is a mathematical definition to “outlier” and simply wrote sentences explaining why or why not a value was an outlier.

In part (c), candidates were able to assess the validity of a regression line. The justifications for their conclusion revealed a partial or imprecise understanding of the topic. Examples of this include “no correlation”, “weak value of r”, “low relationship”, “not close to 1”.

In part (d), about half of the candidates managed to find the correct values missing from the table.

In part (e), many candidates knew how to use their GDC to find Spearman’s rank correlation coefficient. Some mistakenly wrote down the value for r2 instead of r. Very few candidates could correctly interpret the value for r as they became confused by the fact that linear correlation must go with the rank, otherwise it is about association. They could either have said “there is an association between population size and score” or “there is a linear correlation between the rank order of the population size and the ranks of the scores”.

In part (f), most candidates were able to work out that, even if the score changed, the rank remained the same.

a.i.
[N/A]
a.ii.
[N/A]
b.
[N/A]
c.
[N/A]
d.i.
[N/A]
d.ii.
[N/A]
d.iii.
[N/A]
e.i.
[N/A]
e.ii.
[N/A]
f.

Syllabus sections

Topic 4—Statistics and probability » SL 4.10—Spearman’s rank correlation coefficient
Show 30 related questions
Topic 4—Statistics and probability

View options