User interface language: English | Español

Date November 2009 Marks available 2 Reference code 09N.2.sl.TZ0.4
Level SL only Paper 2 Time zone TZ0
Command term Decide and Give a reason Question number 4 Adapted from N/A

Question

In a mountain region there appears to be a relationship between the number of trees growing in the region and the depth of snow in winter. A set of 10 areas was chosen, and in each area the number of trees was counted and the depth of snow measured. The results are given in the table below.

In a study on \(100\) students there seemed to be a difference between males and females in their choice of favourite car colour. The results are given in the table below. A \(\chi^2\) test was conducted.

Use your graphic display calculator to find the mean number of trees.

[1]
A, a, i.

Use your graphic display calculator to find the mean depth of snow.

[1]
A, a, iii.

Use your graphic display calculator to find the standard deviation of the depth of snow.

[1]
A, a, iv.

The covariance, Sxy = 188.5.

Write down the product-moment correlation coefficient, r.

[2]
A, b.

Write down the equation of the regression line of y on x.

[2]
A, c.

If the number of trees in an area is 55, estimate the depth of snow.

[2]
A, d.

Use the equation of the regression line to estimate the depth of snow in an area with 100 trees.

[1]
A, e, i.

Decide whether the answer in (e)(i) is a valid estimate of the depth of snow in the area. Give a reason for your answer.

[2]
A, e, ii.

Write down the total number of male students.

[1]
B, a.

Show that the expected frequency for males, whose favourite car colour is blue, is 12.6.

[2]
B, b.

The calculated value of \({\chi ^2}\) is \(1.367\) and the critical value of \({\chi ^2}\) is \(5.99\) at the \(5\%\) significance level.

Write down the null hypothesis for this test.

[1]
B, c, i.

The calculated value of \({\chi ^2}\) is \(1.367\) and the critical value of \({\chi ^2}\) is \(5.99\) at the \(5\%\) significance level.

Write down the number of degrees of freedom.

[1]
B, c, ii.

The calculated value of \({\chi ^2}\) is \(1.367\) and the critical value of \({\chi ^2}\) is \(5.99\) at the \(5\%\) significance level.

Determine whether the null hypothesis should be accepted at the \(5\%\) significance level. Give a reason for your answer.

[2]
B, c, iv.

Markscheme

50     (G1)

[1 mark]

A, a, i.

30.5     (G1)

[1 mark]
A, a, iii.

12.3     (G1)


Note: Award (A1)(ft) for 13.0 in (iv) but only if 17.7 seen in (a)(ii).

 
[1 mark]
A, a, iv.

\(r = \frac{{188.5}}{{(16.79 \times 12.33)}}\)     (M1)


Note: Award (M1) for using their values in the correct formula.


= 0.911 (accept 0.912, 0.910)     (A1)(ft)(G2)

[2 marks]

A, b.

y = 0.669x 2.95     (G1)(G1)


Note: Award (G1) for 0.669x, (G1) for −2.95. If the answer is not in the form of an equation, award at most (G1)(G0).

 

[2 marks]

A, c.

Depth = 0.669 × 55 − 2.95     (M1)

= 33.8     (A1)(ft)(G2)(ft)


Note: Follow through from their (c) even if no working seen.

 

[2 marks]

A, d.

64.0 (accept 63.95, 63.9)     (A1)(ft)(G1)(ft)


Note: Follow through from their (c) even if no working seen.

 

[1 mark]

A, e, i.

It is not valid. It lies too far outside the values that are given. Or equivalent.     (A1)(R1)


Note: Do not award (A1)(R0).

 

[2 marks]

A, e, ii.

28     (A1)

[1 mark]

B, a.

\(\frac{{28 \times 45}}{{100}}\left( {\frac{{28}}{{100}} \times \frac{{45}}{{100}} \times 100} \right)\)     (M1)(A1)(ft)


Note: Award (M1) for correct formula, (A1) for correct substitution.


= 12.6     (AG)


Note: Do not award (A1) unless 12.6 seen.

 

[2 marks]

B, b.

the favourite car colour is independent of gender.     (A1)

 

Note: Accept there is no association between gender and favourite car colour.

Do not accept ‘not related’ or ‘not correlated’.

 

[1 mark]

B, c, i.

\(2\)     (A1)

[1 marks]

B, c, ii.

Accept the null hypothesis since \(1.367 < 5.991\)     (A1)(ft)(R1)

 

Note: Allow “Do not reject”. Follow through from their null hypothesis and their critical value.

Full credit for use of \(p\)-values from GDC [\(p = 0.505\)].

Do not award (A1)(R0). Award (R1) for valid comparison.

 

[2 marks]

B, c, iv.

Examiners report

A straightforward question that saw many fine attempts. Given its nature – where much of the work was done on the GDC – it must be emphasised to candidates that incorrect entry of data into the calculator will result in considerable penalties; they must check their data entry most carefully.

The use of the inappropriate standard deviation was seen, but infrequently.

A, a, i.

A straightforward question that saw many fine attempts. Given its nature – where much of the work was done on the GDC – it must be emphasised to candidates that incorrect entry of data into the calculator will result in considerable penalties; they must check their data entry most carefully.

The use of the inappropriate standard deviation was seen, but infrequently.

A, a, iii.

A straightforward question that saw many fine attempts. Given its nature – where much of the work was done on the GDC – it must be emphasised to candidates that incorrect entry of data into the calculator will result in considerable penalties; they must check their data entry most carefully.

The use of the inappropriate standard deviation was seen, but infrequently.

A, a, iv.

A straightforward question that saw many fine attempts. Given its nature – where much of the work was done on the GDC – it must be emphasised to candidates that incorrect entry of data into the calculator will result in considerable penalties; they must check their data entry most carefully.

It is expected that the GDC is used to calculate the correlation coefficient; the covariance was given to aid those candidates for whom the reset process removes this function from the display. It is anticipated that this hint will not be given in future papers.

A, b.

A straightforward question that saw many fine attempts. Given its nature – where much of the work was done on the GDC – it must be emphasised to candidates that incorrect entry of data into the calculator will result in considerable penalties; they must check their data entry most carefully.

A, c.

A straightforward question that saw many fine attempts. Given its nature – where much of the work was done on the GDC – it must be emphasised to candidates that incorrect entry of data into the calculator will result in considerable penalties; they must check their data entry most carefully.

A, d.

A straightforward question that saw many fine attempts. Given its nature – where much of the work was done on the GDC – it must be emphasised to candidates that incorrect entry of data into the calculator will result in considerable penalties; they must check their data entry most carefully.

A, e, i.

A straightforward question that saw many fine attempts. Given its nature – where much of the work was done on the GDC – it must be emphasised to candidates that incorrect entry of data into the calculator will result in considerable penalties; they must check their data entry most carefully.

The dangers of extrapolation should be clearly explained to students.

A, e, ii.

Once again, a straightforward question on chi-squared testing that was either highly successful (for the majority) or showed a lack of syllabus coverage.

B, a.

Once again, a straightforward question on chi-squared testing that was either highly successful (for the majority) or showed a lack of syllabus coverage. A surprising number of candidates lacked knowledge of the theory underlying the test and were thus unable to attempt (b).

B, b.

Once again, a straightforward question on chi-squared testing that was either highly successful (for the majority) or showed a lack of syllabus coverage. In (c)(i) it is worth stressing that the test is for the mathematical independence of two characteristics and this determines the null hypothesis.

B, c, i.

Once again, a straightforward question on chi-squared testing that was either highly successful (for the majority) or showed a lack of syllabus coverage.

B, c, ii.

Once again, a straightforward question on chi-squared testing that was either highly successful (for the majority) or showed a lack of syllabus coverage. A number of candidates confuse the critical value and \(p\)-value approach to the test and thus lost marks in (c)(iv).

B, c, iv.

Syllabus sections

Topic 4 - Statistical applications » 4.3 » Use of the regression line for prediction purposes.
Show 38 related questions

View options