Question 1
The following table shows the data collected from an experiment.

The data is also represented on the following scatter diagram.

The relationship between `x` and `y` can be modelled by the regression line of `y` on `x` with equation `y=ax+b`, where `a,b in RR`.
(a) Write down the value of `a` and the value of `b`.
(b) Use this model to predict the value of `y` when `x=18`.
(c) Write down the value of `barx` and `bary`.
(d) Draw the line of best fit on the scatter diagram.
Mark as Complete
Mark Scheme
Question 2
At a café, the waiting time between ordering and receiving a cup of coffee is dependent upon the number of customers who have already ordered their coffee and are waiting to receive it.
Sarah, a regular customer, visited the café on five consecutive days. The following table shows the number of customers, `x`, ahead of Sarah who have already ordered and are waiting to receive their coffee and Sarah’s waiting time, `y` minutes.

The relationship between `x` and `y` can be modelled by the regression line of `y` on `x` with equation `y=ax+b`.
(a)
(i) Find the value of `a` and the value of `b`.
(ii) Write down the value of Pearson’s product-moment correlation coefficient, `r`.
(b) Interpret, in context, the value of `a` found in part (a)(i).
On another day, Sarah visits the café to order a coffee. Seven customers have already ordered their coffee and are waiting to receive it.
(c) Use the result from part (a)(i) to estimate Sarah’s waiting time to receive her coffee.
Mark as Complete
Mark Scheme
Question 3
In Lucy’s music academy, eight students took their piano diploma examination and achieved scores out of 150. For her records, Lucy decided to record the average number of hours per week each student reported practicing in the weeks prior to their examination. These results are summarized in the table below.
(a) Find Pearson’s product-moment correlation coefficient, `r`, for these data.
(b) The relationship between the variables can be modelled by the regression equation `D = ah + b`. Write down the value of `a` and the value of `b`.
(c) One of these eight students was disappointed with her result and wished she had practiced more. Based on the given data, determine how her score could have been expected to alter had she practiced an extra five hours per week.
Mark as Complete
Mark Scheme
Question 4
The following table shows the Mathematics test scores `(x)` and the Science test scores `(y)` for a group of eight students.

The regression line of `y` on `x` for this data can be written in the form `y=ax+b`.
(a) Find the value of `a` and the value of `b`.
(b) Write down the value of the Pearson’s product-moment correlation coefficient, `r`.
(c) Use the equation of your regression line to predict the Science test score for a student who has a score of 78 on the Mathematics test. Express your answer to the nearest integer.
Mark as Complete
Mark Scheme
Question 5
A botanist is conducting an experiment which studies the growth of plants.
The heights of the plants are measured on seven different days.
The following table shows the number of days, `d`, that the experiment has been running and the average height, `h` cm, of the plants on each of those days.

(a) The regression line of `h` on `d` for this data can be written in the form `h = ad + b`
Find the value of `a` and the value of `b`.
(b) Write down the value of the Pearson’s product-moment correlation coefficient, `r`.
(c) Use your regression line to estimate the average height of the plants when the experiment has been running for 20 days.
Mark as Complete
Mark Scheme
Question 6
A class is given two tests, Test A and Test B. Each test is scored out of a total of 100 marks. The scores of the students are shown in the following table.

Let `x` be the score on Test A and `y` be the score on Test B.
The teacher finds that the equation of the regression line of `y` on `x` for these scores is `y = 0.822x + 18.4`.
(a) Find the value of the Pearson’s product-moment correlation coefficient, `r`.
Giovanni was absent for Test A and Paulo was absent for Test B.
The teacher uses the regression line of `y` on `x` to estimate the missing scores.
Paulo scored 10 on Test A.
The teacher estimated his score on Test B to be 27 to the nearest integer using the following calculation:
`y = 0.822(10) + 18.4 ~~ 27`
(b) Give a reason why this method is not appropriate for Paulo.
Giovanni scored 90 on Test B.
The teacher estimated his score on Test A to be 87 to the nearest integer using the following calculation:
`90 = 0.822x + 18.4`, so `x = frac{90-18.4}{0.822}~~87`
(c)
(i) Give a reason why this method is not appropriate for Giovanni.
(ii) Use an appropriate method to show that the estimated Test A score for Giovanni is 86 to the nearest integer.
Mark as Complete
Mark Scheme
Question 7
Consider the following bivariate data set where `p,q in ZZ^+`.

The regression line of `y` on `x` has equation `y = 2.1875x + 0.6875`.
The regression line passes through the mean point `(bar x, bar y)`.
(a) Given that `bar x = 7`, verify that `bar y = 16`.
(b) Given that `q - p = 3`, find the value of `p` and the value of `q`.
Mark as Complete
Mark Scheme
Question 8
Tania wishes to see whether there is any correlation between a person’s age and the number of objects on a tray which could be remembered after looking at them for a certain time.
She obtains the following table of results.

(a) Use your graphic display calculator to find the equation of the regression line of `y` on `x`.
(b) Use your equation to estimate the number of objects remembered by a person aged 28 years.
(c) Use your graphic display calculator to find the correlation coefficient `r`.
(d) Comment on your value for `r`.
Mark as Complete
Mark Scheme
Question 9
(i) The following values are the product-moment correlation coefficients for the five scatter diagrams below.
0.50, –0.95, –0.60, 0.00, 0.90
Match each scatter diagram with its corresponding correlation coefficient.

(ii) The annual advertising expenditures and sales, in dollars, for a small company are listed below.

(a) Find the regression line `y = a + bx` for this data, and hence predict the annual sales that would result if 7000 dollars were spent in advertising. Give your answer to the nearest thousand dollars.
(b) If the annual advertising expenditures and annual sales figures above were converted into Japanese yen (1 dollar = 69.4017 yen) which of the following quantities would change and which would remain the same?
(iii) Intelligence Quotient (IQ) in a certain population is normally distributed with a mean of 100 and a standard deviation of 15.
(a) What percentage of the population has an IQ between 90 and 125?
(b) If two persons are chosen at random from the population, what is the probability that both have an IQ greater than 125?
(c) The mean IQ of a random group of 25 persons suffering from a certain brain disorder was found to be 95.2. Is this sufficient evidence, at the 0.05 level of significance, that people suffering from the disorder have, on average, a lower IQ than the entire population? State your null hypothesis and your alternative hypothesis, and explain your reasoning.
(iv) The manufacturer of ‘Fizz’, a popular soft drink, offered Millennium University one million dollars per year to make ‘Fizz’ the only soft drink sold on campus. A group of 140 students and teachers were consulted on whether the university should accept or reject the offer. Their responses are summarized in the following table:

(a) Construct a table of expected frequencies assuming that the attitude towards the offer is independent of whether the person is a student or a teacher. Do not apply Yates’ continuity correction.
(b) Calculate `\chi^2` for this set of data.
(c) Based on the `chi^2` test at 0.01 level of significance, which of the following conclusions may be drawn?
Students are more favourable than teachers towards accepting the offer.
Students are less favourable than teachers towards accepting the offer.
Acceptance or rejection does not depend on whether the person is a student or a teacher.
Explain briefly the reasons for your answers.
Mark as Complete
Mark Scheme
Question 10
(i) The mass of packets of a breakfast cereal is normally distributed with a mean of 750g and standard deviation of 25 g.
(a) Find the probability that a packet chosen at random has mass
(b) Two packets are chosen at random. What is the probability that both packets have a mass which is less than 740 g?
(c) The mass of 70% of the packets is more than 𝑥 grams. Find the value of `x`.
(ii) Three schools from the same city enter students for an examination in which successful candidates can achieve one of three grades: Pass, Credit, Distinction. The results are shown in the following table

It may be assumed that a student's result is independent of the school attended.
(a) The following table gives the expected frequencies for the above data.

(b) Newspapers wish to use these results to make comparisons between the schools. Based on the value of `chi^2`, decide whether there is justification for the statement 'success in the examination depends on which school is attended'. Examine the statement
(iii) A scientist is investigating the way in which the length of a metal rod varies with temperature. She reads the length 𝑦 mm at different temperatures 𝑥°C. From a set of these readings, she calculates the following results.
 `bar x = 200, bar y = 1000, s_x = 2.31, s_y = 11.7, s_{xy} = 26.1`
(a) Find
(b) Which of the following diagrams most closely resembles the set of readings taken by the scientist? Give a reason for your answer.

Mark as Complete
Mark Scheme
Question 1
The following table shows the data collected from an experiment.

The data is also represented on the following scatter diagram.

The relationship between `x` and `y` can be modelled by the regression line of `y` on `x` with equation `y=ax+b`, where `a,b in RR`.
(a) Write down the value of `a` and the value of `b`.
(b) Use this model to predict the value of `y` when `x=18`.
(c) Write down the value of `barx` and `bary`.
(d) Draw the line of best fit on the scatter diagram.
(a)
`a = 0.433156..., b = 4.50265...`
`a = 0.433, b = 4.50`
(b)
attempt to substitute `x=18` into their equation `y = 0.433 times 18 + 4.50`
 `= 12.2994...`
 `= 12.3`
(c) `bar x = 15, bar y = 11`
(d)

Question 2
At a café, the waiting time between ordering and receiving a cup of coffee is dependent upon the number of customers who have already ordered their coffee and are waiting to receive it.
Sarah, a regular customer, visited the café on five consecutive days. The following table shows the number of customers, `x`, ahead of Sarah who have already ordered and are waiting to receive their coffee and Sarah’s waiting time, `y` minutes.

The relationship between `x` and `y` can be modelled by the regression line of `y` on `x` with equation `y=ax+b`.
(a)
(i) Find the value of `a` and the value of `b`.
(ii) Write down the value of Pearson’s product-moment correlation coefficient, `r`.
(b) Interpret, in context, the value of `a` found in part (a)(i).
On another day, Sarah visits the café to order a coffee. Seven customers have already ordered their coffee and are waiting to receive it.
(c) Use the result from part (a)(i) to estimate Sarah’s waiting time to receive her coffee.
(a)
(i) `a = 0.805084...` and `b = 2.88135...` `=> a = 0.805` and `b = 2.88`
(ii) `r = 0.97777...` `=> r = 0.978`
(b) `a` represents the (average) increase in waiting time (0.805 mins) per additional customer (waiting to receive their coffee)
(c) attempt to substitute `x=7` into their equation
8.51693...
8.52 (mins)
Question 3
In Lucy’s music academy, eight students took their piano diploma examination and achieved scores out of 150. For her records, Lucy decided to record the average number of hours per week each student reported practicing in the weeks prior to their examination. These results are summarized in the table below.
(a) Find Pearson’s product-moment correlation coefficient, `r`, for these data.
(b) The relationship between the variables can be modelled by the regression equation `D = ah + b`. Write down the value of `a` and the value of `b`.
(c) One of these eight students was disappointed with her result and wished she had practiced more. Based on the given data, determine how her score could have been expected to alter had she practiced an extra five hours per week.
(a) use of GDC to give
`r = 0.883529...`
`r = 0.884`
(b) `a = 1.36609..., b = 64.5171...`
`a = 1.37, b = 64.5`
(c) attempt to find their difference
`5 times 1.36609...`
`6.83045...`
`= 6.83 (6.85` from `1.37)`
the student could have expected her score to increase by 7 marks.
Question 4
The following table shows the Mathematics test scores `(x)` and the Science test scores `(y)` for a group of eight students.

The regression line of `y` on `x` for this data can be written in the form `y=ax+b`.
(a) Find the value of `a` and the value of `b`.
(b) Write down the value of the Pearson’s product-moment correlation coefficient, `r`.
(c) Use the equation of your regression line to predict the Science test score for a student who has a score of 78 on the Mathematics test. Express your answer to the nearest integer.
(a) 1.01206..., 2.45230...
`a = 1.01, b = 2.45 \ (1.01x + 2.45)`
(b) 0.981464...
`r = 0.981`
(c) correct substitution of 78 into their regression equation
81.3930... 81.23 from 3 sf answer
81
Question 5
A botanist is conducting an experiment which studies the growth of plants.
The heights of the plants are measured on seven different days.
The following table shows the number of days, `d`, that the experiment has been running and the average height, `h` cm, of the plants on each of those days.

(a) The regression line of `h` on `d` for this data can be written in the form `h = ad + b`
Find the value of `a` and the value of `b`.
(b) Write down the value of the Pearson’s product-moment correlation coefficient, `r`.
(c) Use your regression line to estimate the average height of the plants when the experiment has been running for 20 days.
(a) `a = 1.93258..., b = 7.21662...`
`a = 1.93, b = 7.22`
(b) `r = 0.991087`
`r = 0.991`
(c) attempt to substitute `d = 20` into their equation
height = 45.8683
height = 45.9 (cm)
Question 6
A class is given two tests, Test A and Test B. Each test is scored out of a total of 100 marks. The scores of the students are shown in the following table.

Let `x` be the score on Test A and `y` be the score on Test B.
The teacher finds that the equation of the regression line of `y` on `x` for these scores is `y = 0.822x + 18.4`.
(a) Find the value of the Pearson’s product-moment correlation coefficient, `r`.
Giovanni was absent for Test A and Paulo was absent for Test B.
The teacher uses the regression line of `y` on `x` to estimate the missing scores.
Paulo scored 10 on Test A.
The teacher estimated his score on Test B to be 27 to the nearest integer using the following calculation:
`y = 0.822(10) + 18.4 ~~ 27`
(b) Give a reason why this method is not appropriate for Paulo.
Giovanni scored 90 on Test B.
The teacher estimated his score on Test A to be 87 to the nearest integer using the following calculation:
`90 = 0.822x + 18.4`, so `x = frac{90-18.4}{0.822}~~87`
(c)
(i) Give a reason why this method is not appropriate for Giovanni.
(ii) Use an appropriate method to show that the estimated Test A score for Giovanni is 86 to the nearest integer.
(a) `r = 0.901017...`
`r = 0.901`
(b) Student 11 Test B: should not extrapolate
(c)
(i) Student 12 Test A: should not use line of `y` on `x` to predict `x` from `y` (or equivalent)
(ii) attempt to find the equation of the regression line of `x` on `y`
`(x =) 0.987124...y − 3.21970... \ ((x =) 0.987y − 3.22)`
`(x =) 0.987124...(90) − 3.21970... \ (= 85.6214...)`
`= 86` to nearest integer
Question 7
Consider the following bivariate data set where `p,q in ZZ^+`.

The regression line of `y` on `x` has equation `y = 2.1875x + 0.6875`.
The regression line passes through the mean point `(bar x, bar y)`.
(a) Given that `bar x = 7`, verify that `bar y = 16`.
(b) Given that `q - p = 3`, find the value of `p` and the value of `q`.
(a) `bar y = 15.3125 + 0.6875`
`bar y = 16`
(b) attempt to use `16 = frac{sum y}{n}`, to form a linear equation in `p` and 𝑞.
`16 = \frac{9+13+p+q+21}{5}` `(80=p+q+43 => p+q=37)`
attempt to solve two linear equations simultaneously for `p` and `q` (one of which is `q = p + 3`)
`16 = \frac{9+13+p+p+3+21}{5}` `(80 = 2p + 46)`
`p = 17` and `q = 20`
Question 8
Tania wishes to see whether there is any correlation between a person’s age and the number of objects on a tray which could be remembered after looking at them for a certain time.
She obtains the following table of results.

(a) Use your graphic display calculator to find the equation of the regression line of `y` on `x`.
(b) Use your equation to estimate the number of objects remembered by a person aged 28 years.
(c) Use your graphic display calculator to find the correlation coefficient `r`.
(d) Comment on your value for `r`.
(a) `a = −0.134, b = 20.9`
`y = 20.9 − 0.134x`
(b) 17 objects
accept only 17
(c) `r = −0.756`
(d) Negative and moderately strong
Question 9
(i) The following values are the product-moment correlation coefficients for the five scatter diagrams below.
0.50, –0.95, –0.60, 0.00, 0.90
Match each scatter diagram with its corresponding correlation coefficient.

(ii) The annual advertising expenditures and sales, in dollars, for a small company are listed below.

(a) Find the regression line `y = a + bx` for this data, and hence predict the annual sales that would result if 7000 dollars were spent in advertising. Give your answer to the nearest thousand dollars.
(b) If the annual advertising expenditures and annual sales figures above were converted into Japanese yen (1 dollar = 69.4017 yen) which of the following quantities would change and which would remain the same?
(iii) Intelligence Quotient (IQ) in a certain population is normally distributed with a mean of 100 and a standard deviation of 15.
(a) What percentage of the population has an IQ between 90 and 125?
(b) If two persons are chosen at random from the population, what is the probability that both have an IQ greater than 125?
(c) The mean IQ of a random group of 25 persons suffering from a certain brain disorder was found to be 95.2. Is this sufficient evidence, at the 0.05 level of significance, that people suffering from the disorder have, on average, a lower IQ than the entire population? State your null hypothesis and your alternative hypothesis, and explain your reasoning.
(iv) The manufacturer of ‘Fizz’, a popular soft drink, offered Millennium University one million dollars per year to make ‘Fizz’ the only soft drink sold on campus. A group of 140 students and teachers were consulted on whether the university should accept or reject the offer. Their responses are summarized in the following table:

(a) Construct a table of expected frequencies assuming that the attitude towards the offer is independent of whether the person is a student or a teacher. Do not apply Yates’ continuity correction.
(b) Calculate `\chi^2` for this set of data.
(c) Based on the `chi^2` test at 0.01 level of significance, which of the following conclusions may be drawn?
Students are more favourable than teachers towards accepting the offer.
Students are less favourable than teachers towards accepting the offer.
Acceptance or rejection does not depend on whether the person is a student or a teacher.
Explain briefly the reasons for your answers.
(i)

(ii)
(a) Regression line (after scaling down `x` and `y` by a factor of 1000)
`y = 24250 + 18.5x` (or `y = 24.25 + 18.5x`)
for `x = 7000, y = 153750` (or for `x = 7, y = 153.75`)
Predicted sales: 154,000 dollars (to the nearest 1000 dollars)
(b) A change of scale `x′ = kx` and `y′ = ky` affects the mean and the standard deviation but not the correlation coefficient nor the gradient of the regression line.
`bar x` changes
`s_x` changes
`r` remains the same
the gradient of the regression line remains the same
(iii)
(a) P(90 < `X` < 125) = 70.1%
(b) P(`X >= 125`) = 0.0475 (or 0.0478)
P(both persons having IQ `>=` 125) = (0.0475)2 (or (0,0478)2) = 0.00226 (or 0.00228)
(c) Null hypothesis (`H_0`): mean IQ of people with disorder is 100
Alternative hypothesis (`H_1`): mean IQ of people with disorder is less than 100
`P(bar X < 95.2) = P(Z < frac{95.2-100}{15/\sqrt25}) = P(Z < -1.6) = 1 - 0.9452 = 0.0548`
The probability that the sample mean is 95.2 and the null hypothesis true is 0.0548 > 0.05. Hence the evidence is not sufficient.
(iv)
(a)

(b) `chi^2 = 4.90`
(c) P(4.90) = 0.0269
0.0269 > 0.01
Conclusion (iii)
Question 10
(i) The mass of packets of a breakfast cereal is normally distributed with a mean of 750g and standard deviation of 25 g.
(a) Find the probability that a packet chosen at random has mass
(b) Two packets are chosen at random. What is the probability that both packets have a mass which is less than 740 g?
(c) The mass of 70% of the packets is more than 𝑥 grams. Find the value of `x`.
(ii) Three schools from the same city enter students for an examination in which successful candidates can achieve one of three grades: Pass, Credit, Distinction. The results are shown in the following table

It may be assumed that a student's result is independent of the school attended.
(a) The following table gives the expected frequencies for the above data.

(b) Newspapers wish to use these results to make comparisons between the schools. Based on the value of `chi^2`, decide whether there is justification for the statement 'success in the examination depends on which school is attended'. Examine the statement
(iii) A scientist is investigating the way in which the length of a metal rod varies with temperature. She reads the length 𝑦 mm at different temperatures 𝑥°C. From a set of these readings, she calculates the following results.
 `bar x = 200, bar y = 1000, s_x = 2.31, s_y = 11.7, s_{xy} = 26.1`
(a) Find
(b) Which of the following diagrams most closely resembles the set of readings taken by the scientist? Give a reason for your answer.

(i)
(a) (These answers may be obtained from a calculator or by finding `z` in each case and the corresponding area.)
`M ~` N(750, 625)
(b) Independent events
Therefore, P(both < 740) = 0.3452 = 0.119
(c) 70% have mass < 763 g
Therefore, 70% have mass of at least 750 `-` 13
`x = 737` g
(ii)
(a)
(b) `H_0`: success is independent of school attended
`H_1`: success depends on the school
`chi_{10%}^2 (4) = 7.779; chi_{5%}^2 (4) = 9.488`
Since 8.12 > 7.779 we reject `H_0` at the 10 % level
Since 8.12 < 9.488 we accept `H_0` at the 5 % level
(iii)
(a)
(b) Diagram C
It has positive gradient, and points are close together.
Question 1
The following table shows the data collected from an experiment.

The data is also represented on the following scatter diagram.

The relationship between `x` and `y` can be modelled by the regression line of `y` on `x` with equation `y=ax+b`, where `a,b in RR`.
(a) Write down the value of `a` and the value of `b`.
(b) Use this model to predict the value of `y` when `x=18`.
(c) Write down the value of `barx` and `bary`.
(d) Draw the line of best fit on the scatter diagram.
Question 2
At a café, the waiting time between ordering and receiving a cup of coffee is dependent upon the number of customers who have already ordered their coffee and are waiting to receive it.
Sarah, a regular customer, visited the café on five consecutive days. The following table shows the number of customers, `x`, ahead of Sarah who have already ordered and are waiting to receive their coffee and Sarah’s waiting time, `y` minutes.

The relationship between `x` and `y` can be modelled by the regression line of `y` on `x` with equation `y=ax+b`.
(a)
(i) Find the value of `a` and the value of `b`.
(ii) Write down the value of Pearson’s product-moment correlation coefficient, `r`.
(b) Interpret, in context, the value of `a` found in part (a)(i).
On another day, Sarah visits the café to order a coffee. Seven customers have already ordered their coffee and are waiting to receive it.
(c) Use the result from part (a)(i) to estimate Sarah’s waiting time to receive her coffee.
Question 3
In Lucy’s music academy, eight students took their piano diploma examination and achieved scores out of 150. For her records, Lucy decided to record the average number of hours per week each student reported practicing in the weeks prior to their examination. These results are summarized in the table below.
(a) Find Pearson’s product-moment correlation coefficient, `r`, for these data.
(b) The relationship between the variables can be modelled by the regression equation `D = ah + b`. Write down the value of `a` and the value of `b`.
(c) One of these eight students was disappointed with her result and wished she had practiced more. Based on the given data, determine how her score could have been expected to alter had she practiced an extra five hours per week.
Question 4
The following table shows the Mathematics test scores `(x)` and the Science test scores `(y)` for a group of eight students.

The regression line of `y` on `x` for this data can be written in the form `y=ax+b`.
(a) Find the value of `a` and the value of `b`.
(b) Write down the value of the Pearson’s product-moment correlation coefficient, `r`.
(c) Use the equation of your regression line to predict the Science test score for a student who has a score of 78 on the Mathematics test. Express your answer to the nearest integer.
Question 5
A botanist is conducting an experiment which studies the growth of plants.
The heights of the plants are measured on seven different days.
The following table shows the number of days, `d`, that the experiment has been running and the average height, `h` cm, of the plants on each of those days.

(a) The regression line of `h` on `d` for this data can be written in the form `h = ad + b`
Find the value of `a` and the value of `b`.
(b) Write down the value of the Pearson’s product-moment correlation coefficient, `r`.
(c) Use your regression line to estimate the average height of the plants when the experiment has been running for 20 days.
Question 6
A class is given two tests, Test A and Test B. Each test is scored out of a total of 100 marks. The scores of the students are shown in the following table.

Let `x` be the score on Test A and `y` be the score on Test B.
The teacher finds that the equation of the regression line of `y` on `x` for these scores is `y = 0.822x + 18.4`.
(a) Find the value of the Pearson’s product-moment correlation coefficient, `r`.
Giovanni was absent for Test A and Paulo was absent for Test B.
The teacher uses the regression line of `y` on `x` to estimate the missing scores.
Paulo scored 10 on Test A.
The teacher estimated his score on Test B to be 27 to the nearest integer using the following calculation:
`y = 0.822(10) + 18.4 ~~ 27`
(b) Give a reason why this method is not appropriate for Paulo.
Giovanni scored 90 on Test B.
The teacher estimated his score on Test A to be 87 to the nearest integer using the following calculation:
`90 = 0.822x + 18.4`, so `x = frac{90-18.4}{0.822}~~87`
(c)
(i) Give a reason why this method is not appropriate for Giovanni.
(ii) Use an appropriate method to show that the estimated Test A score for Giovanni is 86 to the nearest integer.
Question 7
Consider the following bivariate data set where `p,q in ZZ^+`.

The regression line of `y` on `x` has equation `y = 2.1875x + 0.6875`.
The regression line passes through the mean point `(bar x, bar y)`.
(a) Given that `bar x = 7`, verify that `bar y = 16`.
(b) Given that `q - p = 3`, find the value of `p` and the value of `q`.
Question 8
Tania wishes to see whether there is any correlation between a person’s age and the number of objects on a tray which could be remembered after looking at them for a certain time.
She obtains the following table of results.

(a) Use your graphic display calculator to find the equation of the regression line of `y` on `x`.
(b) Use your equation to estimate the number of objects remembered by a person aged 28 years.
(c) Use your graphic display calculator to find the correlation coefficient `r`.
(d) Comment on your value for `r`.
Question 9
(i) The following values are the product-moment correlation coefficients for the five scatter diagrams below.
0.50, –0.95, –0.60, 0.00, 0.90
Match each scatter diagram with its corresponding correlation coefficient.

(ii) The annual advertising expenditures and sales, in dollars, for a small company are listed below.

(a) Find the regression line `y = a + bx` for this data, and hence predict the annual sales that would result if 7000 dollars were spent in advertising. Give your answer to the nearest thousand dollars.
(b) If the annual advertising expenditures and annual sales figures above were converted into Japanese yen (1 dollar = 69.4017 yen) which of the following quantities would change and which would remain the same?
(iii) Intelligence Quotient (IQ) in a certain population is normally distributed with a mean of 100 and a standard deviation of 15.
(a) What percentage of the population has an IQ between 90 and 125?
(b) If two persons are chosen at random from the population, what is the probability that both have an IQ greater than 125?
(c) The mean IQ of a random group of 25 persons suffering from a certain brain disorder was found to be 95.2. Is this sufficient evidence, at the 0.05 level of significance, that people suffering from the disorder have, on average, a lower IQ than the entire population? State your null hypothesis and your alternative hypothesis, and explain your reasoning.
(iv) The manufacturer of ‘Fizz’, a popular soft drink, offered Millennium University one million dollars per year to make ‘Fizz’ the only soft drink sold on campus. A group of 140 students and teachers were consulted on whether the university should accept or reject the offer. Their responses are summarized in the following table:

(a) Construct a table of expected frequencies assuming that the attitude towards the offer is independent of whether the person is a student or a teacher. Do not apply Yates’ continuity correction.
(b) Calculate `\chi^2` for this set of data.
(c) Based on the `chi^2` test at 0.01 level of significance, which of the following conclusions may be drawn?
Students are more favourable than teachers towards accepting the offer.
Students are less favourable than teachers towards accepting the offer.
Acceptance or rejection does not depend on whether the person is a student or a teacher.
Explain briefly the reasons for your answers.
Question 10
(i) The mass of packets of a breakfast cereal is normally distributed with a mean of 750g and standard deviation of 25 g.
(a) Find the probability that a packet chosen at random has mass
(b) Two packets are chosen at random. What is the probability that both packets have a mass which is less than 740 g?
(c) The mass of 70% of the packets is more than 𝑥 grams. Find the value of `x`.
(ii) Three schools from the same city enter students for an examination in which successful candidates can achieve one of three grades: Pass, Credit, Distinction. The results are shown in the following table

It may be assumed that a student's result is independent of the school attended.
(a) The following table gives the expected frequencies for the above data.

(b) Newspapers wish to use these results to make comparisons between the schools. Based on the value of `chi^2`, decide whether there is justification for the statement 'success in the examination depends on which school is attended'. Examine the statement
(iii) A scientist is investigating the way in which the length of a metal rod varies with temperature. She reads the length 𝑦 mm at different temperatures 𝑥°C. From a set of these readings, she calculates the following results.
 `bar x = 200, bar y = 1000, s_x = 2.31, s_y = 11.7, s_{xy} = 26.1`
(a) Find
(b) Which of the following diagrams most closely resembles the set of readings taken by the scientist? Give a reason for your answer.
