SOLUTION: Good afternoon, please help me solve this problem. During the baking of a certain type of bread roll, each bread goes through a series of heat process. The length of time spent un

Algebra ->  Probability-and-statistics -> SOLUTION: Good afternoon, please help me solve this problem. During the baking of a certain type of bread roll, each bread goes through a series of heat process. The length of time spent un      Log On


   



Question 1208428: Good afternoon, please help me solve this problem.
During the baking of a certain type of bread roll, each bread goes through a series of heat process. The length of time spent under this heat treatment is related to the lifespan of the bread rolls. A sample of eight bread rolls that underwent different baking times was selected and the life span (hours) of each roll was recorded as follows:
Length of time 18 13 18 15 10 12 8 4
Life span 23 20 18 16 14 11 10 7
a) Calculate the Pearson’s correlation coefficient and explain the result [10 Marks

Answer by math_tutor2020(3817) About Me  (Show Source):
You can put this solution on YOUR website!

Answer: 0.8703
This value is approximate.

Explanation

This value can be quickly calculated using technology.
Here are some options:
  • Use the LinReg command on a TI83 (or similar). Make sure that Diagnostic is set to "on".
  • Use the the CORREL command in a spreadsheet.
  • Input the data into the spreadsheet mode of GeoGebra. Refer to this video for an example.
  • Search out "pearson correlation coefficient calculator" to find many free tools online. Two search results I found were this and this (among many other results).
  • Use a programming language such as R or Python.
  • There are many other technology options. I encourage the student to explore his/her favorite.
If you aren't allowed to use a specialized correlation coefficient calculator, then check out any of the three methods shown below.

----------------------------------------------------------------------------------------
Method 1

Define the following
x = length of baking time (in minutes)
y = life span (in hours)

The given table is
x18131815101284
y232018161411107


There are n = 8 columns of values.
In other words, there are 8 pairs of (x,y) values.

Add up the x values and divide by n = 8 to find the sample mean of the x values.
xbar = sample mean of x values = x with a horizontal bar over top
xbar = (sum of x values)/(number of x values)
xbar = (18+13+18+15+10+12+8+4)/8
xbar = 98/8
xbar = 12.25
If you are using spreadsheet software, then the function you can use is called Average

Apply the same idea to the y values.
ybar = (sum of y values)/(number of y values)
ybar = (23+20+18+16+14+11+10+7)/8
ybar = 119/8
ybar = 14.875

Introduce a new row labeled xy. This is the product of the x and y values.
For example, we'll have x*y = 18*23 = 414 as the first entry of this row.
x18131815101284
y232018161411107
xy4142603242401401328028

Add up the items in this new row to get Sum(xy) = 414+260+324+240+140+132+80+28 = 1618
Spreadsheet software is strongly recommended.

The last two things we need are:
SD(x)
SD(y)
which represent the sample standard deviations of the x and y values respectively.

You could find these by hand (see part d of this page) but I'll use technology to speed things up.
SD(x) = 4.862392
SD(y) = 5.409978
These values are approximate. I rounded each to 6 decimal places.
Make sure you use the sample standard deviation and NOT the population standard deviation.


Let's calculate the correlation coefficient.






Many textbooks will round to 4 decimal places. If your teacher instructs otherwise, then be sure to follow said instructions.

----------------------------------------------------------------------------------------
Method 2

Return back to this table.
x18131815101284
y232018161411107
xy4142603242401401328028


Let's introduce rows labeled x^2 and y^2.
As their name suggests, we'll square each value.
Eg: x^2 = 18^2 = 324
x18131815101284
y232018161411107
xy4142603242401401328028
x^23241693242251001446416
y^252940032425619612110049


Define the following
A = sum of the x values = 98
B = sum of the y values = 119
C = sum of the xy values = 1618
D = sum of the x^2 values = 1366
E = sum of the y^2 values = 1975
Spreadsheet software can quickly compute each of these.

We can now compute the correlation coefficient.






----------------------------------------------------------------------------------------
Method 3

Let's return back to the original table.
x18131815101284
y232018161411107


Recall these previous calculations
xbar = 12.25
SD(x) = 4.862392

We'll use them to determine the z score of each x value.
For instance, if x = 18, then,
z = (x-xbar)/( SD(x) )
z = (18-12.25)/( 4.862392 )
z = 1.182545545485
z = 1.182546
Repeat similar calculations for the other x values. This new row will be labeled Zx to indicate the z scores of the x values.
x18131815101284
y232018161411107
Zx1.1825460.1542451.1825460.565565-0.462735-0.051415-0.874055-1.696696


For the y values we have
ybar = 14.875
SD(y) = 5.409978 (approximate)
Here's what it looks like to introduce the z scores for the y values.
x18131815101284
y232018161411107
Zx1.1825460.1542451.1825460.565565-0.462735-0.051415-0.874055-1.696696
Zy1.5018550.9473240.5776360.207949-0.161738-0.716269-0.901113-1.455644

As mentioned earlier, spreadsheet software is strongly recommended.

The next thing we'll do is multiply each Zx and Zy value to form a new row.
Example: 1.182546*1.501855 = 1.776013 approximately
x18131815101284
y232018161411107
Zx1.1825460.1542451.1825460.565565-0.462735-0.051415-0.874055-1.696696
Zy1.5018550.9473240.5776360.207949-0.161738-0.716269-0.901113-1.455644
Zx*Zy1.7760130.146120.6830810.1176090.0748420.0368270.7876222.469785

The decimal values are approximate.

The values in that bottom row add to 6.091899 which is also approximate.

We can finally calculate the r value.
r = Sum(Zx*Zy)/(n-1)
r = 6.091899/(8-1)
r = 0.870271285714
r = 0.8703

Other methods are possible.
I'll let the student explore them.

----------------------------------------------------------------------------------------

Further Reading
https://www.statology.org/pearson-correlation-coefficient/
https://statisticsbyjim.com/basics/correlation-coefficient-formula/
https://www.statisticshowto.com/probability-and-statistics/correlation-coefficient-formula/
https://www.howtoexcel.org/correlation-coefficient/

More Practice
https://www.algebra.com/algebra/homework/word/evaluation/Evaluation_Word_Problems.faq.question.1196643.html