SOLUTION: TABLE: https://imagizer.imageshack.com/img924/3557/S7r4QJ.jpg Disk drives have been getting larger. Their capacity is now often given in terabytes​ (TB) where 1TB=1000 ​giga

Algebra.Com
Question 1196643: TABLE: https://imagizer.imageshack.com/img924/3557/S7r4QJ.jpg
Disk drives have been getting larger. Their capacity is now often given in terabytes​ (TB) where 1TB=1000 ​gigabytes, or about a trillion bytes. A search of prices for external disk drives on a large shopping website in a recent year found the accompanying data. Find and interpret the value of R^2

Answer by math_tutor2020(3816)   (Show Source): You can put this solution on YOUR website!

If you are in a hurry, then you can use technology to quickly compute the r and r^2 values.
Two examples would be the LinReg command on a TI83 (or similar) and using the the CORREL command in a spreadsheet.
There are many other options to choose from. Feel free to search out your favorite.
You should find these approximations
r = 0.9878
r^2 = 0.9757
Since r^2 is very close to 1, this makes the linear regression a good fit. Approximately 97.57% of the variation in x explains the variation in y.


If you have more time to go over the math, then there are various ways to calculate the correlation coefficient.
I'll go over two slightly different methods.

-------------------------------------------------------------------------------------------------------------

Method 1

x = capacity of hard drive in terabytes (TB)
y = price in dollars

Given info:
n = 9 = sample size = number of x,y pairs
xbar = 7.611
ybar = 786.49
SD(x) = 9.854
SD(y) = 1417.82

The term "xbar" refers to the horizontal bar over the x, i.e.
A similar story is with ybar as well.

Given Data
xy
0.560.99
177.99
2112.97
3110.99
4151.99
6425.34
8597.11
121081.99
324459
Form a third column which is the product of the x and y columns
Eg: 0.5*60.99 = 30.495 in the first row
xyxy
0.560.9930.495
177.9977.99
2112.97225.94
3110.99332.97
4151.99607.96
6425.342552.04
8597.114776.88
121081.9912983.88
324459142688
I strongly recommend using spreadsheet software.
It's not only fast and efficient, but also something that is expected in real world applications.
I'm using LibreOffice but you could use Excel or Google Sheets or whichever app you prefer most.

Add up the values in the xy column to get 164,276.155
Then we subtract off the value of n*xbar*ybar = 9*7.611*786.49 = 53,873.77851

So we have:
Sum(xy) - n*xbar*ybar = 164,276.155 - 53,873.77851 = 110,402.37649

We'll divide that result over the product of the given standard deviation values, multiplied with (n-1)
So,
(n-1)*SD(x)*SD(y) = (9-1)*(9.854)*(1417.82) = 111,769.58624


Therefore,
r = (110,402.37649)/(111,769.58624)
r = 0.98776760480204
r^2 = (0.98776760480204)^2
r^2 = 0.97568484109636
r^2 = 0.9757
which is approximate.

Since r^2 is very close to 1, this makes the linear regression a good fit. Approximately 97.57% of the variation in x explains the variation in y.

Note: The formula I used just now is


-------------------------------------------------------------------------------------------------------------

Method 2

x = capacity of hard drive in terabytes (TB)
y = price in dollars

Given info:
n = 9 = sample size = number of x,y pairs
xbar = 7.611
ybar = 786.49
SD(x) = 9.854
SD(y) = 1417.82

Given Data
xy
0.560.99
177.99
2112.97
3110.99
4151.99
6425.34
8597.11
121081.99
324459
Instead of an xy column, we'll form the Zx column
Zx = (x - xbar)/(SD(x))
We're computing the z score for each x term

For instance, in the first row we have
Zx = (x-xbar)/(SD(x))
Zx = (0.5-7.611)/(9.854)
Zx = -0.72163588390501
Zx = -0.721636
Do the same thing for each item in the x column. The values of xbar and SD(x) will remain constant.

This is what the updated table looks like
xyZx
0.560.99-0.721636
177.99-0.670895
2112.97-0.569413
3110.99-0.467932
4151.99-0.36645
6425.34-0.163487
8597.110.039476
121081.990.445403
3244592.475036
Follow similar steps for the Zy column
For example, we'll have the following calculation for the 1st row.
Zy = (y - ybar)/(SD(y))
Zy = (60.99 - 786.49)/(1417.82)
Zy = -0.511701

We have this so far
xyZxZy
0.560.99-0.721636-0.511701
177.99-0.670895-0.499711
2112.97-0.569413-0.475039
3110.99-0.467932-0.476436
4151.99-0.36645-0.447518
6425.34-0.163487-0.254722
8597.110.039476-0.133571
121081.990.4454030.208419
3244592.4750362.590251
Then we'll multiply the Zx and Zy items for each row.
Eg: Zx*Zy = (-0.721636)*(-0.511701) = 0.369262 in row one

This is what the fully completed table looks like
xyZxZyZxZy
0.560.99-0.721636-0.5117010.369262
177.99-0.670895-0.4997110.335254
2112.97-0.569413-0.4750390.270493
3110.99-0.467932-0.4764360.22294
4151.99-0.36645-0.4475180.163993
6425.34-0.163487-0.2547220.041644
8597.110.039476-0.133571-0.005273
121081.990.4454030.2084190.09283
3244592.4750362.5902516.410964

Add up the values in the final column and you should get roughly 7.902107

So,
r = Sum(ZxZy)/(n-1)
r = 7.902107/(9-1)
r = 7.902107/8
r = 0.987763375
r^2 = (0.987763375)^2
r^2 = 0.9756764849914
r^2 = 0.9757

Since r^2 is very close to 1, this makes the linear regression a good fit. Approximately 97.57% of the variation in x explains the variation in y.

Answer: r^2 = 0.9757 approximately

RELATED QUESTIONS

DATA TABLE : https://imagizer.imageshack.com/img923/3653/Su1KFx.jpg Disk drives have... (answered by Theo)
QUESTIONS: https://imagizer.imageshack.com/img924/1530/bpFkzk.png... (answered by onyulee)
DATA QUESTIONS: https://imagizer.imageshack.com/img924/2199/1BaYnT.png... (answered by onyulee)
ATTACHED TABLE: https://imagizer.imageshack.com/img924/2894/DRMZfW.jpg A company must... (answered by Theo)
PICTURE FOR PROBLEM https://imagizer.imageshack.com/img924/5244/Jq14To.jpg In a year... (answered by ElectricPavlov)
THE LINK FOR THE CONTINGENCY TABLE :... (answered by ElectricPavlov)
ATTACHED IS THE CHART:https://imagizer.imageshack.com/img924/482/OjKL0M.jpg As the... (answered by MathLover1)
HERE IS THE TABLE: https://imagizer.imageshack.com/img923/3314/zi0hjD.jpg A sample... (answered by math_tutor2020)
HERE IS THE IMAGE LINK: https://imagizer.imageshack.com/img922/521/KvHWN3.jpg Data... (answered by Theo)