Statistics Sum of Squares x*y Proof

laz0r
Messages
17
Reaction score
0

Homework Statement



Prove that

\sum[(x_{i} - \overline{x})(y_{i} - \overline{y})] = \sum[(x_{i} - \overline{x})y_{i}]


Homework Equations


None.


The Attempt at a Solution


I tried using the fact that the sum of the mean values is just the mean value, because the sum of a constant is simply a constant, and I expanded out the first sum, but I didn't end up anywhere.

\sum[x_{i}y_{i}] - \overline{y}\sum(x_{i}) - \overline{x}\sum(y_{i}) + \overline(x)\overline(y)

I have no idea where to go from here, so some inspiration would be much appreciated.
 
Physics news on Phys.org
laz0r said:

Homework Statement



Prove that

\sum[(x_{i} - \overline{x})(y_{i} - \overline{y})] = \sum[(x_{i} - \overline{x})y_{i}]


Homework Equations


None.


The Attempt at a Solution


I tried using the fact that the sum of the mean values is just the mean value, because the sum of a constant is simply a constant, and I expanded out the first sum, but I didn't end up anywhere.

\sum[x_{i}y_{i}] - \overline{y}\sum(x_{i}) - \overline{x}\sum(y_{i}) + \overline(x)\overline(y)

I have no idea where to go from here, so some inspiration would be much appreciated.

I assume you have n values ##x_1, \ldots, x_n## and ##y_1, \ldots,y_n##. In that case your last term above, ##\bar{x} \bar{y}## is incorrect. Can You see why? You can also do other simplifications, but if I say more I will essentially be doing the question for you.
 
laz0r said:

Homework Statement



Prove that

\sum[(x_{i} - \overline{x})(y_{i} - \overline{y})] = \sum[(x_{i} - \overline{x})y_{i}]


Homework Equations


None.


The Attempt at a Solution


I tried using the fact that the sum of the mean values is just the mean value, because the sum of a constant is simply a constant, and I expanded out the first sum, but I didn't end up anywhere.

\sum[x_{i}y_{i}] - \overline{y}\sum(x_{i}) - \overline{x}\sum(y_{i}) + \overline(x)\overline(y)

I have no idea where to go from here, so some inspiration would be much appreciated.

Notice that your first and third terms give what is on the right side of the equation. So you are left with showing$$
-\sum x_i\bar y + \sum \bar y \bar x = 0$$You're pretty close...
 
Ok, so I actually did end up getting somewhere and didn't realize it =)

EDIT:

I've done proof

\sum[(x_{i}-\overline{x})y_{i}] + \sum[\overline{y}(\overline{x} - x_{i})]

Playing around in excel has taught me that \sum[\overline{y}(\overline{x} - x_{i})] is actually just equal to zero, because your subtracting an array of x values from the mean and then just multiplying them by a constant, \overline{y}, then adding them, which nets you zero.
 
Last edited:
laz0r said:
Ok, so I actually did end up getting somewhere and didn't realize it =)

I've done most of the proof and I've gotten up until this point

\sum[(x_{i}-\overline{x})y_{i}] + \sum[\overline{y}(\overline{x} - x_{i})]

So I take it that either \overline{y} or (\overline{x} - x_{i}) must equal zero, but I'm not entirely sure why.

Try writing out a few small examples, such as for n = 2 or n = 3. These are small enough that you can write down everything explicitly and see exactly what is going on.
 
  • Like
Likes 1 person
I've figured out why, but I'm not sure how to explain it symbolically, do you think I would need to elaborate more or is my edited explanation good enough in your view?

Thanks for the help!
 
##\bar x## and ##\sum x_i## are related to each other. How?
 
  • Like
Likes 1 person
LCKurtz said:
##\bar x## and ##\sum x_i## are related to each other. How?

I'm a little bit rusty on my operation of sums, so excuse me if this is incorrect, but can I do this?

<br /> <br /> [(\overline{y})/n][\sum x_{i}] - \overline{y}[\sum x_{i}]<br /> <br /> = [(\overline{y}^2)/n][\sum(x_{i} - x_{i})]<br /> <br />

Then what tends to zero appears to be obvious
 
LCKurtz said:
##\bar x## and ##\sum x_i## are related to each other. How?

laz0r said:
I'm a little bit rusty on my operation of sums, so excuse me if this is incorrect, but can I do this?

<br /> <br /> [(\overline{y})/n][\sum x_{i}] - \overline{y}[\sum x_{i}]<br /> <br /> = [(\overline{y}^2)/n][\sum(x_{i} - x_{i})]<br /> <br />

Then what tends to zero appears to be obvious

I don't follow that at all. Why don't you just answer my question at the top?
 
  • #10
LCKurtz said:
I don't follow that at all. Why don't you just answer my question at the top?

They're related by

<br /> <br /> \overline{x} = [\sum x_{i}]/n<br /> <br />
 
  • #11
laz0r said:
They're related by

<br /> <br /> \overline{x} = [\sum x_{i}]/n<br /> <br />

So you could use ##\sum x_i = n\bar x##. Do you see how you might use that to get$$
\sum \bar y(\bar x - x_i) = 0\text{?}$$
 
  • #12
LCKurtz said:
So you could use ##\sum x_i = n\bar x##. Do you see how you might use that to get$$
\sum \bar y(\bar x - x_i) = 0\text{?}$$

Wow, it appears my brain took a nap today.

Thank you for your help lol, I appreciate it.
 
  • #13
So amuse me and show the finishing steps...
 
  • #14
Sorry, I'm very slow at typing this latex code..

<br /> \overline{y}*[\sum(\overline{x} - x_{i})] = 0<br />

<br /> [\sum(\overline{x} - x_{i})] = 0<br />

<br /> \sum(\overline{x}) - \sum(x_{i}) = 0<br />

<br /> n(\overline{x}) - n(\overline{x}) = 0<br />

LHS = RHS

used

<br /> \sum(c) = n*c<br />

Where n is the upper bound and 1 is the lower bound of the sum
 
  • #15
Good. Thanks for finishing it up.
 

Similar threads

Back
Top