Statistics Sum of Squares x*y Proof

Click For Summary

Homework Help Overview

The discussion revolves around proving an identity involving the sum of products of deviations from the mean in statistics, specifically focusing on the relationship between the sums of products of two variables, \(x\) and \(y\).

Discussion Character

  • Exploratory, Mathematical reasoning, Assumption checking

Approaches and Questions Raised

  • Participants attempt to expand the sum of products and manipulate the expressions to find a proof. Some express confusion about the correctness of their expansions and seek clarification on specific terms. Others suggest examining small examples to better understand the relationships involved.

Discussion Status

There is ongoing exploration of the mathematical relationships between the sums and means of the variables. Some participants have made progress in their proofs and are discussing the implications of their findings, while others are still seeking clarity on certain steps and concepts.

Contextual Notes

Participants are working under the constraints of homework rules, which may limit the amount of direct assistance they can receive. There is a focus on understanding the underlying principles rather than simply arriving at a solution.

laz0r
Messages
17
Reaction score
0

Homework Statement



Prove that

\sum[(x_{i} - \overline{x})(y_{i} - \overline{y})] = \sum[(x_{i} - \overline{x})y_{i}]


Homework Equations


None.


The Attempt at a Solution


I tried using the fact that the sum of the mean values is just the mean value, because the sum of a constant is simply a constant, and I expanded out the first sum, but I didn't end up anywhere.

\sum[x_{i}y_{i}] - \overline{y}\sum(x_{i}) - \overline{x}\sum(y_{i}) + \overline(x)\overline(y)

I have no idea where to go from here, so some inspiration would be much appreciated.
 
Physics news on Phys.org
laz0r said:

Homework Statement



Prove that

\sum[(x_{i} - \overline{x})(y_{i} - \overline{y})] = \sum[(x_{i} - \overline{x})y_{i}]


Homework Equations


None.


The Attempt at a Solution


I tried using the fact that the sum of the mean values is just the mean value, because the sum of a constant is simply a constant, and I expanded out the first sum, but I didn't end up anywhere.

\sum[x_{i}y_{i}] - \overline{y}\sum(x_{i}) - \overline{x}\sum(y_{i}) + \overline(x)\overline(y)

I have no idea where to go from here, so some inspiration would be much appreciated.

I assume you have n values ##x_1, \ldots, x_n## and ##y_1, \ldots,y_n##. In that case your last term above, ##\bar{x} \bar{y}## is incorrect. Can You see why? You can also do other simplifications, but if I say more I will essentially be doing the question for you.
 
laz0r said:

Homework Statement



Prove that

\sum[(x_{i} - \overline{x})(y_{i} - \overline{y})] = \sum[(x_{i} - \overline{x})y_{i}]


Homework Equations


None.


The Attempt at a Solution


I tried using the fact that the sum of the mean values is just the mean value, because the sum of a constant is simply a constant, and I expanded out the first sum, but I didn't end up anywhere.

\sum[x_{i}y_{i}] - \overline{y}\sum(x_{i}) - \overline{x}\sum(y_{i}) + \overline(x)\overline(y)

I have no idea where to go from here, so some inspiration would be much appreciated.

Notice that your first and third terms give what is on the right side of the equation. So you are left with showing$$
-\sum x_i\bar y + \sum \bar y \bar x = 0$$You're pretty close...
 
Ok, so I actually did end up getting somewhere and didn't realize it =)

EDIT:

I've done proof

\sum[(x_{i}-\overline{x})y_{i}] + \sum[\overline{y}(\overline{x} - x_{i})]

Playing around in excel has taught me that \sum[\overline{y}(\overline{x} - x_{i})] is actually just equal to zero, because your subtracting an array of x values from the mean and then just multiplying them by a constant, \overline{y}, then adding them, which nets you zero.
 
Last edited:
laz0r said:
Ok, so I actually did end up getting somewhere and didn't realize it =)

I've done most of the proof and I've gotten up until this point

\sum[(x_{i}-\overline{x})y_{i}] + \sum[\overline{y}(\overline{x} - x_{i})]

So I take it that either \overline{y} or (\overline{x} - x_{i}) must equal zero, but I'm not entirely sure why.

Try writing out a few small examples, such as for n = 2 or n = 3. These are small enough that you can write down everything explicitly and see exactly what is going on.
 
  • Like
Likes   Reactions: 1 person
I've figured out why, but I'm not sure how to explain it symbolically, do you think I would need to elaborate more or is my edited explanation good enough in your view?

Thanks for the help!
 
##\bar x## and ##\sum x_i## are related to each other. How?
 
  • Like
Likes   Reactions: 1 person
LCKurtz said:
##\bar x## and ##\sum x_i## are related to each other. How?

I'm a little bit rusty on my operation of sums, so excuse me if this is incorrect, but can I do this?

<br /> <br /> [(\overline{y})/n][\sum x_{i}] - \overline{y}[\sum x_{i}]<br /> <br /> = [(\overline{y}^2)/n][\sum(x_{i} - x_{i})]<br /> <br />

Then what tends to zero appears to be obvious
 
LCKurtz said:
##\bar x## and ##\sum x_i## are related to each other. How?

laz0r said:
I'm a little bit rusty on my operation of sums, so excuse me if this is incorrect, but can I do this?

<br /> <br /> [(\overline{y})/n][\sum x_{i}] - \overline{y}[\sum x_{i}]<br /> <br /> = [(\overline{y}^2)/n][\sum(x_{i} - x_{i})]<br /> <br />

Then what tends to zero appears to be obvious

I don't follow that at all. Why don't you just answer my question at the top?
 
  • #10
LCKurtz said:
I don't follow that at all. Why don't you just answer my question at the top?

They're related by

<br /> <br /> \overline{x} = [\sum x_{i}]/n<br /> <br />
 
  • #11
laz0r said:
They're related by

<br /> <br /> \overline{x} = [\sum x_{i}]/n<br /> <br />

So you could use ##\sum x_i = n\bar x##. Do you see how you might use that to get$$
\sum \bar y(\bar x - x_i) = 0\text{?}$$
 
  • #12
LCKurtz said:
So you could use ##\sum x_i = n\bar x##. Do you see how you might use that to get$$
\sum \bar y(\bar x - x_i) = 0\text{?}$$

Wow, it appears my brain took a nap today.

Thank you for your help lol, I appreciate it.
 
  • #13
So amuse me and show the finishing steps...
 
  • #14
Sorry, I'm very slow at typing this latex code..

<br /> \overline{y}*[\sum(\overline{x} - x_{i})] = 0<br />

<br /> [\sum(\overline{x} - x_{i})] = 0<br />

<br /> \sum(\overline{x}) - \sum(x_{i}) = 0<br />

<br /> n(\overline{x}) - n(\overline{x}) = 0<br />

LHS = RHS

used

<br /> \sum(c) = n*c<br />

Where n is the upper bound and 1 is the lower bound of the sum
 
  • #15
Good. Thanks for finishing it up.
 

Similar threads

  • · Replies 3 ·
Replies
3
Views
4K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 15 ·
Replies
15
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 13 ·
Replies
13
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 2 ·
Replies
2
Views
4K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K