Adding two set of samples with standard deviation confusions

mattkunq · Jul 22, 2012

Hello People,
I'm and just somewhat confused about this topic.
Lets say I have a sample set A with sample size= 101 mean =10 and sample stdv of 1

then sample set B with sample size= 100 Mean=15 with sample stdv of 2.

If i add these two samples sets together I should get a new stdv that is no smaller than the smallest stdv of the two. Right?

If you don't agree please tell so I know I am wrong but if you do think that it is true, then here is my confusion.

Fourier Series and make a straight line say(y=1) a sum of different sine/cosine waves.
Now if I take the x values as the sample names and the y-axis values as the test values of that sample. I can calculate a sample stdv for each wave. If i add the whole Fourier series together and get a straight line I would get a Stdv of zero. Which is less thant whatever the individual cosine/sine waves stdv were.

Thank you =)

I took one engineering stats course.

Stephen Tashi · Jul 22, 2012

Let's skip the Fourier series and look at a simpler example.

Suppose the measurements in set A are {A1=-2, A2=0, A3=2}
and the measurements in set B are {B1 = 2, B2 = 0, B3= -2}
Then the measurements like the ones you want to consider are those in the set C given by:
{C1 = A1+B1 = 0, C2 = A2+B2 = 0, C3 =A3 + B3 = 0 }

"Standard deviation" is an ambiguous term. Among it's possible meanings are:

1. The standard deviation of a probability distribution, also called the "population standard deviation".

2. The sample standard deviation, which is a formula that specifies a function of the values of a random sample from a probability distribution.

3. A particular value of the sample standard deviation that resulted from one particular sample (e.g. a specific number instead of a function).

4. An estimator of the poplation standard deviation, wihich is some function of the values in a random sample.

5. A particular value of an estimator of the population standard deviation

6. The mean square deviation of a function f(x) computed over an interval (such as the interval of one period of a periodic function).

Is your question using the term "standard deviation" in two different ways?

mathman · Jul 23, 2012

mattkunq said:

Hello People,
I'm and just somewhat confused about this topic.
Lets say I have a sample set A with sample size= 101 mean =10 and sample stdv of 1

then sample set B with sample size= 100 Mean=15 with sample stdv of 2.

If i add these two samples sets together I should get a new stdv that is no smaller than the smallest stdv of the two. Right?

If you don't agree please tell so I know I am wrong but if you do think that it is true, then here is my confusion.

Fourier Series and make a straight line say(y=1) a sum of different sine/cosine waves.
Now if I take the x values as the sample names and the y-axis values as the test values of that sample. I can calculate a sample stdv for each wave. If i add the whole Fourier series together and get a straight line I would get a Stdv of zero. Which is less thant whatever the individual cosine/sine waves stdv were.

Thank you =)

I took one engineering stats course.

The best way to get the combined mean and standard deviation is to reconstruct the original sum and sum of squares for each set of samples and then combine them to compute a mean and standard deviation for the totality of samples. My calculation led to a mean of 12.49 and a standard deviation of 2.957. Qualitatively the standard deviation is larger because of the significant difference in the means of the two sets of samples.

mattkunq · Jul 23, 2012

Hello People,
I have figured it out. When I say standard deviation I mean the follow equation is applied for the sample:
[1/(n-1)]*[(sum of x^2)-n*samplemean^2)]

So my confusion was the difference between combining two sample sets together and adding the two sample outputs respectively. Essentially stacking them. Like adding two inverted colored checker boards to one homogenous sheet of color.

As appose to puting the two different checkers side by side and evaluate the grey level sample standard deviation of that.

Thanks though! =)

blue_raver22 · Jul 30, 2012

I would like to clarify a few points regarding your question. First, when adding two sets of samples, the standard deviation of the combined set will depend on the correlation between the two sets. If the two sets are positively correlated, the standard deviation of the combined set will be larger than the smallest standard deviation of the two sets. However, if the two sets are negatively correlated, the standard deviation of the combined set will be smaller than the smallest standard deviation of the two sets.

In your example, it is not clear whether the two sets A and B are correlated or not. If they are not correlated, then your statement is correct. However, if they are correlated, the standard deviation of the combined set will depend on the strength of the correlation.

Regarding your confusion about the Fourier series, it is important to note that the standard deviation is a measure of the spread of a set of data points around the mean. In the case of a straight line, there is no spread of data points, hence the standard deviation is zero. However, this does not mean that the individual cosine/sine waves have a standard deviation of zero. Each wave may have a different standard deviation, and when combined, the overall standard deviation will depend on the correlation between the waves.

I hope this clarifies your confusion. It is important to carefully consider the correlation between data sets when adding them together and interpreting the resulting standard deviation. Additionally, the standard deviation is just one measure of variability and should be considered alongside other measures when analyzing data.

Adding two set of samples with standard deviation confusions

What is the purpose of adding two sets of samples with standard deviation confusions?

How do you calculate the standard deviation of the combined set?

What are some common mistakes when adding two sets of samples with standard deviation confusions?

Can you add two sets of samples with different sample sizes and standard deviations?

How does adding two sets of samples with standard deviation confusions help in scientific research?

Similar threads

Hot Threads

Recent Insights