Adding two set of samples with standard deviation confusions

In summary, the conversation discusses the concept of standard deviation and its various meanings and applications. The individual's confusion is clarified through a simple example and by distinguishing between combining two sample sets and adding the two sample outputs. The conversation ends with the individual expressing gratitude for the discussion.
  • #1
mattkunq
14
0
Hello People,
I'm and just somewhat confused about this topic.
Lets say I have a sample set A with sample size= 101 mean =10 and sample stdv of 1

then sample set B with sample size= 100 Mean=15 with sample stdv of 2.

If i add these two samples sets together I should get a new stdv that is no smaller than the smallest stdv of the two. Right?

If you don't agree please tell so I know I am wrong but if you do think that it is true, then here is my confusion.

Fourier Series and make a straight line say(y=1) a sum of different sine/cosine waves.
Now if I take the x values as the sample names and the y-axis values as the test values of that sample. I can calculate a sample stdv for each wave. If i add the whole Fourier series together and get a straight line I would get a Stdv of zero. Which is less thant whatever the individual cosine/sine waves stdv were.

Thank you =)

I took one engineering stats course.
 
Physics news on Phys.org
  • #2
Let's skip the Fourier series and look at a simpler example.

Suppose the measurements in set A are {A1=-2, A2=0, A3=2}
and the measurements in set B are {B1 = 2, B2 = 0, B3= -2}
Then the measurements like the ones you want to consider are those in the set C given by:
{C1 = A1+B1 = 0, C2 = A2+B2 = 0, C3 =A3 + B3 = 0 }

"Standard deviation" is an ambiguous term. Among it's possible meanings are:

1. The standard deviation of a probability distribution, also called the "population standard deviation".

2. The sample standard deviation, which is a formula that specifies a function of the values of a random sample from a probability distribution.

3. A particular value of the sample standard deviation that resulted from one particular sample (e.g. a specific number instead of a function).

4. An estimator of the poplation standard deviation, wihich is some function of the values in a random sample.

5. A particular value of an estimator of the population standard deviation

6. The mean square deviation of a function f(x) computed over an interval (such as the interval of one period of a periodic function).

Is your question using the term "standard deviation" in two different ways?
 
  • #3
mattkunq said:
Hello People,
I'm and just somewhat confused about this topic.
Lets say I have a sample set A with sample size= 101 mean =10 and sample stdv of 1

then sample set B with sample size= 100 Mean=15 with sample stdv of 2.

If i add these two samples sets together I should get a new stdv that is no smaller than the smallest stdv of the two. Right?

If you don't agree please tell so I know I am wrong but if you do think that it is true, then here is my confusion.

Fourier Series and make a straight line say(y=1) a sum of different sine/cosine waves.
Now if I take the x values as the sample names and the y-axis values as the test values of that sample. I can calculate a sample stdv for each wave. If i add the whole Fourier series together and get a straight line I would get a Stdv of zero. Which is less thant whatever the individual cosine/sine waves stdv were.

Thank you =)

I took one engineering stats course.
The best way to get the combined mean and standard deviation is to reconstruct the original sum and sum of squares for each set of samples and then combine them to compute a mean and standard deviation for the totality of samples. My calculation led to a mean of 12.49 and a standard deviation of 2.957. Qualitatively the standard deviation is larger because of the significant difference in the means of the two sets of samples.
 
  • #4
Hello People,
I have figured it out. When I say standard deviation I mean the follow equation is applied for the sample:
[1/(n-1)]*[(sum of x^2)-n*samplemean^2)]

So my confusion was the difference between combining two sample sets together and adding the two sample outputs respectively. Essentially stacking them. Like adding two inverted colored checker boards to one homogenous sheet of color.

As appose to puting the two different checkers side by side and evaluate the grey level sample standard deviation of that.

Thanks though! =)
 
  • #5


I would like to clarify a few points regarding your question. First, when adding two sets of samples, the standard deviation of the combined set will depend on the correlation between the two sets. If the two sets are positively correlated, the standard deviation of the combined set will be larger than the smallest standard deviation of the two sets. However, if the two sets are negatively correlated, the standard deviation of the combined set will be smaller than the smallest standard deviation of the two sets.

In your example, it is not clear whether the two sets A and B are correlated or not. If they are not correlated, then your statement is correct. However, if they are correlated, the standard deviation of the combined set will depend on the strength of the correlation.

Regarding your confusion about the Fourier series, it is important to note that the standard deviation is a measure of the spread of a set of data points around the mean. In the case of a straight line, there is no spread of data points, hence the standard deviation is zero. However, this does not mean that the individual cosine/sine waves have a standard deviation of zero. Each wave may have a different standard deviation, and when combined, the overall standard deviation will depend on the correlation between the waves.

I hope this clarifies your confusion. It is important to carefully consider the correlation between data sets when adding them together and interpreting the resulting standard deviation. Additionally, the standard deviation is just one measure of variability and should be considered alongside other measures when analyzing data.
 

What is the purpose of adding two sets of samples with standard deviation confusions?

The purpose of adding two sets of samples with standard deviation confusions is to determine the combined effect of the two sets of data. This can help to identify any patterns or relationships between the two sets of data and determine if they have a significant impact on each other.

How do you calculate the standard deviation of the combined set?

To calculate the standard deviation of the combined set, you will need to first find the mean of the combined data. Then, for each data point in the combined set, subtract the mean and square the result. Next, find the sum of all of these squared values and divide by the total number of data points. Finally, take the square root of this value to find the standard deviation.

What are some common mistakes when adding two sets of samples with standard deviation confusions?

Some common mistakes when adding two sets of samples with standard deviation confusions include forgetting to include all of the data points in the calculation, using the wrong formula, or rounding too early in the calculation process. It is important to carefully follow the steps for calculating standard deviation to avoid these mistakes.

Can you add two sets of samples with different sample sizes and standard deviations?

Yes, you can add two sets of samples with different sample sizes and standard deviations. However, it is important to note that the larger set will have a greater influence on the combined standard deviation. This means that if one set has a much larger sample size or standard deviation, it will have a larger impact on the combined standard deviation compared to the other set.

How does adding two sets of samples with standard deviation confusions help in scientific research?

Adding two sets of samples with standard deviation confusions can help in scientific research by providing a more comprehensive understanding of the data. By combining two sets of data, scientists can identify any potential relationships or patterns that may have been missed when looking at each set separately. This can lead to more accurate conclusions and further insights into the research topic.

Similar threads

  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
912
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
986
  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
477
  • Set Theory, Logic, Probability, Statistics
Replies
24
Views
3K
  • Set Theory, Logic, Probability, Statistics
Replies
6
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
723
  • Set Theory, Logic, Probability, Statistics
Replies
3
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
18
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
6
Views
1K
Back
Top