MHB Standard deviation for proportions

AI Thread Summary
To calculate the standard deviation and standard error for a binomial distribution with multiple samples, the standard deviation is estimated using the formula σ = √(p(1 - p)/n), where p is the estimated proportion. When combining samples with different sizes, a weighted average is used, with weights calculated as w_i = 1/σ_i². The weighted mean is then computed as p̄ = (Σp_i w_i)/(Σw_i), and the standard error of the weighted mean is σ_p̄ = √(1/Σw_i). For reporting results as percentages, the standard errors and standard deviations can be expressed in percentage form by multiplying the proportion values by 100. Understanding these calculations is essential for accurately reporting statistical results.
Zues
Messages
4
Reaction score
0
Hi,,,can u please explain me how to calculate standard deviation and standard error for a binomial distribution when you have several samples?

For exapmple:
I don't know the population size. I take a sample of 10 and check for a particular characteristic. Let's say number of successes for this sample is x. So the proportion of successes is x/n. Then I repeat this process 3 times. That means I take 3 samples. Then I'll calculate the mean of the x/n for these 3 samples. So how do I calculate standard deviation or standard error for this mean value?

Eg: Sample 1 => x/n = x/10 =3/10 =30%

When this is done to all three samples,

Sample 1 => 30% +- a
Sample 2 => 32% +-b
Sample 3 => 32% +- c
Mean = 31.33% +-d

How do I calculate a,b,c and d? And what if I have different sample sizes for the three occasions? (having 10, 15, 8 instead of 10,10,10).

Thank you very much for your help
 
Mathematics news on Phys.org
Zues said:
Hi,,,can u please explain me how to calculate standard deviation and standard error for a binomial distribution when you have several samples?

For exapmple:
I don't know the population size. I take a sample of 10 and check for a particular characteristic. Let's say number of successes for this sample is x. So the proportion of successes is x/n. Then I repeat this process 3 times. That means I take 3 samples. Then I'll calculate the mean of the x/n for these 3 samples. So how do I calculate standard deviation or standard error for this mean value?

Eg: Sample 1 => x/n = x/10 =3/10 =30%

When this is done to all three samples,

Sample 1 => 30% +- a
Sample 2 => 32% +-b
Sample 3 => 32% +- c
Mean = 31.33% +-d

How do I calculate a,b,c and d? And what if I have different sample sizes for the three occasions? (having 10, 15, 8 instead of 10,10,10).

Thank you very much for your help

Hi Zues! Welcome to MHB! :)

The standard deviation of the proportion of a binomial distribution is estimated with:
$$\hat \sigma = \sqrt{\frac{\hat p (1 - \hat p)}{n}}$$
where $\hat p$ is the estimated proportion.

When combining N measurements with different standard deviations, you'll need a weighted average.
The weights are:
$$w_i = \frac{1}{\hat\sigma_i^2}$$
The weighted mean is then:
$$\bar{p} = \frac{ \displaystyle\sum_{i=1}^N \hat p_i w_i}{\displaystyle\sum_{i=1}^N w_i}$$
And the standard error $\sigma_{\bar{p}}$ of the weighted mean is:
$$\sigma_{\bar{p}} = \sqrt{\frac{ 1 }{\sum_{i=1}^N w_i}}$$
 
I like Serena said:
Hi Zues! Welcome to MHB! :)

The standard deviation of the proportion of a binomial distribution is estimated with:
$$\hat \sigma = \sqrt{\frac{\hat p (1 - \hat p)}{n}}$$
where $\hat p$ is the estimated proportion.

When combining N measurements with different standard deviations, you'll need a weighted average.
The weights are:
$$w_i = \frac{1}{\hat\sigma_i^2}$$
The weighted mean is then:
$$\bar{p} = \frac{ \displaystyle\sum_{i=1}^N \hat p_i w_i}{\displaystyle\sum_{i=1}^N w_i}$$
And the standard error $\sigma_{\bar{p}}$ of the weighted mean is:
$$\sigma_{\bar{p}} = \sqrt{\frac{ 1 }{\sum_{i=1}^N w_i}}$$

Thank you very very much Serena. I spent a whole day trying to find this. Thank you very much (Smile)
 
Hi,, I have another question regarding this. I would like to report my results as percentages. Then how should I report the standard errors and standard deviations? I'm asking this because we use the proportion (instead of the percentage value) to calculate the SD an SE

Thank you very much and I'm so sorry for bothering. Thank you
 
Zues said:
Hi,, I have another question regarding this. I would like to report my results as percentages. Then how should I report the standard errors and standard deviations? I'm asking this because we use the proportion (instead of the percentage value) to calculate the SD an SE

Thank you very much and I'm so sorry for bothering. Thank you

A proportion and a percentage represent the same thing.
The only difference is a factor of a 100.

So, suppose you have a weighted mean of $\bar p = 0.31$ and an estimated standard error of $\hat \sigma_{\bar p}=0.12$, then you might also say that $\bar p = 31\%$ and $\hat \sigma_{\bar p}=12\%$.
Or for short:
$$\bar p = 31 \pm 12 \%$$
 
Thank you very much. This means a lot. You are so kind, Thank you(Smile)
 
Suppose ,instead of the usual x,y coordinate system with an I basis vector along the x -axis and a corresponding j basis vector along the y-axis we instead have a different pair of basis vectors ,call them e and f along their respective axes. I have seen that this is an important subject in maths My question is what physical applications does such a model apply to? I am asking here because I have devoted quite a lot of time in the past to understanding convectors and the dual...
Fermat's Last Theorem has long been one of the most famous mathematical problems, and is now one of the most famous theorems. It simply states that the equation $$ a^n+b^n=c^n $$ has no solutions with positive integers if ##n>2.## It was named after Pierre de Fermat (1607-1665). The problem itself stems from the book Arithmetica by Diophantus of Alexandria. It gained popularity because Fermat noted in his copy "Cubum autem in duos cubos, aut quadratoquadratum in duos quadratoquadratos, et...
Insights auto threads is broken atm, so I'm manually creating these for new Insight articles. In Dirac’s Principles of Quantum Mechanics published in 1930 he introduced a “convenient notation” he referred to as a “delta function” which he treated as a continuum analog to the discrete Kronecker delta. The Kronecker delta is simply the indexed components of the identity operator in matrix algebra Source: https://www.physicsforums.com/insights/what-exactly-is-diracs-delta-function/ by...

Similar threads

Replies
2
Views
1K
Replies
1
Views
2K
Replies
4
Views
1K
Replies
1
Views
2K
Replies
42
Views
4K
Replies
4
Views
1K
Back
Top