Basis for Margin of Error in Opinion Polls?

  • Context: Undergrad 
  • Thread starter Thread starter Bacle
  • Start date Start date
  • Tags Tags
    Basis Error Margin
Click For Summary

Discussion Overview

The discussion centers on the basis for the margin of error in opinion polls, exploring the statistical principles behind it, including binomial and multinomial distributions, confidence intervals, and the relationship between variance and standard deviation. The scope includes theoretical aspects of statistical analysis as applied to polling data.

Discussion Character

  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • Some participants propose that the margin of error is derived from the assumption of a binomial distribution, with the error being proportional to the square root of the sample size.
  • Others question whether the binomial model extends to multinomial scenarios, suggesting that the same proportionality may hold.
  • One participant discusses the basic statistical properties of the binomial distribution, including mean and variance, and how these relate to polling data.
  • Another participant raises a question about the relationship between error, variance, and standard deviation, indicating uncertainty about the basic concepts.
  • Some participants assert that the margin of error is a confidence interval, which relies on the assumption of a normal distribution, and emphasize the importance of checking this assumption using chi-square tests.
  • There is a discussion about the calculation of standard error and confidence intervals, with references to Z-scores corresponding to confidence levels.
  • One participant challenges the assumption that the margin of error corresponds to a confidence interval, suggesting it may represent the spread for a specific confidence level, particularly in a Gaussian distribution.
  • Another participant notes that while the binomial distribution provides parameter values, the Gaussian distribution is often used for practical estimation of distribution values.

Areas of Agreement / Disagreement

Participants express multiple competing views regarding the relationship between margin of error, confidence intervals, and the underlying statistical distributions. The discussion remains unresolved with no clear consensus on the assumptions or interpretations presented.

Contextual Notes

There are limitations regarding the assumptions made about distributions, the dependence on sample sizes, and the conditions under which the various statistical models apply. These aspects remain unresolved within the discussion.

Bacle
Messages
656
Reaction score
1
Hi,

Just curious as to what is the basis of the margin of error given in polls, e.g.,
in statements of the form:" 30% of people are in favor of candidate x. The poll
has a margin of error of +/- 5 %"

Thanks.
 
Physics news on Phys.org
The error margin is obtained from an assumption of a binomial distribution. The net result is the estimated error is proportional to the square root of the sample size. For example if 10000 people are asked, the error estimate is around 100, which if the answers are split 5000 yes and 5000 no, the error estimate would be about 2%.
 
Thanks, M:

Would the result extend to a multinomial if those polled were asked to choose between
more than two alternatives?

Thanks.
 
Bacle said:
Thanks, M:

Would the result extend to a multinomial if those polled were asked to choose between
more than two alternatives?

Thanks.
Essentially yes. The proportion to the square root of the sample size remains,
 
To get back to basics. Binomial distribution with sample size n and probability of success p, the mean number of successes is np and the variance (square of the standard deviation) is np(1-p). When doing a statistical analysis, p is unknown, so it is usually estimated as the number of successes divided by the sample size.

When used to analyze poling data, the simplest way is to use the binomial for each choice. For example if there are 3 choices with probabilities x, y, z, then the variances can be estimated as nx(1-x), ny(1-y), and nz(1-z).
 
Thanks Again.

When you talk about error, is it related to the variance/standard deviation?
Please ignore my ignorance--and the question-- if this is too basic/obvious.
 
Never mind the last question and thanks; sorry for being so slow.
 
The margin of error given is actually a confidence interval, I think. The assumption for a confidence interval is that the data follows a normal (not binomial) distribution, which is an assumption that should be checked using chi-square first.

The standard error (we'll call it SE) equals the standard deviation (the square root of the variance, which (in this case) = sample size * frequency of x) divided by the square root of the sample size.

The confidence interval is z * the standard error, where z is the Z-score that corresponds to the confidence level of your choice. For example, if your confidence level is 95%, then z = 1.96.
 
moonman239 said:
The margin of error given is actually a confidence interval, I think.
Is it really? I think it is the spread corresponding to a specific confidence level. So, if the distribution is roughly Gaussian, then for a 95% confidence, the MoE is about 2 standard deviations.
 
  • #10
moonman239 said:
The margin of error given is actually a confidence interval, I think. The assumption for a confidence interval is that the data follows a normal (not binomial) distribution, which is an assumption that should be checked using chi-square first.

The standard error (we'll call it SE) equals the standard deviation (the square root of the variance, which (in this case) = sample size * frequency of x) divided by the square root of the sample size.

The confidence interval is z * the standard error, where z is the Z-score that corresponds to the confidence level of your choice. For example, if your confidence level is 95%, then z = 1.96.

Gaussian is assumed because for large sample size, binomial is much harder to handle. In practice the average and the estimated deviation are used. The binomial gives the value for the parameters while the Gaussian is used to estimate distribution values.
 

Similar threads

  • · Replies 72 ·
3
Replies
72
Views
7K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 4 ·
Replies
4
Views
992
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 21 ·
Replies
21
Views
3K
  • · Replies 16 ·
Replies
16
Views
3K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 3 ·
Replies
3
Views
6K
  • · Replies 18 ·
Replies
18
Views
3K
  • · Replies 4 ·
Replies
4
Views
3K