Binomial Distribution and Selection of Suitable Values

Soaring Crane · Feb 7, 2013

For binomial distributions, how can you tell which central tendency value (mean, median, or mode) and which variability value (interquartile range, variance, standard deviation, etc.) are most appropriate for the data?

Thanks for any reply.

chiro · Feb 7, 2013

Hey Soaring Crane.

What exactly are you trying to do? Are you trying to see if a distribution is binomial? Are you assuming its binomial to estimate its parameter?

Binomials are good for modelling sums of I.I.D Bernoulli (Yes/No, On/Off etc) type stochastic processes (i.e. random processes).

Soaring Crane · Feb 7, 2013

Thanks for replying. I am assuming it is binomial (yes/no reply), but I don't know how to determine which values regarding central tendency and variability describe it best. (For example, there are about ten "no" replies and fourteen "yes" replies.)

Stephen Tashi · Feb 7, 2013

Soaring Crane said:

which variability value (interquartile range, variance, standard deviation, etc.) are most appropriate for the data?

I think what you are asking is:

Given the data, what is the best method for estimating the parameter of the binomial distribution that fits it?

This is not a precise mathematical question until you define what "best" (or "appropriate") means.

In mathematical terms, you are seeking an "estimator" (i.e. an algorithm or formula whose input is the sample data and whose output is an estimate of the parameter p (the probability of "succsess") that defines a binomial distribution.) An estimator depends on the random values in a sample so the estimator is a random variable. Thus there is no guarantee the estimator will always be close to value you want to estimate. To say what a "best" estimator is, you must be specific about "best" means in scenario that involves random outcomes. Some common ways of expressing human tastes for "best" estimators have the technical names: "maximum liklihood", "unbiased", "minimum variance", "consistent".

So, to ask a question that has a mathematical answer, you should ask questions like:

"What is a maximum liklihood estimator for the parameter p of a binomial distribution?"

"Is there an estimator of the paramater p of a binomial distribuion that is both unbiased and has minimum variance?"

Some forum member can answer those, or you can find the answers on the web, now that the right jargon is established.

Edit: Or perhaps you don't care about p, but only about the mean and variance of the binomial distribution. In that case you should ask for estimators of those parameters. (Even though the mean and variance are both functions of p, a "best" estimator of p is not necessarily a "best" estimator of the mean or variance.)

blue_raver22 · Feb 14, 2013

The choice of central tendency and variability values for binomial distributions depends on the specific characteristics of the data and the research question being addressed. Generally, the mean is the most commonly used measure of central tendency for binomial distributions as it takes into account all data points and provides a good overall representation of the data. However, if the data is skewed or contains outliers, the median may be a more appropriate measure of central tendency as it is less affected by extreme values.

In terms of variability, the interquartile range (IQR) may be a better measure if the data is not normally distributed or contains outliers, as it is less sensitive to extreme values compared to measures such as the standard deviation or variance. However, if the data is normally distributed, the standard deviation or variance may be more appropriate measures of variability as they take into account all data points and provide a more precise measure of spread.

Ultimately, the choice of central tendency and variability measures for binomial distributions should be based on a thorough understanding of the data and the research question, and multiple measures should be considered to provide a comprehensive understanding of the data. It is also important to consider the limitations and assumptions of each measure when making a decision.

Binomial Distribution and Selection of Suitable Values

1. What is the binomial distribution and how is it used in science?

2. How do you calculate the probability of a specific number of successes using the binomial distribution?

3. How do you determine suitable values for n and p in the binomial distribution?

4. Can the binomial distribution be used for continuous data?

5. What are some real-life examples of the binomial distribution in science?

Similar threads

Hot Threads

Recent Insights