What to do with biased estimators if we don't know the bias term?

  • Context: Undergrad 
  • Thread starter Thread starter fog37
  • Start date Start date
  • Tags Tags
    Bias Estimators
Click For Summary
SUMMARY

This discussion focuses on the challenges of using biased estimators in statistical analysis when the bias term is unknown. It highlights that while unbiased estimators are preferred, they are not always available or optimal depending on the distribution and parameter in question. The conversation emphasizes the importance of understanding the sampling distribution and the implications of using sample statistics, such as the sample mean, in calculating variance. Techniques for estimating the bias term from a single sample are also explored, underscoring the complexity of statistical estimation.

PREREQUISITES
  • Understanding of population parameters such as mean, variance, and skewness
  • Familiarity with sampling distributions and the Central Limit Theorem (CLM)
  • Knowledge of statistical estimators and their properties (biased vs. unbiased)
  • Basic proficiency in calculating sample variance and mean
NEXT STEPS
  • Research techniques for estimating bias in statistical estimators
  • Study the properties of biased versus unbiased estimators in different distributions
  • Learn about the implications of using sample mean in variance calculations
  • Explore the application of the Poisson distribution in statistical estimation
USEFUL FOR

Statisticians, data analysts, and researchers involved in statistical modeling and estimation, particularly those dealing with biased estimators and sampling distributions.

fog37
Messages
1,566
Reaction score
108
TL;DR
dealing with biased estimators
Hello,
I understand that we have a population of values. We don't know the parameters of this population. The parameters are numbers, each one describing the population in a collective sense. Examples of parameters are the mean, the median, the mode, the variance, skewness, kurtosis, etc.

We then take a single random sample and work with it to estimate the population parameters. For some parameters, the estimator we use to estimate the parameter itself is unbiased: it means that, on average, if we took many many samples, the average of the estimates, one from each sample, would end up being equal to the population parameter itself. That is great. The estimates, based on the CLM, will approximate a normal distribution centered at the population parameter....
  • What if the estimator we choose use to estimate a specific population parameter is "biased"? We always prefer for an estimator to be unbiased but I guess that is not always possible....Why not?
  • When an estimator is biased, the average of all the estimates (if we collected infinitely many) will not be equal to the parameter itself. The expectation value of estimate will be off by a fixed bias/constant term ##b## from the true population parameter. That would not be good! The sampling distribution of all the sample estimates will still tend to be normal. Conceptually, what do we if we don't know the bias term ##b##? Are there situations in which we would be able to know the magnitude of ##b##? Are there techniques we can use to figure ##b## out from the single random sample that we collected?
thank you!
 
Physics news on Phys.org
fog37 said:
What if the estimator we choose use to estimate a specific population parameter is "biased"? We always prefer for an estimator to be unbiased but I guess that is not always possible....Why not?
It is not true that the unbiased estimator is always best. It depends on what the distribution, parameter, and use of the parameter are. See this example of the parameter of the Poisson distribution.
fog37 said:
  • When an estimator is biased, the average of all the estimates (if we collected infinitely many) will not be equal to the parameter itself. The expectation value of estimate will be off by a fixed bias/constant term ##b## from the true population parameter. That would not be good! The sampling distribution of all the sample estimates will still tend to be normal. Conceptually, what do we if we don't know the bias term ##b##? Are there situations in which we would be able to know the magnitude of ##b##? Are there techniques we can use to figure ##b## out from the single random sample that we collected?
It is illustrative to consider the equation for the sample variance when the sample mean, ##\bar X##, is used rather than the true population mean, ##\mu##:
##\sum {(x_i - \bar X)}/(n-1)##
IMO, the natural first guess would be to divide by ##n## rather than by ##(n-1)##. But that is a biased estimator. Using the estimated mean, ##\bar X##, rather than the true population mean, ##\mu##, gives a smaller summation because ##\bar X## tends to be closer to the majority of the sample than ##\mu## is. Luckily, dividing by ##(n-1)## gives an unbiased estimator.
In other situations, I think that the best thing to do about a bias depends on the distribution, the parameter, and your intended use of the parameter.
 
  • Like
Likes   Reactions: fog37

Similar threads

  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 23 ·
Replies
23
Views
4K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
Replies
5
Views
5K