What to do with biased estimators if we don't know the bias term?

  • #1
fog37
1,568
108
TL;DR Summary
dealing with biased estimators
Hello,
I understand that we have a population of values. We don't know the parameters of this population. The parameters are numbers, each one describing the population in a collective sense. Examples of parameters are the mean, the median, the mode, the variance, skewness, kurtosis, etc.

We then take a single random sample and work with it to estimate the population parameters. For some parameters, the estimator we use to estimate the parameter itself is unbiased: it means that, on average, if we took many many samples, the average of the estimates, one from each sample, would end up being equal to the population parameter itself. That is great. The estimates, based on the CLM, will approximate a normal distribution centered at the population parameter....
  • What if the estimator we choose use to estimate a specific population parameter is "biased"? We always prefer for an estimator to be unbiased but I guess that is not always possible....Why not?
  • When an estimator is biased, the average of all the estimates (if we collected infinitely many) will not be equal to the parameter itself. The expectation value of estimate will be off by a fixed bias/constant term ##b## from the true population parameter. That would not be good! The sampling distribution of all the sample estimates will still tend to be normal. Conceptually, what do we if we don't know the bias term ##b##? Are there situations in which we would be able to know the magnitude of ##b##? Are there techniques we can use to figure ##b## out from the single random sample that we collected?
thank you!
 
Physics news on Phys.org
  • #2
fog37 said:
What if the estimator we choose use to estimate a specific population parameter is "biased"? We always prefer for an estimator to be unbiased but I guess that is not always possible....Why not?
It is not true that the unbiased estimator is always best. It depends on what the distribution, parameter, and use of the parameter are. See this example of the parameter of the Poisson distribution.
fog37 said:
  • When an estimator is biased, the average of all the estimates (if we collected infinitely many) will not be equal to the parameter itself. The expectation value of estimate will be off by a fixed bias/constant term ##b## from the true population parameter. That would not be good! The sampling distribution of all the sample estimates will still tend to be normal. Conceptually, what do we if we don't know the bias term ##b##? Are there situations in which we would be able to know the magnitude of ##b##? Are there techniques we can use to figure ##b## out from the single random sample that we collected?
It is illustrative to consider the equation for the sample variance when the sample mean, ##\bar X##, is used rather than the true population mean, ##\mu##:
##\sum {(x_i - \bar X)}/(n-1)##
IMO, the natural first guess would be to divide by ##n## rather than by ##(n-1)##. But that is a biased estimator. Using the estimated mean, ##\bar X##, rather than the true population mean, ##\mu##, gives a smaller summation because ##\bar X## tends to be closer to the majority of the sample than ##\mu## is. Luckily, dividing by ##(n-1)## gives an unbiased estimator.
In other situations, I think that the best thing to do about a bias depends on the distribution, the parameter, and your intended use of the parameter.
 
  • Like
Likes fog37

1. What is a biased estimator and why is it important to address bias?

A biased estimator is a statistical estimator that does not equal the true value of the parameter it is estimating, on average. This is important to address because bias can lead to systematic errors in conclusions and predictions made from data, potentially leading to incorrect decisions or interpretations.

2. What are common methods to reduce the impact of bias if the bias term is unknown?

If the bias term is unknown, methods like bootstrapping can be used to estimate the distribution of the estimator and potentially correct for bias. Another approach is using robust statistics that are less sensitive to errors in model assumptions. Additionally, increasing sample size can sometimes reduce bias, depending on the nature of the estimator and the data.

3. Can we still use biased estimators if we do not know the bias term?

Yes, biased estimators can still be useful, especially if the bias is small relative to the variance of the estimator or if the estimator is more efficient (i.e., has lower variance) than unbiased alternatives. In such cases, the trade-off between bias and variance needs to be considered, and practical significance should be evaluated.

4. How does simulation help in understanding the impact of bias in estimators?

Simulation can be a powerful tool to understand how bias affects statistical estimation. By simulating data from a known model, one can apply different estimators and directly observe the extent and nature of the bias. This helps in assessing the robustness of an estimator under various scenarios and choosing the appropriate method for analysis.

5. Are there any statistical tests to detect the presence of bias in an estimator?

While there are no direct tests to detect bias universally, methods like cross-validation and comparing estimators with known unbiased counterparts can provide insights. Analyzing the consistency of an estimator across different subsets of data can also indicate potential biases. However, these methods often require assumptions about the data or the form of the bias.

Similar threads

  • Set Theory, Logic, Probability, Statistics
Replies
7
Views
472
  • Set Theory, Logic, Probability, Statistics
Replies
2
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
9
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
23
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
2
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
486
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
662
  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
10
Views
1K
Back
Top