Hoeffding inequality for the difference of two sample means?

Click For Summary
SUMMARY

The discussion centers on the Hoeffding inequality as presented in W. Hoeffding's 1963 paper, specifically its application to the difference of two sample means. The inequality states that the probability of the difference between the sample mean and the expected value exceeding a threshold is bounded by an exponential function. The corollary for the difference of two sample means introduces the term (m-1 + n-1), which relates to the variances of the samples. Participants explore the implications of this term and its role in establishing bounds for the difference of the means.

PREREQUISITES
  • Understanding of Hoeffding's inequality
  • Familiarity with probability theory and statistical bounds
  • Knowledge of sample means and their properties
  • Basic concepts of variance in statistics
NEXT STEPS
  • Study the derivation of Hoeffding's inequality in detail
  • Explore the implications of variance in statistical bounds
  • Learn about the properties of independent random variables
  • Investigate applications of Hoeffding's inequality in machine learning
USEFUL FOR

Statisticians, data scientists, and researchers interested in probability theory and statistical inference, particularly those working with sample means and their variances.

JanO
Messages
3
Reaction score
0
In W. Hoeffding's 1963 paper* he gives the well known inequality:

P(\bar{x}-\mathrm{E}[x_i] \geq t) \leq \exp(-2t^2n) \ \ \ \ \ \ (1),

where \bar{x} = \frac{1}{n}\sum_{i=1}^nx_i, x_i\in[0,1]. x_i's are independent.

Following this theorem he gives a corollary for the difference of two sample means as:

P(\bar{x}-\bar{y}-(\mathrm{E}[x_i] - \mathrm{E}[y_k]) \geq t) \leq \exp(\frac{-2t^2}{m^{-1}+n^{-1}}) \ \ \ \ \ \ (2),

where \bar{x} = \frac{1}{n}\sum_{i=1}^nx_i, \bar{y} = \frac{1}{m}\sum_{k=1}^my_k, x_i,y_k\in[0,1]. x_i's and y_k's are independent.


My question is: How does (2) follow from (1)?

-Jan

*http://www.csee.umbc.edu/~lomonaco/f08/643/hwk643/Hoeffding.pdf (equations (2.6) and (2.7))
 
Physics news on Phys.org
Hey JanO and welcome to the forums.

One idea I have is to let Z = X + Y and use Z instead of X in the definition.
 
Thanks Chiro for your responce.

However, I still do not understand how the term (m^{-1} + n^{-1}) comes into the bound. Isn't z=\bar{x}-\bar{y} is still bounded between [0,1]?

-Jan
 
JanO said:
Thanks Chiro for your responce.

However, I still do not understand how the term (m^{-1} + n^{-1}) comes into the bound. Isn't z=\bar{x}-\bar{y} is still bounded between [0,1]?

-Jan

Think about what happens to the variances.
 
It seems like bounded here means all most surely bounded. At least that's how Hoeffding inequality seems to be given elsewhere. I guess it then means that z=\bar{x}-\bar{y} is bounded a.s. between [\mu_x-\mu_y-\frac{1}{2}\sqrt{m^{-1}+n^{-1}}, \ \mu_x-\mu_y+\frac{1}{2}\sqrt{m^{-1}+n^{-1}}].?

Thanks again for your help!
 

Similar threads

  • · Replies 0 ·
Replies
0
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
Replies
1
Views
4K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 9 ·
Replies
9
Views
3K