Hoeffding inequality for the difference of two sample means?

AI Thread Summary
The discussion centers on Hoeffding's inequality and its application to the difference of two sample means. The original inequality provides a bound for the probability that the sample mean deviates from the expected value, while a corollary extends this to the difference between two sample means. The question raised is how the term (m^{-1} + n^{-1}) appears in the bound for the difference, given that both sample means are bounded between [0,1]. Participants suggest considering the variances and the implications of almost sure boundedness in the context of the inequality. The conversation emphasizes the relationship between the variances of the sample means and the derived bounds.
JanO
Messages
3
Reaction score
0
In W. Hoeffding's 1963 paper* he gives the well known inequality:

P(\bar{x}-\mathrm{E}[x_i] \geq t) \leq \exp(-2t^2n) \ \ \ \ \ \ (1),

where \bar{x} = \frac{1}{n}\sum_{i=1}^nx_i, x_i\in[0,1]. x_i's are independent.

Following this theorem he gives a corollary for the difference of two sample means as:

P(\bar{x}-\bar{y}-(\mathrm{E}[x_i] - \mathrm{E}[y_k]) \geq t) \leq \exp(\frac{-2t^2}{m^{-1}+n^{-1}}) \ \ \ \ \ \ (2),

where \bar{x} = \frac{1}{n}\sum_{i=1}^nx_i, \bar{y} = \frac{1}{m}\sum_{k=1}^my_k, x_i,y_k\in[0,1]. x_i's and y_k's are independent.


My question is: How does (2) follow from (1)?

-Jan

*http://www.csee.umbc.edu/~lomonaco/f08/643/hwk643/Hoeffding.pdf (equations (2.6) and (2.7))
 
Physics news on Phys.org
Hey JanO and welcome to the forums.

One idea I have is to let Z = X + Y and use Z instead of X in the definition.
 
Thanks Chiro for your responce.

However, I still do not understand how the term (m^{-1} + n^{-1}) comes into the bound. Isn't z=\bar{x}-\bar{y} is still bounded between [0,1]?

-Jan
 
JanO said:
Thanks Chiro for your responce.

However, I still do not understand how the term (m^{-1} + n^{-1}) comes into the bound. Isn't z=\bar{x}-\bar{y} is still bounded between [0,1]?

-Jan

Think about what happens to the variances.
 
It seems like bounded here means all most surely bounded. At least that's how Hoeffding inequality seems to be given elsewhere. I guess it then means that z=\bar{x}-\bar{y} is bounded a.s. between [\mu_x-\mu_y-\frac{1}{2}\sqrt{m^{-1}+n^{-1}}, \ \mu_x-\mu_y+\frac{1}{2}\sqrt{m^{-1}+n^{-1}}].?

Thanks again for your help!
 
I was reading documentation about the soundness and completeness of logic formal systems. Consider the following $$\vdash_S \phi$$ where ##S## is the proof-system making part the formal system and ##\phi## is a wff (well formed formula) of the formal language. Note the blank on left of the turnstile symbol ##\vdash_S##, as far as I can tell it actually represents the empty set. So what does it mean ? I guess it actually means ##\phi## is a theorem of the formal system, i.e. there is a...
Back
Top