Statistics and Tchebysheff's theorum

major_maths · Sep 6, 2011

Homework Statement

Let k[itex]\geq[/itex]1. Show that, for any set of n measurements, the fraction included in the interval [itex]\overline{y}[/itex]-ks to [itex]\overline{y}[/itex]+ks is at least (1-1/k²).

[Hint: s² = 1/(n-1)[[itex]\sum[/itex](y_i-[itex]\overline{y}[/itex])²]. In this expression, replace all deviations for which the absolute value of (y_i-[itex]\overline{y}[/itex])[itex]\geq[/itex]ks with ks. Simplify.] This result is known as Tchebysheff's theorem.

2. Homework Equations are the above.

The Attempt at a Solution

I've got no clue what the problem wants, much less how to start a solution.

Stephen Tashi · Sep 7, 2011

major_maths said:

Homework Statement

Let k[itex]\geq[/itex]1. Show that, for any set of n measurements, the fraction included in the interval [itex]\overline{y}[/itex]-ks to [itex]\overline{y}[/itex]+ks is at least (1-1/k²).

[Hint: s² = 1/(n-1)[[itex]\sum[/itex](y_i-[itex]\overline{y}[/itex])²]. In this expression, replace all deviations for which the absolute value of (y_i-[itex]\overline{y}[/itex])[itex]\geq[/itex]ks with ks. Simplify.] This result is known as Tchebysheff's theorem.

Let there be [itex]M[/itex] measurements where [itex]| y_i - \overline{y}| \geq ks[/itex]
If in the sum [itex]\sum(y_i -\overline{y})^2[/itex] we replace those [itex]M[/itex] measurements by [itex]ks[/itex] and leave out the other [itex]N-M[/itex] measurements, we get a smaller sum. The smaller sum is [itex]M (ks)^2[/itex]

Hence

[tex]s^2 = \frac{1}{n-1} \sum(y_i - \overline{y})^2 \geq \frac{1}{n-1} M (ks)^2[/tex]

Since [itex]\frac{1}{n-1} > \frac{1}{n}[/itex]

[tex]s^2 \geq \frac{1}{n-1}M(ks)^2 > \frac{1}{n}M(ks)^2[/tex]
[tex]s^2 \geq \frac{1}{n}M(ks)^2[/tex]

The "fraction of measurements" that [itex]M[/itex] constitutes is [itex]\frac{M}{n}[/itex] and the above inequality can be used to bound it.

The original problem concerns the fraction of measurements other than those M measurements, so that fraction is [itex]1.0 - \frac{M}{n}[/itex].
That needs to be bounded by using the bound for [itex]\frac{M}{n}[/itex].

hassman · Sep 7, 2011

Thank you Stephen. That was part of my homework I was struggling with. I wonder which school OP goes :-).

To be really pedantic, should not the last equation have > sign instead of >=?

BigBrain · May 13, 2012

When we take a look at the definition of theorem number two, we see that the theorem refers to the standard deviation of the possible sample means computed from all possible random samples. Theorem number one is similar in that it says for any population, the average value of all possible sample means computed from all possible random samples of a given size from the population equal the population mean. What does that mean? Does that mean that the mean of my sample will automatically be equal to the population mean?

Statistics and Tchebysheff's theorum

Homework Help Overview

Discussion Character

Approaches and Questions Raised

Discussion Status

Contextual Notes

Homework Statement

The Attempt at a Solution

Homework Statement

Similar threads

Distance between a Clock's hands when the distance is increasing most rapidly

Polar integral

Deriving spatial derivatives

Is this the correct general solution of the given PDE?

J_1(x) = (x^2/10)*(J_1(x) + J_3(x)) How to solve?

Insights Revisiting the Velocity-Time Function

Insights Remote Operated Gate Control System

Insights AI Enriched Problem Solving

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect