A Simple Proof for the Chebyshev Inequality: Clearly Explained!

In the Appendix A of this book: Statistics: Principles and Methods written by Giuseppe Cicchitelli, Pierpaolo D’Urso and Marco Minozzo published by Pearson in 2021, I found the most simple proof of the Chebyshev theorem I have ever seen. The idea of using two different sets of element is very smart.

I reproduced the demonstration here:

Let x₁, x₂, …, x_N be a series of observations with mean µ and variance σ². Let

I_k=[i, 1 \leq i \leq N:\left|x_i-\mu\right|< k\sigma ]

be the set of subscripts i identifying the observations whose deviation from the mean is (in absolute value) less than kσ. Let N(I_k) be the number of elements in I_k. We can write

\begin{aligned} \sigma^2 &=\dfrac{\sum_{i=1}^N\left(x_i-\mu\right)^2}{N } \\ N \sigma^2 &=\sum_{i=1}^N\left(x_i-\mu\right)^2 \\ &=\sum_{i \in I_k}\left(x_i-\mu\right)^2+\sum_{i \notin I_k}\left(x_i-\mu\right)^2 \\ & \geq \sum_{i \notin I_k}\left(x_i-\mu\right)^2 \\ & \geq \sum_{i \notin I_k} k^2 \sigma^2 \end{aligned}

where the first inequality holds because the sum of squared deviations from the mean extends over the subset of x_i not belonging to I_k, while the second holds since (xi-µ)² >k² σ².

This last point can be illustrated by numerical values. Indeed, with µ=0, σ=1, and k=2, we have for (xi-µ) ≤k σ :

Computation: https://www.wolframalpha.com/

For (xi-µ)² >k² σ², we have:

Computation: https://www.wolframalpha.com/

Hence,

\sum_{i \notin I_k} k^2 \sigma^2 \leq N \sigma^2,

from which, by dividing both sides of the inequality by N k² σ², we obtain

\frac{1}{N} \sum_{i \notin I_k}(1) \leq \frac{1}{k^2} \Leftrightarrow \frac{N-N\left(I_k\right)}{N} \leq \frac{1}{k^2} .

Recall that the total number of element N is equal to the number of elements in Ik, N(Ik) plus the number of elements that are not in Ik:

N=N\left(I_k\right)+N\left(\text{not }I_k\right)\\ \sum_{i \notin I_k}(1)=N\left(\text{not }I_k\right)=N-N\left(I_k\right)

Finally,

\frac{N\left(I_k\right)}{N} \geq 1-\frac{1}{k^2}

and the result is proved.

Skewness in Wolfram Alpha: Clearly Explained!

The positional average known as the skewness allows you to assess the symmetry of a distribution. When the skewness is to zero, then the distribution is symmetric. You…

Illustrating the sample variance bias with R and Mathematica

In my previous blog, I recall that we can demonstrate in a few steps that the sample variance is an unbiased estimator of the population variance when we…

Unbiased estimator for population variance: clearly explained!

Estimator: A statistic used to approximate a population parameter. Sometimes called a point estimator. Estimate: The observed value of the estimator. Unbiased estimator: An estimator whose expected value…

1 Comment

Strategic Stockpiling Reduces the Geopolitical Risk to the Supply Chain of Copper and Lithium

RePEc’s authors ranking (Last 10 Years Publications)

Countries and periods after a panel estimation with Stata

The Economic Cost of Nationalism

US-China Tensions, US Partisan Conflict and Global Oil Prices: Scapegoating? (Applied Economics Letters)