Uniform convergence in probability

Uniform convergence in probability is a form of convergence in probability in statistical asymptotic theory and probability theory. It means that, under certain conditions, the empirical frequencies of all events in a certain event-family converge to their theoretical probabilities. Uniform convergence in probability has applications to statistics as well as machine learning as part of statistical learning theory.
The law of large numbers says that, for each single event, its empirical frequency in a sequence of independent trials converges to its theoretical probability. But in some applications, we are interested not in a single event but in a whole family of events. We would like to know whether the empirical frequency of every event in the family converges to its theoretical probability simultaneously. The Uniform Convergence Theorem gives a sufficient condition for this convergence to hold. Roughly, if the event-family is sufficiently simple then uniform convergence holds.

Definitions

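For a class of predicates $H$ defined on a set $X$ and a sample $x = (x_1, x_2, \dots, x_m)$, where $x_i \in X$, the empirical frequency of $h \in H$ on $x$ is
$$\widehat{Q}_x(h) = \frac{1}{m}\,|\{i : 1 \leq i \leq m,\ h(x_i) = 1\}|.$$
The theoretical probability of $h \in H$ is defined as
$$Q_P(h) = P\{y \in X : h(y) = 1\}.$$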
The Uniform Convergence Theorem states, roughly, that if the class $H$ is "simple" and we draw samples independently (with replacement) from $X$ according to any distribution $P$, then with high probability the empirical frequency of every $h \in H$ will be close to its expected value, which is the theoretical probability.
Here "simple" means that the Vapnik–Chervonenkis dimension of the class $H$ is small relative to the size of the sample. In other words, a sufficiently simple collection of functions behaves roughly the same on a small random sample as it does on the distribution as a whole.
The Uniform Convergence Theorem was first proved by Vapnik and Chervonenkis using the concept of growth function.
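To make the definitions concrete, here is a minimal Python sketch. It assumes a toy setting that does not come from the article: the class of threshold predicates h_t(x) = 1 if x <= t on [0, 1], with samples drawn from the uniform distribution, so that the theoretical probability of h_t is simply t. The function names are illustrative only.

```python
import random

def empirical_frequency(h, sample):
    """Fraction of sample points on which the predicate h equals 1."""
    return sum(1 for x in sample if h(x) == 1) / len(sample)

def sup_deviation_thresholds(sample):
    """Largest gap between theoretical probability and empirical frequency
    over all thresholds h_t(x) = 1 if x <= t, for P uniform on [0, 1]
    (so the theoretical probability of h_t is t)."""
    xs = sorted(sample)
    m = len(xs)
    worst = 0.0
    for i, x in enumerate(xs):
        # Just below x the empirical frequency is i/m; at x it jumps to (i+1)/m.
        worst = max(worst, abs(x - i / m), abs((i + 1) / m - x))
    return worst

rng = random.Random(0)  # fixed seed so the run is repeatable
for m in (100, 1000, 10000):
    sample = [rng.random() for _ in range(m)]
    print(m, round(sup_deviation_thresholds(sample), 4))
```

The printed gap is taken over the whole class at once, which is the quantity the theorem controls; for this class it should shrink as the sample grows.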

Uniform convergence theorem

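The statement of the uniform convergence theorem is as follows:
If $H$ is a set of $\{0,1\}$-valued functions defined on a set $X$ and $P$ is a probability distribution on $X$, then for $\varepsilon > 0$ and $m$ a positive integer, we have:
$$P^m\{|Q_P(h) - \widehat{Q}_x(h)| \geq \varepsilon \text{ for some } h \in H\} \leq 4\,\Pi_H(2m)\,e^{-\varepsilon^2 m/8},$$
where, for any $x \in X^m$,
$$Q_P(h) = P\{y \in X : h(y) = 1\}, \qquad \widehat{Q}_x(h) = \frac{1}{m}\,|\{i : 1 \leq i \leq m,\ h(x_i) = 1\}|,$$
and $P^m$ indicates that the probability is taken over $x$ consisting of $m$ i.i.d. draws from the distribution $P$.
For any set $H$ of $\{0,1\}$-valued functions over $X$ and any $D \subseteq X$, $\Pi_H(D)$ is defined as
$$\Pi_H(D) = \{h \cap D : h \in H\},$$
where each $h$ is identified with the set $\{y \in X : h(y) = 1\}$.
And for any natural number $m$, the shattering number $\Pi_H(m)$ is defined as:
$$\Pi_H(m) = \max_{D \subseteq X,\ |D| = m} |\Pi_H(D)|.$$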
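The shattering number is the largest number of distinct labelings that the class induces on any set of the given size. For a finite domain it can be computed by brute force. The sketch below is illustrative only: the toy class of thresholds on {0, ..., 5} and all function names are assumptions, not part of the theorem.

```python
from itertools import combinations

def restriction_count(hypotheses, points):
    """Number of distinct labelings of `points` induced by `hypotheses`."""
    return len({tuple(h(x) for x in points) for h in hypotheses})

def shattering_number(hypotheses, domain, m):
    """Maximum of restriction_count over all m-element subsets of a finite domain."""
    return max(restriction_count(hypotheses, D) for D in combinations(domain, m))

# Hypothetical toy class: thresholds h_t(x) = 1 if x <= t, on the domain {0, ..., 5}.
domain = list(range(6))
thresholds = [lambda x, t=t: int(x <= t) for t in range(-1, 6)]

for m in range(1, 5):
    print(m, shattering_number(thresholds, domain, m))
```

Thresholds induce exactly m + 1 labelings on any m distinct points, far fewer than the 2^m labelings that would be needed to shatter them; this slow growth is what the theorem exploits.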
From the point of view of learning theory, one can consider $H$ to be the concept/hypothesis class defined over the instance set $X$. Before getting into the details of the proof of the theorem, we state Sauer's lemma, which the proof requires.

Sauer–Shelah lemma

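The Sauer–Shelah lemma relates the shattering number $\Pi_H(m)$ to the VC dimension.
Lemma: $\Pi_H(m) \leq \left(\frac{em}{d}\right)^d$ for $m \geq d \geq 1$, where $d$ is the VC dimension of the concept class $H$.
Corollary: $\Pi_H(m) \leq m^d$ for $m \geq 2$ and $d \geq 2$.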
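The lemma is usually proved in the sharper form that the shattering number is at most the sum of the binomial coefficients C(m, i) for i = 0, ..., d, which is in turn at most (em/d)^d when m >= d >= 1. The following sketch compares the two quantities numerically for a few values; the function names are illustrative.

```python
from math import comb, e

def sauer_sum(m, d):
    """Sum of C(m, i) for i = 0..d, the sharper Sauer-Shelah bound."""
    return sum(comb(m, i) for i in range(d + 1))

def sauer_bound(m, d):
    """The closed-form bound (e*m/d)**d, valid for m >= d >= 1."""
    return (e * m / d) ** d

for d in (1, 2, 5):
    for m in (d, 2 * d, 10 * d):
        print(d, m, sauer_sum(m, d), round(sauer_bound(m, d), 1))
```

For fixed d both columns grow polynomially in m, in contrast to the 2^m labelings of an unrestricted class.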

Proof of uniform convergence theorem

Before we get into the details of the proof of the Uniform Convergence Theorem, we present a high-level overview of the proof.

Symmetrization: We transform the problem of analyzing $|Q_P(h) - \widehat{Q}_x(h)| \geq \varepsilon$ into the problem of analyzing $|\widehat{Q}_r(h) - \widehat{Q}_s(h)| \geq \varepsilon/2$, where $r$ and $s$ are i.i.d. samples of size $m$ drawn according to the distribution $P$. One can view $r$ as the original randomly drawn sample of length $m$, while $s$ may be thought of as the testing sample which is used to estimate $Q_P(h)$.
Permutation: Since $r$ and $s$ are drawn independently from the same distribution, swapping elements between them does not change the probability distribution on $r$ and $s$. So we bound the probability of $|\widehat{Q}_r(h) - \widehat{Q}_s(h)| \geq \varepsilon/2$ for some $h \in H$ by considering the effect of a specific collection of permutations of the joint sample $x = r \| s$. Specifically, we consider permutations $\sigma(x)$ which swap $x_i$ and $x_{m+i}$ for $i$ in some subset of $\{1, 2, \dots, m\}$. The symbol $r \| s$ means the concatenation of $r$ and $s$.
Reduction to a finite class: We can now restrict the function class $H$ to a fixed joint sample, and hence, if $H$ has finite VC dimension, the problem reduces to one involving a finite function class.

We present the technical details of the proof.

Symmetrization

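Lemma: Let
$$V = \{x \in X^m : |Q_P(h) - \widehat{Q}_x(h)| \geq \varepsilon \text{ for some } h \in H\}$$
and
$$R = \{(r, s) \in X^m \times X^m : |\widehat{Q}_r(h) - \widehat{Q}_s(h)| \geq \varepsilon/2 \text{ for some } h \in H\}.$$
Then for $m \geq \frac{2}{\varepsilon^2}$, $P^m(V) \leq 2P^{2m}(R)$.
Proof:
By the triangle inequality, if $|Q_P(h) - \widehat{Q}_r(h)| \geq \varepsilon$ and $|Q_P(h) - \widehat{Q}_s(h)| \leq \varepsilon/2$, then $|\widehat{Q}_r(h) - \widehat{Q}_s(h)| \geq \varepsilon/2$.
Therefore,
$$\begin{aligned}
P^{2m}(R) &\geq P^{2m}\{\exists h \in H : |Q_P(h) - \widehat{Q}_r(h)| \geq \varepsilon \text{ and } |Q_P(h) - \widehat{Q}_s(h)| \leq \varepsilon/2\} \\
&= \int_V P^m\{s : \exists h \in H,\ |Q_P(h) - \widehat{Q}_r(h)| \geq \varepsilon \text{ and } |Q_P(h) - \widehat{Q}_s(h)| \leq \varepsilon/2\}\,dP^m(r) = A,
\end{aligned}$$
since $r$ and $s$ are independent.
Now for $r \in V$ fix an $h \in H$ such that $|Q_P(h) - \widehat{Q}_r(h)| \geq \varepsilon$. For this $h$, we shall show that
$$P^m\left\{s : |Q_P(h) - \widehat{Q}_s(h)| \leq \frac{\varepsilon}{2}\right\} \geq \frac{1}{2}.$$
Thus for any $r \in V$ the integrand is at least $\frac{1}{2}$, so $A \geq \frac{P^m(V)}{2}$ and hence $P^{2m}(R) \geq \frac{P^m(V)}{2}$. This completes the first step of the high-level outline.
Notice that $m \cdot \widehat{Q}_s(h)$ is a binomial random variable with expectation $m \cdot Q_P(h)$ and variance $m \cdot Q_P(h)(1 - Q_P(h))$. By Chebyshev's inequality we get
$$P^m\left\{s : |Q_P(h) - \widehat{Q}_s(h)| > \frac{\varepsilon}{2}\right\} \leq \frac{m \cdot Q_P(h)(1 - Q_P(h))}{(\varepsilon m/2)^2} \leq \frac{1}{\varepsilon^2 m} \leq \frac{1}{2}$$
for the mentioned bound on $m$. Here we use the fact that $x(1 - x) \leq 1/4$ for all $x$.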
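The Chebyshev step can be checked numerically. The sketch below computes the exact binomial probability that the empirical frequency on the second sample misses the theoretical probability by more than half the tolerance, and compares it with the Chebyshev bound 1/(eps^2 m) at the smallest sample size the lemma allows. The values chosen for the tolerance `eps` and the theoretical probability `q` are arbitrary illustrations.

```python
from math import comb, ceil

def prob_far(m, q, eps):
    """Exact P(|q - K/m| > eps/2) for K ~ Binomial(m, q)."""
    return sum(
        comb(m, k) * q**k * (1 - q) ** (m - k)
        for k in range(m + 1)
        if abs(q - k / m) > eps / 2
    )

eps = 0.2
m = ceil(2 / eps**2)  # smallest m satisfying m >= 2 / eps^2
for q in (0.1, 0.3, 0.5):
    print(q, round(prob_far(m, q, eps), 4), "<=", round(1 / (eps**2 * m), 4))
```

Chebyshev's inequality is loose here, so the exact probabilities sit well below the bound; the proof only needs them to be at most 1/2.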

Permutations

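Let $\Gamma_m$ be the set of all permutations of $\{1, 2, 3, \dots, 2m\}$ that swap $i$ and $m+i$ for every $i$ in some subset of $\{1, 2, 3, \dots, m\}$.
Lemma: Let $R$ be any subset of $X^{2m}$ and $P$ any probability distribution on $X$. Then,
$$P^{2m}(R) = E[\Pr[\sigma(x) \in R]] \leq \max_{x \in X^{2m}} (\Pr[\sigma(x) \in R]),$$
where the expectation is over $x$ chosen according to $P^{2m}$, and the probability is over $\sigma$ chosen uniformly from $\Gamma_m$.
Proof:
For any $\sigma \in \Gamma_m$,
$$P^{2m}(R) = P^{2m}\{x : \sigma(x) \in R\},$$
since coordinate permutations preserve the product distribution $P^{2m}$. Therefore,
$$\begin{aligned}
P^{2m}(R) &= \int_{X^{2m}} 1_R(x)\,dP^{2m}(x) \\
&= \frac{1}{|\Gamma_m|} \sum_{\sigma \in \Gamma_m} \int_{X^{2m}} 1_R(\sigma(x))\,dP^{2m}(x) \\
&= \int_{X^{2m}} \frac{1}{|\Gamma_m|} \sum_{\sigma \in \Gamma_m} 1_R(\sigma(x))\,dP^{2m}(x) \quad \text{(because } |\Gamma_m| \text{ is finite)} \\
&= \int_{X^{2m}} \Pr[\sigma(x) \in R]\,dP^{2m}(x) \\
&\leq \max_{x \in X^{2m}} (\Pr[\sigma(x) \in R]).
\end{aligned}$$
The maximum is guaranteed to exist since there is only a finite set of values that the probability under a random permutation can take.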
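For small samples the swap permutations can be enumerated directly: each one is determined by the subset of the first m positions whose entries are exchanged with their partners m positions later, so there are 2^m of them. The sketch below uses 0-based indices and a hypothetical event (the two half-sample means differ by at least 0.5) purely for illustration; the function names are assumptions.

```python
from itertools import product

def swap_permutations(m):
    """Yield each swap permutation as a tuple of 0-based indices of length 2m."""
    for mask in product((False, True), repeat=m):
        sigma = list(range(2 * m))
        for i, swap in enumerate(mask):
            if swap:
                sigma[i], sigma[m + i] = sigma[m + i], sigma[i]
        yield tuple(sigma)

def permute(sigma, x):
    """Apply sigma to the joint sample x."""
    return tuple(x[j] for j in sigma)

def prob_in_event(x, in_event):
    """Probability, over a uniformly chosen swap permutation, that the
    permuted joint sample lands in the event."""
    m = len(x) // 2
    perms = list(swap_permutations(m))
    return sum(in_event(permute(s, x)) for s in perms) / len(perms)

def halves_differ(y):
    """Hypothetical event: the two half-sample means differ by at least 0.5."""
    m = len(y) // 2
    return abs(sum(y[:m]) - sum(y[m:])) / m >= 0.5

x = (1, 1, 1, 0, 0, 0)  # joint sample: first half followed by second half
print(len(list(swap_permutations(3))), prob_in_event(x, halves_differ))
```

Because the probability is an average over finitely many permutations, it can only take finitely many values, which is why the maximum in the lemma exists.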

Reduction to a finite class

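Lemma: Building on the previous lemma,
$$\max_{x \in X^{2m}} (\Pr[\sigma(x) \in R]) \leq 2\,\Pi_H(2m)\,e^{-\varepsilon^2 m/8}.$$
Proof:
Let us define $x = (x_1, x_2, \dots, x_{2m})$ and $t = |H|_x|$, the number of distinct restrictions of functions in $H$ to $x$, which is at most $\Pi_H(2m)$. This means there are functions $h_1, h_2, \dots, h_t \in H$ such that for any $h \in H$ there is an $i$ between $1$ and $t$ with $h_i(x_k) = h(x_k)$ for $1 \leq k \leq 2m$.
We see that $\sigma(x) \in R$ iff some $h$ in $H$ satisfies
$$\left|\frac{1}{m}\,|\{1 \leq i \leq m : h(x_{\sigma(i)}) = 1\}| - \frac{1}{m}\,|\{m+1 \leq i \leq 2m : h(x_{\sigma(i)}) = 1\}|\right| \geq \frac{\varepsilon}{2}.$$
Hence, if we define $w^j_i = 1$ if $h_j(x_i) = 1$ and $w^j_i = 0$ otherwise, for $1 \leq i \leq 2m$ and $1 \leq j \leq t$, we have that $\sigma(x) \in R$ iff some $j$ in $\{1, \dots, t\}$ satisfies
$$\left|\frac{1}{m}\left(\sum_{i=1}^{m} w^j_{\sigma(i)} - \sum_{i=1}^{m} w^j_{\sigma(m+i)}\right)\right| \geq \frac{\varepsilon}{2}.$$
By the union bound we get
$$\begin{aligned}
\Pr[\sigma(x) \in R] &\leq t \cdot \max_{1 \leq j \leq t} \Pr\left[\left|\frac{1}{m}\left(\sum_{i=1}^{m} w^j_{\sigma(i)} - \sum_{i=1}^{m} w^j_{\sigma(m+i)}\right)\right| \geq \frac{\varepsilon}{2}\right] \\
&\leq \Pi_H(2m) \cdot \max_{1 \leq j \leq t} \Pr\left[\left|\frac{1}{m}\left(\sum_{i=1}^{m} w^j_{\sigma(i)} - \sum_{i=1}^{m} w^j_{\sigma(m+i)}\right)\right| \geq \frac{\varepsilon}{2}\right].
\end{aligned}$$
Since the distribution over the permutations $\sigma$ is uniform, for each $i$ the difference $w^j_{\sigma(i)} - w^j_{\sigma(m+i)}$ equals $\pm|w^j_i - w^j_{m+i}|$, with equal probability.
Thus,
$$\Pr\left[\left|\frac{1}{m}\sum_{i=1}^{m}\left(w^j_{\sigma(i)} - w^j_{\sigma(m+i)}\right)\right| \geq \frac{\varepsilon}{2}\right] = \Pr\left[\left|\frac{1}{m}\sum_{i=1}^{m} |w^j_i - w^j_{m+i}|\,\beta_i\right| \geq \frac{\varepsilon}{2}\right],$$
where the probability on the right is over independent signs $\beta_i \in \{-1, +1\}$, and both possibilities are equally likely. By Hoeffding's inequality, this is at most $2e^{-m\varepsilon^2/8}$.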
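The final Hoeffding step bounds the tail of a sum of independent random signs with 0/1 weights by 2e^{-m eps^2/8}. For small m this can be verified exactly by enumerating every sign pattern. The sketch below is illustrative; the weight vector `a` stands for the absolute label differences between paired sample points, and its values are an assumption.

```python
from itertools import product
from math import exp

def sign_tail(a, eps):
    """Exact Pr[|(1/m) * sum_i a_i * beta_i| >= eps/2] over uniform beta in {-1,+1}^m."""
    m = len(a)
    hits = sum(
        1
        for beta in product((-1, 1), repeat=m)
        if abs(sum(ai * bi for ai, bi in zip(a, beta))) / m >= eps / 2
    )
    return hits / 2**m

a = [1] * 12  # every a_i = 1: all 12 pairs of sample points are labelled differently
for eps in (0.5, 1.0, 1.5):
    print(eps, sign_tail(a, eps), "<=", round(2 * exp(-len(a) * eps**2 / 8), 4))
```

Enumeration costs 2^m evaluations, so this check is only practical for small m; the inequality itself holds for every m.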
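Finally, combining the three parts of the proof gives the Uniform Convergence Theorem: symmetrization costs a factor of 2, the permutation lemma bounds the symmetrized probability by a maximum over fixed joint samples, and the reduction to a finite class bounds that maximum by $2\,\Pi_H(2m)\,e^{-\varepsilon^2 m/8}$, for a total of $4\,\Pi_H(2m)\,e^{-\varepsilon^2 m/8}$. Symmetrization requires $m \geq 2/\varepsilon^2$; for smaller $m$ the right-hand side exceeds 1, so the bound holds trivially.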
