nLab convex mixture

Contents

Context

Analysis

Functional analysis

Measure and probability theory

Contents

Idea

Convex mixtures are to convex combinations as integrals are to sums.

In other words, convex mixtures are given by integrating a function with respect to a measure whose total normalization is one (hence, a probability measure).

The concept is used in analysis and geometry as a generalization of midpoints, as well as in probability theory to take expectation values of random variables.

In terms of category theory, it is tightly related to the Giry monad and other probability monads.

Constructions

For real numbers

Let XX be a measurable space. We can view a function f:Xf \colon X\to\mathbb{R} as an indexed collection of real numbers. If XX is finite, one can take the average, or midpoint:

f(x 1)++f(x n)n \frac{f(x_1)+\dots+f(x_n)}{n}

More generally, given a probability distribution pp on XX, one can take the weighted average, or convex combination,

xXf(x)p(x). \sum_{x\in X} f(x)\,p(x) .

The resulting number is a convex combination of the numbers f(x)f(x).

Convex mixtures generalize this to the infinite case: if the function ff is measurable, given a probability measure pp on XX, one can take, if it exists, the integral

Xf(x)p(dx). \int_X f(x) \,p(d x) \,.

One can view the resulting value as a mixture, analogous to a convex combination, of the points f(x)f(x). (More precisely, of the points in the support of the pushforward measure f *pf_*p on \mathbb{R}.)

In the language of probability theory, we are taking the expectation value of the random variable ff on the probability space (X,p)(X,p).

The very similar construction can be given by replacing \mathbb{R} with a generic Banach space, using Bochner integrals?.

For probability measures

Given measurable spaces XX and YY, we can consider the space of probability measures PYP Y on YY, and equip it with its canonical sigma-algebra (see Giry monad). This way, a measurable function f:XPYf:X\to P Y is equivalently a Markov kernel k f:XYk_f:X\to Y. Given a probability measure pp on XX, the resulting convex mixture is, equivalently, the composition of the Markov kernel with the measure pp (seen as a kernel from the one-point space): for every measurable subset BB of YY,

Xf(x)(B)p(dx)= Xk f(B|x)p(dx). \int_X f(x)(B)\,p(d x) \;=\; \int_X k_f(B|x)\,p(d x) .

In other words, mixtures of measures are equivalently compositions in the category of Markov kernels.

In particular, taking ff to be the identity (i.e. nonparametrically), a mixture of probability measures on YY is given by integrating a probability measure over probability measures over YY, πPPY\pi\in P P Y:

B PYq(B)π(dq). B \;\mapsto\; \int_{PY} q(B) \, \pi(d q) .

This is equivalently the multiplication of the Giry monad and of most probability monads (more below).

In terms of monad algebras

The Giry monad is the most general monad of probability measures on the category of measurable spaces. Algebras of the Giry monad, therefore, can be interpreted exactly as spaces where one can form arbitrary convex mixtures. This is analogous to how convex spaces, where one can form arbitrary convex combinations, are the algebras of the distribution monad.

In particular, mixtures of probability measures can be seen as instances of the multiplication of the Giry monad, or more generally, as compositions of Markov kernels. Indeed, Markov kernels are the Kleisli morphisms of the Giry monad, and hence their composition is defined in terms of the monad multiplication.

Similar notions of mixture can be given using other probability monads. Note that, in order to have a convex mixture, one needs a form of normalization.

Examples

See also

Last revised on July 11, 2024 at 21:24:30. See the history of this page for a list of all contributions to it.