One can view a probability measure on a space as a βpile of massβ, for example, of sand, on the space . Using this picture, given two probability spaces and , there could be many ways of moving the mass from to in such a way that the sand from the pile is arranged to form the pile . (The mass from which point goes to which point, or points?) This βway of moving the massβ is called a transport plan, and it is usually encoded by a joint distribution or by a Markov kernel (see below).
It is useful to keep track of in which way we are rearranging the mass to form , and we can see these different ways as different morphisms, between the objects and , in a category of couplings.
Let and be probability spaces. A coupling or transport plan between and is a probability space where
is the tensor product sigma-algebra on the product space (generated by the sets with and );
the measure has and as marginals, in the sense that for all and ,
Given a probability space , the identity coupling or diagonal coupling is given by the following measure on :
for all .
Intuitively, this is a copy of on concentrated on the diagonal subset . (Whenever is standard Borel, the diagonal subset is measurable, and so this intuition can be made precise.)
This coupling gives the identity in the category of couplings. In terms of transport plans, this corresponds to not moving any mass (almost surely).
Given probability spaces and the independent coupling or product coupling or constant coupling is given by the product measure , i.e.
for all and .
In terms of transport plans, this arranges the mass from almost all points of to a distribution proportional to , (almost surely) independently of the point of origin.
Let , , be standard Borel probability spaces, and consider transport plans from to and from to . The composite transport plan from to is defined as follows:
for all and , and where and are the regular conditional distributions associated to and given . The interpretation is that the mass in moved according to the plan and then according to the plan , and in case the transport is stochastic, the two transitions are taken independently.
This construction gives composition in the category of couplings. When the transport plans are induced by functions or kernels (see below), the composition of transport plans is given by the composition of functions or kernels.
In Kozen-Silva-Voogdβ23, this construction was extended beyond the standard Borel case. (See there for the details.)
Let be a measure-preserving function. One can define the βdeterministicβ transport plan as follows,
for all and . Intuitively, this maps all the mass at to the point , for every .
Note that in general there may exist no measure-preserving function between two probability spaces, for example, on the real line, if is a Dirac delta and is not. A construction that always exists is in terms of Markov kernels, see below.
Let be a measure-preserving Markov kernel. One can define a transport plan as follows,
for all and . Intuitively, this maps all the mass at to a measure on proportional to the measure .
Note that in the formula above, the measure is invoked only for almost all , and so it is insensitive to changes in on a -measure-zero set. In a certain sense, this transporting the mass of , more than the single points .
In many cases, such as if and are standard Borel, every transport plan is in the form for some . See also the discussion at "categories of couplings".
Couplings are in some sense undirected, meaning that every transport plan from to can also be seen as (and canonically induces) a transport plan from to .
This makes the category of couplings canonically a dagger category.
For transport plans specified by kernels, this symmetry corresponds exactly to Bayesian inversion of kernels.
optimal transport?
Cedric Villani, Optimal transport: old and new, Springer, 2008.
Fredrik Dahlqvist, Vincent Danos, Ilias Garnier, and Alexandra Silva, Borel kernels and their approximation, categorically, MFPS 2018. arXiv.
Dexter Kozen, Alexandra Silva, Erik Voogd, Joint Distributions in Probabilistic Semantics, MFPS 2023. (arXiv)
Paolo Perrone, Lifting couplings in Wasserstein spaces, 2021. (arXiv:2110.06591)
Last revised on July 13, 2024 at 10:07:09. See the history of this page for a list of all contributions to it.