Polar factorization theorem

Theorem in Optimal Transport

In optimal transport, a branch of mathematics, polar factorization of vector fields is a basic result due to Brenier (1987),^[1] with antecedents of Knott-Smith (1984)^[2] and Rachev (1985),^[3] that generalizes many existing results among which are the polar decomposition of real matrices, and the rearrangement of real-valued functions.

The theorem

[edit ]

Notation. Denote $\xi _{\#}\mu$ {\displaystyle \xi _{\#}\mu } the image measure of $\mu$ {\displaystyle \mu } through the map $\xi$ {\displaystyle \xi }.

Definition: Measure preserving map. Let $(X,\mu )$ {\displaystyle (X,\mu )} and $(Y,\nu )$ {\displaystyle (Y,\nu )} be some probability spaces and $\sigma :X\rightarrow Y$ {\displaystyle \sigma :X\rightarrow Y} a measurable map. Then, $\sigma$ {\displaystyle \sigma } is said to be measure preserving iff $\sigma _{\#}\mu =\nu$ {\displaystyle \sigma _{\#}\mu =\nu }, where $\#$ {\displaystyle \#} is the pushforward measure. Spelled out: for every $\nu$ {\displaystyle \nu }-measurable subset $\Omega$ {\displaystyle \Omega } of $Y$ {\displaystyle Y}, $\sigma ^{-1}(\Omega )$ {\displaystyle \sigma ^{-1}(\Omega )} is $\mu$ {\displaystyle \mu }-measurable, and $\mu (\sigma ^{-1}(\Omega ))=\nu (\Omega )$ {\displaystyle \mu (\sigma ^{-1}(\Omega ))=\nu (\Omega )}. The latter is equivalent to:

\int _{X}(f\circ \sigma )(x)\mu (dx)=\int _{X}(\sigma ^{*}f)(x)\mu (dx)=\int _{Y}f(y)(\sigma _{\#}\mu )(dy)=\int _{Y}f(y)\nu (dy)

{\displaystyle \int _{X}(f\circ \sigma )(x)\mu (dx)=\int _{X}(\sigma ^{*}f)(x)\mu (dx)=\int _{Y}f(y)(\sigma _{\#}\mu )(dy)=\int _{Y}f(y)\nu (dy)}

where $f$ {\displaystyle f} is $\nu$ {\displaystyle \nu }-integrable and $f\circ \sigma$ {\displaystyle f\circ \sigma } is $\mu$ {\displaystyle \mu }-integrable.

Theorem. Consider a map $\xi :\Omega \rightarrow R^{d}$ {\displaystyle \xi :\Omega \rightarrow R^{d}} where $\Omega$ {\displaystyle \Omega } is a convex subset of $R^{d}$ {\displaystyle R^{d}}, and $\mu$ {\displaystyle \mu } a measure on $\Omega$ {\displaystyle \Omega } which is absolutely continuous. Assume that $\xi _{\#}\mu$ {\displaystyle \xi _{\#}\mu } is absolutely continuous. Then there is a convex function $\varphi :\Omega \rightarrow R$ {\displaystyle \varphi :\Omega \rightarrow R} and a map $\sigma :\Omega \rightarrow \Omega$ {\displaystyle \sigma :\Omega \rightarrow \Omega } preserving $\mu$ {\displaystyle \mu } such that

$\xi =\left(\nabla \varphi \right)\circ \sigma$ {\displaystyle \xi =\left(\nabla \varphi \right)\circ \sigma }

In addition, $\nabla \varphi$ {\displaystyle \nabla \varphi } and $\sigma$ {\displaystyle \sigma } are uniquely defined almost everywhere.^[1]^[4]

Applications and connections

[edit ]

Dimension 1

[edit ]

In dimension 1, and when $\mu$ {\displaystyle \mu } is the Lebesgue measure over the unit interval, the result specializes to Ryff's theorem.^[5] When $d=1$ {\displaystyle d=1} and $\mu$ {\displaystyle \mu } is the uniform distribution over $\left[0,1\right]$ {\displaystyle \left[0,1\right]}, the polar decomposition boils down to

$\xi \left(t\right)=F_{X}^{-1}\left(\sigma \left(t\right)\right)$ {\displaystyle \xi \left(t\right)=F_{X}^{-1}\left(\sigma \left(t\right)\right)}

where $F_{X}$ {\displaystyle F_{X}} is cumulative distribution function of the random variable $\xi \left(U\right)$ {\displaystyle \xi \left(U\right)} and $U$ {\displaystyle U} has a uniform distribution over $\left[0,1\right]$ {\displaystyle \left[0,1\right]}. $F_{X}$ {\displaystyle F_{X}} is assumed to be continuous, and $\sigma \left(t\right)=F_{X}\left(\xi \left(t\right)\right)$ {\displaystyle \sigma \left(t\right)=F_{X}\left(\xi \left(t\right)\right)} preserves the Lebesgue measure on $\left[0,1\right]$ {\displaystyle \left[0,1\right]}.

Polar decomposition of matrices

[edit ]

When $\xi$ {\displaystyle \xi } is a linear map and $\mu$ {\displaystyle \mu } is the Gaussian normal distribution, the result coincides with the polar decomposition of matrices. Assuming $\xi \left(x\right)=Mx$ {\displaystyle \xi \left(x\right)=Mx} where $M$ {\displaystyle M} is an invertible $d\times d$ {\displaystyle d\times d} matrix and considering $\mu$ {\displaystyle \mu } the ${\mathcal {N}}\left(0,I_{d}\right)$ {\displaystyle {\mathcal {N}}\left(0,I_{d}\right)} probability measure, the polar decomposition boils down to

$M=SO$ {\displaystyle M=SO}

where $S$ {\displaystyle S} is a symmetric positive definite matrix, and $O$ {\displaystyle O} an orthogonal matrix. The connection with the polar factorization is $\varphi \left(x\right)=x^{\top }Sx/2$ {\displaystyle \varphi \left(x\right)=x^{\top }Sx/2} which is convex, and $\sigma \left(x\right)=Ox$ {\displaystyle \sigma \left(x\right)=Ox} which preserves the ${\mathcal {N}}\left(0,I_{d}\right)$ {\displaystyle {\mathcal {N}}\left(0,I_{d}\right)} measure.

Helmholtz decomposition

[edit ]

The results also allow to recover Helmholtz decomposition. Letting $x\rightarrow V\left(x\right)$ {\displaystyle x\rightarrow V\left(x\right)} be a smooth vector field it can then be written in a unique way as

$V=w+\nabla p$ {\displaystyle V=w+\nabla p}

where $p$ {\displaystyle p} is a smooth real function defined on $\Omega$ {\displaystyle \Omega }, unique up to an additive constant, and $w$ {\displaystyle w} is a smooth divergence free vector field, parallel to the boundary of $\Omega$ {\displaystyle \Omega }.

The connection can be seen by assuming $\mu$ {\displaystyle \mu } is the Lebesgue measure on a compact set $\Omega \subset R^{n}$ {\displaystyle \Omega \subset R^{n}} and by writing $\xi$ {\displaystyle \xi } as a perturbation of the identity map

$\xi _{\epsilon }(x)=x+\epsilon V(x)$ {\displaystyle \xi _{\epsilon }(x)=x+\epsilon V(x)}

where $\epsilon$ {\displaystyle \epsilon } is small. The polar decomposition of $\xi _{\epsilon }$ {\displaystyle \xi _{\epsilon }} is given by $\xi _{\epsilon }=(\nabla \varphi _{\epsilon })\circ \sigma _{\epsilon }$ {\displaystyle \xi _{\epsilon }=(\nabla \varphi _{\epsilon })\circ \sigma _{\epsilon }}. Then, for any test function $f:R^{n}\rightarrow R$ {\displaystyle f:R^{n}\rightarrow R} the following holds:

$\int _{\Omega }f(x+\epsilon V(x))dx=\int _{\Omega }f((\nabla \varphi _{\epsilon })\circ \sigma _{\epsilon }\left(x\right))dx=\int _{\Omega }f(\nabla \varphi _{\epsilon }\left(x\right))dx$ {\displaystyle \int _{\Omega }f(x+\epsilon V(x))dx=\int _{\Omega }f((\nabla \varphi _{\epsilon })\circ \sigma _{\epsilon }\left(x\right))dx=\int _{\Omega }f(\nabla \varphi _{\epsilon }\left(x\right))dx}

where the fact that $\sigma _{\epsilon }$ {\displaystyle \sigma _{\epsilon }} was preserving the Lebesgue measure was used in the second equality.

In fact, as $\textstyle \varphi _{0}(x)={\frac {1}{2}}\Vert x\Vert ^{2}$ {\displaystyle \textstyle \varphi _{0}(x)={\frac {1}{2}}\Vert x\Vert ^{2}}, one can expand $\textstyle \varphi _{\epsilon }(x)={\frac {1}{2}}\Vert x\Vert ^{2}+\epsilon p(x)+O(\epsilon ^{2})$ {\displaystyle \textstyle \varphi _{\epsilon }(x)={\frac {1}{2}}\Vert x\Vert ^{2}+\epsilon p(x)+O(\epsilon ^{2})}, and therefore $\textstyle \nabla \varphi _{\epsilon }\left(x\right)=x+\epsilon \nabla p(x)+O(\epsilon ^{2})$ {\displaystyle \textstyle \nabla \varphi _{\epsilon }\left(x\right)=x+\epsilon \nabla p(x)+O(\epsilon ^{2})}. As a result, $\textstyle \int _{\Omega }\left(V(x)-\nabla p(x)\right)\nabla f(x))dx$ {\displaystyle \textstyle \int _{\Omega }\left(V(x)-\nabla p(x)\right)\nabla f(x))dx} for any smooth function $f$ {\displaystyle f}, which implies that $w\left(x\right)=V(x)-\nabla p(x)$ {\displaystyle w\left(x\right)=V(x)-\nabla p(x)} is divergence-free.^[1]^[6]

References

[edit ]

^ ^a ^b ^c Brenier, Yann (1991). "Polar factorization and monotone rearrangement of vector‐valued functions" (PDF). Communications on Pure and Applied Mathematics. 44 (4): 375–417. doi:10.1002/cpa.3160440402 . Retrieved 16 April 2021.
^ Knott, M.; Smith, C. S. (1984). "On the optimal mapping of distributions" . Journal of Optimization Theory and Applications. 43: 39–49. doi:10.1007/BF00934745. S2CID 120208956 . Retrieved 16 April 2021.
^ Rachev, Svetlozar T. (1985). "The Monge–Kantorovich mass transference problem and its stochastic applications" (PDF). Theory of Probability & Its Applications. 29 (4): 647–676. doi:10.1137/1129093 . Retrieved 16 April 2021.
^ Santambrogio, Filippo (2015). Optimal transport for applied mathematicians. New York: Birkäuser. CiteSeerX 10.1.1.726.35 .
^ Ryff, John V. (1965). "Orbits of L1-Functions Under Doubly Stochastic Transformation" . Transactions of the American Mathematical Society. 117: 92–100. doi:10.2307/1994198. JSTOR 1994198 . Retrieved 16 April 2021.
^ Villani, Cédric (2003). Topics in optimal transportation. American Mathematical Society.

v t e Convex analysis and variational analysis
Basic concepts	Convex combination Convex function Convex set
Topics (list)	Choquet theory Convex geometry Convex metric space Convex optimization Duality Lagrange multiplier Legendre transformation Locally convex topological vector space Simplex
Maps	Convex conjugate Concave (Closed K- Logarithmically Proper Pseudo- Quasi-) Convex function Invex function Legendre transformation Semi-continuity Subderivative
Main results (list)	Carathéodory's theorem Ekeland's variational principle Fenchel–Moreau theorem Fenchel-Young inequality Jensen's inequality Hermite–Hadamard inequality Krein–Milman theorem Mazur's lemma Shapley–Folkman lemma Robinson–Ursescu Simons Ursescu
Sets	Convex hull (Orthogonally, Pseudo-) Convex set Effective domain Epigraph Hypograph John ellipsoid Lens Radial set/Algebraic interior Zonotope
Series	Convex series related ((cs, lcs)-closed, (cs, bcs)-complete, (lower) ideally convex, (Hx), and (Hwx))
Duality	Dual system Duality gap Strong duality Weak duality
Applications and related	Convexity in economics

v
t
e

Measure theory

Basic concepts

Sets

Types of measures

Atomic
Baire
Banach
Besov
Borel
Brown
Complex
Complete
Content
(Logarithmically) Convex
Decomposable
Discrete
Equivalent
Finite
Inner
(Quasi-) Invariant
Locally finite
Maximising
Metric outer
Outer
Perfect
Pre-measure
(Sub-) Probability
Projection-valued
Radon
Random
Regular
- Borel regular
- Inner regular
- Outer regular
Saturated
Set function
σ-finite
s-finite
Signed
Singular
Spectral
Strictly positive
Tight
Vector

Particular measures

Maps

Measurable function
- Bochner
- Strongly
- Weakly
Convergence: almost everywhere
of measures
in measure
of random variables
- in distribution
- in probability
Cylinder set measure
Random: compact set
element
measure
process
variable
vector
Projection-valued measure

Main results

Carathéodory's extension theorem
Convergence theorems
Decomposition theorems
- Hahn
- Jordan
- Maharam's
Egorov's
Fatou's lemma
Fubini's
- Fubini–Tonelli
Hölder's inequality
Minkowski inequality
Radon–Nikodym
Riesz–Markov–Kakutani representation theorem

Other results

Disintegration theorem Lifting theory Lebesgue's density theorem Lebesgue differentiation theorem Sard's theorem Vitali–Hahn–Saks theorem
For Lebesgue measure	Isoperimetric inequality Brunn–Minkowski theorem Milman's reverse Minkowski–Steiner formula Prékopa–Leindler inequality Vitale's random Brunn–Minkowski inequality

Applications & related

Retrieved from "https://en.wikipedia.org/w/index.php?title=Polar_factorization_theorem&oldid=1292434511"

The theorem

Applications and connections

Dimension 1

Polar decomposition of matrices

Helmholtz decomposition

See also

References