Théo Dumont
PhD student in optimal transport & geometry @ Université Gustave Eiffel
Théo Dumont
slides available at https://slides.com/theodumont/geometric-ot
Definition. (OT Kantorovitch problem)
not feasible by a map!
π is induced by a transport map φ
π is a transport plan
relaxation
∣detdφ−1∣μ0∘φ−1
[Monge, 1781], [Kantorovitch, 1942]
Optimal Transport
Definition. (OT Monge problem)
Can we say that the solution of (K) is a map?
?
If μ0 has a density, then there is a unique solution to (K), and it is of the form φ=∇f with f:Rn→R convex.
Theorem. (Brenier)
relaxation
π is induced by a transport map φ
π is a transport plan
Monge (maps)
Kantorovitch (plans)
[Brenier, 1987]
Optimal Transport
Smooth densities:
can we recover classical results of OT theory with a geometric picture?
?
Definition. (Smooth OT problem)
Smooth OT
Smooth maps:
not right-invariant! only by action of Diffμ0(Rn)
!
Theorem. (Geodesic equation on Diff(Rn))
or if we write φ˙=v∘φ,
inviscid Burgers v˙+∇vv=0
horizontal?
finding the shortest geodesic from id to C(μ0,μ1)
OT(μ0,μ1)=φ∈C(μ0,μ1)mind2(id,φ)
A first link with OT
[Otto, 2001], [Kriegl & Michor, 1997], [Modin, 2015]
Diffeomorphism group
π:Diff(Rn)⟶Dens(Rn)
φ⟼φ∗μ0
Fiber over μ1:
{φ∣φ∗μ0=μ1}=C(μ0,μ1)
Fiber over μ0:
{φ∣φ∗μ0=μ0}=Diffμ0(Rn)
dπ(φ):TφDiff(Rn)⟶TμDens(Rn)
v∘φ⟼−div(μv)
The submersion
π:φ↦φ∗μ0 is a smooth submersion.
vertical distribution:
horizontal distribution:
Any u∈X(Rn) can be written as
u=v+∇p
with div(μ0v)=0 and p∈C∞(Rn).
Theorem. (Helmholtz/hodge decomposition)
right-invariance under action of fiber Diffμ0(Rn)
π induces a metric on Diff(Rn)/Diffμ0(Rn)≃Dens(Rn)
independent
of G
depends
on G!
The submersion
(smooth submersion)
π is a Riemannian submersion
Pythagoras ⟹φ˙ has to be horizontal!
where μ˙ and ∇p are linked by μ˙=−div(μ∇p)
Theorem. (Induced metric on Dens(Rn))
where Δμp=div(μ∇p).
The submersion
(Riemannian submersion)
what do the geodesics look like in Dens(Rn)?
?
geodesics in Dens(Rn)
horizontal geodesics in Diff(Rn)
⟺
(Hamilton-Jacobi)
(continuity equation)
Theorem. (Geodesic equation on Dens(Rn))
The induced geodesic distance on Dens(Rn) is the OT distance!
(see previous computation d2(id,φ)=OT(μ0,μ1))
at time t=1: φ1=id+∇p=∇f
what's the final map?
?
Brenier's theorem
The submersion
(Riemannian submersion)
⟹ this is just finding the curve of minimal energy between μ0 and μ1 in Dens(Rn),
i.e. finding a (horizontal) geodesic!
where μ˙t+div(μtvt)=0, and where μt has the right endpoints.
[Benamou & Brenier, 2000]
Usefulness:
Applications
Benamou-Brenier
Theorem. (Benamou-Brenier, dynamic formulation of OT)
Any φ∈Diff(Rn) can be written as
φ=∇f∘ϕ
with f∈C∞(Rn) and ϕ∈Diffμ0(Rn).
[Brenier, 1987]
Applications
Polar factorization
Theorem. (Polar factorization)
The gradient flow of μ w.r.t. a functional F is
Fréchet derivative
Example:
entropy
potential
+ heat flow
Fokker-Planck equation.
Applications
Gradient flows
Definition. (Gradient flow)
OT = finding the shortest geodesic from id to constraint set C(μ0,μ1)
horizontal
We recover:
cf. Benamou-Brenier formulation
Riemannian submersion (Diff(Rn),L2(μ0))⟶π(Dens(Rn),OT)
Geodesic equation on... | is... |
---|---|
inviscid Burgers | |
incompressible Euler | |
Hamilton-Jacobi + contin. eqn. |
Gradient flow of... | is... |
---|---|
entropy | heat flow |
entropy + potential | Fokker-Planck |
loss functional L | training inf. wide NN |
[Chizat & Bach, 2018]
(local)
Recap
OT | Inform. theory | Unbalanced OT | LDDMM | Metamorphoses | |
---|---|---|---|---|---|
top space | |||||
metric | |||||
right-invariant? | |||||
bottom space | anything with an action |
anything with an action | |||
action | ... | ... | |||
metric | Wasserstein | Fisher-Rao | Wasserstein-Fisher-Rao | induced metric | induced metric |
[Bauer, Bruveris & Michor, 2016], [Gallouët & Vialard, 2018], [Younes, 2010], [Trouvé & Younes, 2005], [Modin, 2015]
finite-dimensional equivalents when restricting to Gaussian measures:
submersion GL(n)→PSD(n),
induces Bures-Wasserstein and Fisher-Rao
Some other submersions
L. Ambrosio, N. Gigli, and G. Savaré. Gradient flows: in metric spaces and in the space of probability measures, 2005.
J.-D. Benamou and Y. Brenier. A computational fluid mechanics solution to the Monge-Kantorovich mass transfer problem, 2000.
M. Bauer, M. Bruveris, and P.W. Michor. Uniqueness of the Fisher–Rao metric on the space of smooth densities, 2016.
Y. Brenier. Décomposition polaire et réarrangement monotone des champs de vecteurs, 1987.
L. Chizat and F. Bach. On the global convergence of gradient descent for over-parameterized models using optimal transport, 2018.
W. Gangbo, H.K. Kim, and T. Pacini. Differential forms on Wasserstein space and infinite-dimensional Hamiltonian systems, 2010.
T. Gallouët and F.-X. Vialard. The Camassa–Holm equation as an incompressible Euler equation: A geometric point of view, 2018.
A. Kriegl and P.W. Michor. The convenient setting of global analysis, 1997.
K. Modin. Geometry of matrix decompositions seen through optimal transport and information geometry, 2016.
F. Otto. The geometry of dissipative evolution equations: the porous medium equation, 2001.
A. Trouvé and L. Younes. Metamorphoses through lie group action, 2005.
L. Younes. Shapes and diffeomorphisms, 2010.
slides available at https://slides.com/theodumont/geometric-ot
•
•
•
•
•
•
•
•
•
•
•
•
References
By Théo Dumont
Talk about the infinite-dimensional Riemannian geometry of Optimal Transport for the shape analysis seminar (https://shape-analysis.github.io/) at the MAP5 lab.
PhD student in optimal transport & geometry @ Université Gustave Eiffel