Clayton Shonkwiler PRO
Mathematician and artist
A ring polymer in solution takes on an ensemble of random shapes, with topology (knot type!) as the unique conserved quantity.
Knotted DNA
Wassermann et al.
Science 229, 171–174
Knot complexity in DNA from P4 tailless mutants
Arsuaga et al., PNAS 99 (2002), 5373–5377
Is this surprising?
A polymer in solution takes on an ensemble of random shapes, with topology as the unique conserved quantity.
Modern polymer physics is based on the analogy between a polymer chain and a random walk.
– Alexander Grosberg
Protonated P2VP
Roiter/Minko
Clarkson University
Plasmid DNA
Alonso–Sarduy, Dietler Lab
EPF Lausanne
Generate \(n\) independent uniform random points on \(S^2\) and treat them as an ordered list of edge vectors.
Alvarado, Calvo, Millett, J. Stat. Phys. 143 (2011), 102–138
Random Polygon \(\Leftrightarrow\) point in some (nice!) conformation space
Knowledge of the (differential, symplectic, algebraic) geometry of these conformation spaces leads to both
theorems and fast numerical algorithms for studying and comparing polygons in \(\mathbb{R}^3\).
The edges of an \(n\)-gon (up to translation) are unit vectors, so the space of \(n\)-gons is the submanifold of \((S^2)^n\) implicitly defined by \(e_1 + \ldots + e_n = 0\), which is codimension 3.
The natural measure is Hausdorff measure; equivalently, the volume form induced by the Riemannian submanifold metric.
A random closed \(n\)-edge polygon is a \(k\)-edge random flight and an \((n-k)\)-edge random flight, conditioned on having the same end-to-end distance.
Classical Fact: The density of the end-to-end vector of an \(n\)-step random walk is
Proof: Fourier transform (since \(\mathrm{sinc}\) is the transform of the boxcar function).
This is piecewise-polynomial in \(\ell\) of degree \(n-3\)
Proposition (with Cantarella) The pdf of the chord connecting \(v_1\) with \(v_{k+1}\) in an \(n\)-gon is
where \(C_n = 2^{n-5}\pi^{n-4} \int_{-\infty}^{\infty} x^2 \,\mathrm{sinc}^n x \,\mathrm{d}x\).
Fact: This is piecewise-polynomial in \(\ell\) of degree \(n-4\).
Why are the expectations rational?
Why degree \(n-4\)?
Where are these crazy polynomials coming from?
Continuous symmetry \(\Rightarrow\) conserved quantity
Duistermaat–Heckman Theorem (stated informally)
On a \(2m\)-dimensional symplectic manifold, \(d\) commuting Hamiltonian symmetries (a Hamiltonian \(T^d=(S^1)^d\)-action) induce \(d\) conserved quantities (momenta).
The joint distribution of the momenta on \(\mathbb{R}^d\) is continuous, piecewise polynomial, degree \(\leq m-d\).
A symplectic manifold \((M,\omega)\) is an even-dimensional manifold \(M\) together with a closed, non-degenerate 2-form \(\omega\).
Example: \((S^2,d\theta\wedge dz)\)
Example: \((\mathbb{R}^2,dx \wedge dy) = (\mathbb{C},\frac{i}{2}dz \wedge d\bar{z})\)
Example: \((T^*\mathbb{R}^n, \sum_{i=1}^n dq_i \wedge dp_i)\)
Example: \((S^2,\omega)\), where \(\omega_p(u,v) = (u \times v) \cdot p\)
Example: \((\mathbb{R}^2,\omega)\) where \(\omega(u,v) = \langle i u, v \rangle \)
This is the right setting for Hamiltonian mechanics.
Example: \((\mathbb{C}^n, \frac{i}{2} \sum dz_k \wedge d\overline{z}_k)\)
Theorem (Archimedes, Duistermaat–Heckman)
Let \(f: S^2 \to \mathbb{R}\) be given by \(f(x,y,z) = z\). Pushing forward the uniform measure on \(S^2\) to the image \([-1,1]\) gives Lebesgue measure.
Proposition If we rotate both factors of \(S^2 \times S^2\) simultaneously around the \(z\)-axis, the conserved quantity is \(z_1 + z_2\), and the pushforward measure on \([-2,2]\) has density given by the triangle function.
Duistermaat–Heckman Theorem (stated informally) On a \(2m\)-dimensional symplectic manifold, \(d\) commuting Hamiltonian symmetries (a Hamiltonian \(T^d\)-action) induce \(d\) conserved quantities (momenta).
The joint distribution of the momenta is continuous, piecewise polynomial, degree \(\leq m-d\).
\(n\)-gons up to rotation are \(2m=(2n-6)\)-dimensional, so chord length is piecewise polynomial of degree \(\leq\)
\(m-1=(n-3)-1=n-4\)
We actually have more symmetries!
Rotations around \(n-3\) chords \(d_i\) by \(n-3\) angles \(\theta_i\) commute.
Theorem (with Cantarella, ’16)
The joint distribution of \(d_1,\ldots , d_{n-3}\) and \(\theta_1, \ldots , \theta_{n-3}\) are all uniform on their domains.
Proof: Check that D–H applies (this is the hard part, since the torus action is not defined everywhere).
Then count: \(m=n-3\) and we have \(d=n-3\) symmetries, so the pdf of the \(d_i\) is piecewise polynomial of degree \(\leq\)
\(m-d = (n-3)-(n-3)=0\).
Since the pdf is continuous and the domain is connected, it must be constant.
Definition: If \((M,\omega)\) is \(2m\)-dimensional and has \(m\) commuting symmetries, we call \(M\) toric.
The \((n-3)\)-dimensional moment polytope \(\mathcal{P}_n \subset \mathbb{R}^{n-3}\) is defined by the triangle inequalities
There exists an almost-everywhere defined map \(\alpha: \mathcal{P}_n \times (S^1)^{n-3} \to \{n\text{-gons}\}\) which is measure-preserving.
This is only sensible as a map to polygons modulo translation and rotation.
We have reduced the problem of sampling a random polygon to the problem of sampling a (specific) convex polytope.
Theorem (Smith, 1984)
For any convex polytope \(\mathcal{P}\), the hit-and-run Markov chain is uniformly ergodic with respect to Lebesgue measure on \(\mathcal{P}\).
Definition (TSMCMC\((\beta)\))
If \(x_k = (p_k,\theta_k) \in \mathcal{P_n} \times T^{n-3}\), define \(x_{k+1}\) by:
At each step, construct the corresponding polygon \(\alpha(p_n, \theta_n)\).
Theorem (with Cantarella, 2016)
TSMCMC(\(\beta\)) is uniformly ergodic with respect to the standard probability measure on random polygons modulo translation and rotation.
Theorem (with Cantarella, 2016)
If \((M,\omega)\) is toric symplectic with moment polytope \(\mathcal{P} \subset \mathbb{R}^m\), then TSMCMC(\(\beta\)) on \(\mathcal{P} \times T^m\) is uniformly ergodic with respect to the symplectic volume measure on \(M\).
For example, polygons in (rooted) spherical confinement.
For unconfined polygons, we can do better: no Markov chains required!
Introduce fake chordlengths \(d_0=1=d_{n-2}\) and make the linear change of variables
\(s_i = d_i - d_{i-1} \text{ for } 1 \leq i \leq n-2\).
Then \(\sum s_i = d_{n-2} - d_0 = 0\), so \(s_{n-2}\) is determined by \(s_1, \ldots , s_{n-3}\)
and the inequalities
become
\(-1 \leq s_i \leq 1, -1 \leq \sum_{i=1}^{n-3} s_i \leq 1\)
\(\sum_{j=1}^i s_j + \sum_{j=1}^{i+1}s_j \geq -1\)
Let \(\mathcal{C}_n \subset \mathbb{R}^{n-3}\) be determined by
\(-1 \leq s_i \leq 1, -1 \leq \sum_{i=1}^{n-3} s_i \leq 1\)
\(\sum_{j=1}^i s_j + \sum_{j=1}^{i+1}s_j \geq -1\)
\(\mathcal{C}_5\)
\(\mathcal{C}_6\)
Proposition (with Cantarella, Duplantier, Uehara, 2016)
The probability that a point in the hypercube lies in \(\mathcal{C}_n\) is asymptotic to
\(6 \sqrt{\frac{6}{\pi}}\frac{1}{n^{3/2}}\)
Theorem (Khoi, Takakura, Mandini, 2000s)
The volume of \(\mathcal{C}_n\) is exactly
Observation (Edwards, 1922)
[T]here is in these days far too great a tendency on the part of teachers to push on their pupils so fast to the Higher Branches of Analysis or to Physical Mathematics that many have neither time nor opportunity for the cultivation of real personal proficiency, or for the acquirement of that individual manipulative skill which is essential to any real confidence of the student in his own power to conduct unaided investigation.
Action-Angle Method
Theorem (with Cantarella, Duplantier, Uehara, 2016)
The action-angle method directly samples polygon space in expected time \(\Theta(n^{5/2})\).
\(O(n)\) time
acceptance probability \(\sim \frac{1}{n^{3/2}}\)
RandomDiagonals[n_] :=
Accumulate[
Join[{1}, RandomVariate[UniformDistribution[{-1, 1}], n]]];
InMomentPolytopeQ[d_] :=
And[Last[d] >= 0, Last[d] <= 2,
And @@ (Total[#] >= 1 & /@ Partition[d, 2, 1])];
DiagonalSample[n_] := Module[{d},
For[d = RandomDiagonals[n], ! InMomentPolytopeQ[d], ,
d = RandomDiagonals[n]];
d[[2 ;;]]
];
Not the first direct sampler (Grosberg–Moore), but simple and fast.
Frequency plot of the HOMFLY polynomials produced by sampling 10 million random 60-gons (there were a total of 6371 distinct HOMFLYs).
cf. Baiesi–Orlandini–Stella
If we round every point \((d_1),\ldots , d_{n-3})\) in the moment polytope to the nearest point with half-integer coordinates \(\frac{1}{2}(x_1, \ldots , x_{n-3})\) …
… and subdivide the regions of attraction by
ordering and sign of \(d_i - \frac{1}{2} x_i\), we get a collection of identical, orthogonal simplices of equal area (this works for any \(n\)):
Proposition*: The number of half-integer points in \(\mathcal{P}_n\) is the Motzkin number
Therefore, there are exponentially many simplices in this decomposition. ☹️
Theorem (K. Chapman, in progress): With \(O(n^3)\) preprocessing of the transition probabilities, a direct sampling algorithm for equilateral \(n\)-gons in \(O(n^2 (\log n)^2)\) time.
The simplices are indexed by certain permutations.
One can recursively construct Lehmer codes of valid permutations using a Markov chain.
What’s the corresponding story for planar \(n\)-gons? Or \(n\)-gons in \(\mathbb{R}^d\)?
The space of planar \(n\)-gons up to translation and rotation is \((n-3)\)-dimensional, so there is no chance of a symplectic structure.
Proposition: The space of \(n\)-gons in \(\mathbb{R}^d\) is the quotient of (almost all) of the space \((S^{d-1})^n\) by the (diagonal) action of the Möbius group.
Current Project: Use this structure to provide robust coordinates for polygons in all dimensions.
Question: Can this be adapted to give an algorithm for sampling the natural probability measure on \(n\)-gons?
The symplectic geometry of closed equilateral random walks in 3-space
J. Cantarella & C. Shonkwiler
Annals of Applied Probability 26 (2016), no. 1, 549–596
A fast direct sampling algorithm for equilateral closed polygons
J. Cantarella, B. Duplantier, C. Shonkwiler, & E. Uehara
Journal of Physics A 49 (2016), no. 27, 275202
Sampling knots using orthoschemes
K. Chapman
In preparation
Funding: Simons Foundation
By Clayton Shonkwiler
In statistical physics, the basic (and highly idealized) model of a ring polymer like bacterial DNA is a closed random flight in 3-space with equal-length steps, often called an equilateral random polygon. While random flights without the closure condition are easy to simulate and analyze, the fact that the steps in a random polygon are not independent has made it challenging to develop practical yet provably correct sampling and numerical integration techniques for polygons. In this talk I will describe a geometric approach to the study of random polygons which overcomes these challenges. The symplectic geometry of the space of polygon conformations can be exploited to produce both Markov chain and direct sampling algorithms; in fact, this approach can be generalized to produce a sampling theory for arbitrary toric symplectic manifolds. This is joint work with Jason Cantarella, Bertrand Duplantier, and Erica Uehara.