\(\mathcal{L} = -\log p(m_t | \theta, x_{<T}) =\sum_t -\log p(m_t | \theta, x_t) + ||x_t - f_\theta(x_{t-1}, \lambda)||^2\)
Where \(\lambda = \min g_\theta(x, \lambda)\)
\(\mathcal{L} = \sum_t \min_\lambda -\log p(m_t | \theta, x_t) + ||x_t - f_\theta(x_{t-1})||^2 + g_\theta(x, \lambda)\)
Physics Constrained MLE with Trajectory Optimization
Violation-Implicit (VIMP) Loss
\(\mathcal{I} = \sum_{m_t} \left(\nabla_{\theta, x_T}\log p(m_t|\theta, x_T)\right)^2\)
\(\mathcal{F} = \mathbb{E}_{m_t}\left[\mathcal{I}(m_t, \theta, x_{H>T})\right]\)
\(EIG := \log\det\left(\mathcal{F}\mathcal{I}^{-1} + \mathbf{I}\right)\)
w.r.t. Current Time T
w.r.t. Future Time H
Challenge: How to compute \(\nabla \log p(m_t | x_T)\), i.e. sensitivity of past measurements to the future state?
Computing \(g=\nabla \log p(m_t | x_T)\)
Rejected: Backwards Simulation
\(g = \nabla \log p(m_t|x_t(x_T))\)
Ill-defined for frictional contact.
Baseline 1: Identity Jacobian
\(\nabla_{x_T}x_t = \nabla_{x_t}x_T = \mathbb{I}\)
Treats object as quasi-static
Baseline 2: Diffsim
Compute \(g=\nabla \log p (m_t|x_0)\) instead.
Poorly conditioned numerically
Proposed: Marginalize + Sample
\(g \approx \nabla \log \sum_{x_t}p(m_t|x_t)p(x_t|x_T)\)
Sample \(x_t\) w/ MCMC, use vimp loss \(\mathcal{L}\)
\(\approx softmax_{x_t}(\log p(m_t|x_t)) \cdot \nabla\mathcal{L}\)
Hardware Setup
Experiment Design
Project Website
Next Steps: expand experiment to proposed (marginalize + sample) EIG formulation.
Challenge: Use only tactile data to find the pose and geometry of an arbitrary dynamic convex object.
Fast sampling with learned trajectory
\(-\log p(m_t | \theta, x_{<T}) =\sum_t -\log p(m_t | \theta, x_t) + ||x_t - f_\theta(x_{t-1}, \lambda)||^2\) s.t. \(\lambda = \min g_\theta(x, \lambda)\)
\(\mathcal{L} = \sum_t \min_\lambda -\log p(m_t | \theta, x_t) + ||x_t - f_\theta(x_{t-1})||^2 + g_\theta(x, \lambda)\)
Physics Constrained MLE with Trajectory Optimization
Violation-Implicit (VIMP) Loss
\(\mathcal{I} = \sum_{m_t} \left(\nabla_{\theta, x_T}\log p(m_t|\theta, x_T)\right)^2\)
\(\mathcal{F} = \mathbb{E}_{m_t}\left[\mathcal{I}(m_t, \theta, x_{H>T})\right]\)
\(EIG := \log\det\left(\mathcal{F}\mathcal{I}^{-1} + \mathbf{I}\right)\)
w.r.t. Current Time T
w.r.t. Future Time H
Challenge: How to compute \(g=\nabla \log p(m_t | x_T)\), i.e. sensitivity of past measurements to the future state?
Rejected: Backwards Simulation
\(g=\nabla \log p(m_t|x_t(x_T))\)
Ill-defined for frictional contact.
Baseline 1: Identity Jacobian \(\nabla_{x_T}x_t = \nabla_{x_t}x_T = \mathbb{I}\)
Treats object as quasi-static
Fails for left example: \(\mathcal{F}\) only non-0 for 1 face.
Baseline 2: Differentiable Simulation
Compute \(g=\nabla \log p (m_t|x_0)\) instead.
Poorly conditioned numerically
Same as learning difficulty (above).
Proposed: Marginalize + Sample
\(g \approx \nabla \log \sum_{x_t}p(m_t|x_t)p(x_t|x_T)\)
\(\approx softmax_{x_t}(\log p(m_t|x_t)) \cdot \nabla\mathcal{L}\)
Sample \(x_t\) w/ MCMC, use vimp loss \(\mathcal{L}\)
Project Website
Next Steps: expand experiment to proposed (marginalize + sample) EIG formulation.
Challenge: Use only tactile data to find the pose and geometry of an arbitrary dynamic convex object.
Fast sampling with learned trajectory
Example (right): We have only touched 1 face, but the dynamics provides info about 3 faces.
\(m_1\)
\(x_1\)
\(m_T\)
\(x_T\)
\(m_1\)
\(x_1\)
\(m_2\)
\(x_2\)