Active Tactile Exploration for Dynamic Pose and Shape Estimation

Ethan K. Gordon, Bruk Baraki, Hien Bui, Michael Posa

Bidirectional Chamfer Distance (bCH) Evaluation Metric
(Left) Simulated results for cuboid and polytope parameterizations.
(Top Right) Real robot results
Significant improvement over object-directed actions from random approach directions.
(Bottom Right) Example learning curves. Observed information increases with more data.

Trifinger Robot
Densetact 2.0 (Do, 2023)
Contact Boolean \(c_t\) and Normal \(\hat{n}_t\) are computed with Optical Flow and a Helmholz-Hodge Decomposition
Realsense for Ground-Truth only

Each action: 1s trajectory towards the current guess
Baseline (Random): Randomize approach direction
EIG (Ours): Choose the approach to maximize EIG, sampling-based optimization with Gaussian Cross-Entropy Method

Learning a physical model online can improve data efficiency, predictability, and reuse between tasks.
Previous Tactile System Identification Work: Static Objects and/or Strong Shape Priors and/or 2D
Learning: a violation-implicit loss function penalizes physical constraint violation while avoiding the numerical stiffness inherent in rigid-body contact.
We learn cuboid and convex polyhedra with less than 10s of randomly collected data.
Exploration: maximizing Expected Info Gain (EIG) leads to significantly faster learning.

Only tactile data is used to find the pose and geometry of an arbitrary dynamic convex object.

Learning Difficulty: gradients of dynamics \(f\) are near-0 or near-\(\infty\), which is not amenable to learning.
Solution: add inner optimization over contact forces \(\lambda\). Trade-off: solving a (fast) QP each gradient step for better gradients.

\(\sum_t \mathbb{P}(m_t | \theta, x_t) + ||x_t - f_\theta(x_{t-1}, \lambda)||^2\)

Where \(\lambda = \min g_\theta(x, \lambda)\)

\(\mathcal{L} = \sum_t \min_\lambda\mathbb{P}(m_t | \theta, x_t) + ||x_t - f_\theta(x_{t-1})||^2 + g_\theta(x, \lambda)\)

Physics Constrained MLE with Trajectory Optimization

Violation-Implicit Loss

\(\mathcal{I} = \sum_{m_t} \nabla\mathcal{L}(\theta, x_T) \left(\nabla \mathcal{L}(\theta, x_T\right)^T\)

Measurements \(m_t\)

\(\mathcal{F} = \mathbb{E}_{m_t}\left[\nabla\mathcal{L}(\theta, x_T) \left(\nabla \mathcal{L}(\theta, x_T\right)^T\right]\)

Sample future actions, Simulate to estimate \(\mathcal{F}\), then maximize Expected Information Gain (EIG)

\(EIG := \log\det\left(\mathcal{F}\mathcal{I}^{-1} + \mathbf{I}\right)\)

By Michael Posa

30" H x 40" W poster