Active Tactile Exploration for Dynamic Pose and Shape Estimation


Ethan K. Gordon, Bruk Baraki, Hien Bui, Michael Posa
https://dairlab.github.io/activetactile
Supported by: NSF CAREER Award FRR-2238480, RAI Institute
ethankg@seas.upenn.edu
RESULTS



- Bidirectional Chamfer Distance (bCH) Evaluation Metric
- (Left) Simulated results for cuboid and polytope parameterizations.
- (Top Right) Real robot results
- Significant improvement over object-directed actions from random approach directions.
- (Bottom Right) Example learning curves. Observed information increases with more data.

EXPERIMENT SETUP

- Trifinger Robot
- Densetact 2.0 (Do, 2023)
- Contact Boolean \(c_t\) and Normal \(\hat{n}_t\) are computed with Optical Flow and a Helmholz-Hodge Decomposition
- Realsense for Ground-Truth only
INTRODUCTION
- Each action: 1s trajectory towards the current guess
- Baseline (Random): Randomize approach direction
- EIG (Ours): Choose the approach to maximize EIG, sampling-based optimization with Gaussian Cross-Entropy Method
- Learning a physical model online can improve data efficiency, predictability, and reuse between tasks.
- Previous Tactile System Identification Work: Static Objects and/or Strong Shape Priors and/or 2D
- Learning: a violation-implicit loss function penalizes physical constraint violation while avoiding the numerical stiffness inherent in rigid-body contact.
- We learn cuboid and convex polyhedra with less than 10s of randomly collected data.
- Exploration: maximizing Expected Info Gain (EIG) leads to significantly faster learning.

Only tactile data is used to find the pose and geometry of an arbitrary dynamic convex object.
METHOD AND FRAMEWORK

- Learning Difficulty: gradients of dynamics \(f\) are near-0 or near-\(\infty\), which is not amenable to learning.
- Solution: add inner optimization over contact forces \(\lambda\). Trade-off: solving a (fast) QP each gradient step for better gradients.
\(\sum_t \mathbb{P}(m_t | \theta, x_t) + ||x_t - f_\theta(x_{t-1}, \lambda)||^2\)
Where \(\lambda = \min g_\theta(x, \lambda)\)
\(\mathcal{L} = \sum_t \min_\lambda\mathbb{P}(m_t | \theta, x_t) + ||x_t - f_\theta(x_{t-1})||^2 + g_\theta(x, \lambda)\)
Physics Constrained MLE with Trajectory Optimization
Violation-Implicit Loss
- Observed Information: empirical variance of loss gradient
\(\mathcal{I} = \sum_{m_t} \nabla\mathcal{L}(\theta, x_T) \left(\nabla \mathcal{L}(\theta, x_T\right)^T\)
Measurements \(m_t\)
- Fisher Information: expected variance given future \(m_t\)
\(\mathcal{F} = \mathbb{E}_{m_t}\left[\nabla\mathcal{L}(\theta, x_T) \left(\nabla \mathcal{L}(\theta, x_T\right)^T\right]\)
- Sample future actions, Simulate to estimate \(\mathcal{F}\), then maximize Expected Information Gain (EIG)
\(EIG := \log\det\left(\mathcal{F}\mathcal{I}^{-1} + \mathbf{I}\right)\)
Ethan GRASP Summit Poster
By Michael Posa
Ethan GRASP Summit Poster
30" H x 40" W poster
- 16