Tractable Adaptability
Ethan K. Gordon
Postdoc, University of Pennsylvania
PhD 2023, University of Washington


Active Learning for Contact-Rich Assistive Manipulation

Ethan at Lehigh:

Physically Assistive Robots (PARs)
“If I can have a robot do it, I can learn to adapt to it, but it would be me feeding me, and that would be huge”
Tyler Schrenk
1985-2023


The Promise of PARs:
- Empowerment
- Independence
Tractable Adaptability
How can robots efficiently learn, during deployment, how to manipulate previously-unseen objects?
What is needed for pHRI?
Contact-Rich Manipulation
- Sliding to clean the spoon and bowl
- Shaking to smoothen
- In-Mouth Hand-Off
(vision-denied)
Online Adaptation
- Bite Size Adjustment
What is needed for pHRI?
Online Adaptation
- Totally Different Food
- Multi-bite: different shapes for each bite
There is no time for
re-training!
The Technology/Application Cycle
Support
Inform
Physically Assistive Robots (PARs)
Active Learning in Contact







The Technology/Application Cycle
Support
Inform
Physically Assistive Robots (PARs)
Active Learning in Contact
HRI 2025
HRI 2020
HRI 2024
NIPS 2018
RA-L 2022
ICRA 2023
ICRA 2022
Preprint 2026
ISRR 2019
IROS 2020
ICRA 2023
CoRL 2023
Autonomous Bite Acquisition
Problem: Bite Acquisition
Online Bite Acquisition Challenges
- Large Action Space (whole trajectory) OR Sparse Reward
- Unknown Dynamics / State Transition: food simulation is hard!
Solution: Data-Driven Policy-Space Simplification



(Gordon, CoRL 2023)Learn This Online!
Emergent Behavior
Wiggling
Tilting
High Pressure
Scooping
Multimodal Online Learning
- High-Dimension State Space: Foods are really diverse!
Solution: Haptic Policy Regularization
(Bhattacharjee, R-AL 2019); (Gordon, IROS 2020); (Gordon, ICRA 2021)

Optimize Jointly
Active Learning in Contact
Dynamic Object System Identification
Choose:
-
Robot Trajectory \(r[t]\)
Measure:
-
Contact Boolean \(c_t\)
-
Contact Normal \(\hat{n}_t\)
-
Proprioception
Find:
-
Object Geometry \(\theta^*\)
-
Object Pose \(x^*_T\)
Learning with a Violation-Implicit Loss
Information Maximization In Action



Expected Information Gain (EIG)
Learn; Compute
Observed Info \(\mathcal{I}\)
Sample + Simulate
Expected Fisher Info \(\mathcal{F}\)
\(\max EIG := \log\det\left(\mathcal{F}\mathcal{I}^{-1} + \mathbf{I}\right)\)
Choose actions where simulated, expected Fisher info is distinct from Observed info.




Robot-Assisted Feeding
User Studies Capture Diversity
(Bhattacharjee, HRI 2020)User Studies Capture Metrics
(Bhattacharjee, HRI 2020)Trade-off between autonomy (with chance of error) and high-effort manual control.
What errors are tolerable?
Community-Based Participatory Design
(Gordon, HRI Companion 2024)Community-Based Participatory Design
(Nanavati, HRI 2025)Research Plans
Safe Active Exploration in Contact


(ActSafe, ICLR 2025)Research Plans
Dressing "Acquisition"


(TOORAD, Autonomous Robots 2019)Research Plans
Multi-Function PARs




?
Thank you!










DAIR Lab
Amal Nanavati




Tractable Adaptability
Ethan K. Gordon
Postdoc, University of Pennsylvania
PhD 2023, University of Washington


Active Learning for Contact-Rich Assistive Manipulation
Structure Through Expert-Defined Heuristics
- Qualitative Taxonomy of Single-Utensil Bite Acquisition

- Convert to Action Schema:
- \(\mathbb{R}^{14} \times SO(3) \times S^2\)
- Force and Torque Thresholds
Benefits: Interpretable, Continuity (Similar Numbers \(\rightarrow\) Similar Action)
(Gordon, CoRL 2023); (Bhattacharjee, R-AL 2019) 
Data-Driven Discretization

(Gordon, CoRL 2023)


Exploration vs. Exploitation: Contextual Bandits

(Gordon, IROS 2020); SPANet from (Feng, IJRR 2019)
Incorporating Haptic Information
(Gordon, ICRA 2021); (Bhattacharjee, R-AL 2019)

Classification with 50ms of 6DOF F/T Data

\(l_t = c_t^T\theta^* + \epsilon_\theta = p_t^T\phi^* + \epsilon_\phi\)
Optimize both simultaneously, regularizing them against each other.
Exploration vs. Exploitation: Contextual Bandits

(Gordon, ICRA 2021)Active Learning for pHRI: Spinning the Flywheel






User-Informed:
Metrics
Priorities
Limitations
Contact-Rich Active Learning:
Model-Based
Policy Simplification
Multimodal Sensing
Community-Based:
System Design
Implementation
Pain Point Identification
Lehigh Zoom Interview
By Michael Posa
Lehigh Zoom Interview
3-5min
- 5