Simon Le Cleac'h
august 16, 2022
Simon Le Cleac'h
contact is the primary mode of interaction in robotics
how can we leverage models and gradient information to solve contact-rich robotics tasks?
control parameters | state parameters | model parameters |
---|
optimal control |
motion synthesis | mechanism design |
imitation learning | state estimation | system identification |
movement costs |
---|
model-data mismatch |
Emo Todorov, Optico: A Framework for Model-Based Optimization with MuJoCo Physics, NeurIPS 2019
contact physics
LCP
implicit complementarity
gradients
samples
subgradient
stability at low rates
variational integrator
interior-point methods
accurate contact dynamics
implicit differentiation
smooth gradients
Discrete mechanics and variational integrators. J. E. Marsden and M. West.
discretize
discretize
Euler-Lagrange
Euler-Lagrange
compare astronaut energy and momentum conservation to MuJoCo
MuJoCo linear
Dojo linear
MuJoCo nonlinear
Dojo nonlinear
no collision violations
correct Coulomb friction
interior-point method
impact → inequalities
friction → second-order cone
cone constraints
custom interior-point solver
nonlinear complementarity problem
custom interior-point solver
physics
physics
robot
environment
object
learned
→
residual
solution
parameters
sensitivity of solution w.r.t problem data
computation cost of gradient is less than simulation step
Lezioni di analisi infinitesimale. U. Dini.
less expensive to compute compared to finite-difference or stochastic sampling
randomized smoothing
finite difference
Dojo
matrix backward substitution
matrix factorization
matrix factorization
box push
non-smooth dynamics
gradient comparison
Dojo
randomized smoothing
differentiate intermediate barrier problems for smooth gradients
smooth-gradient-based optimization with iterative LQR
stability at low rates enables 2-5x sample-complexity improvement over MuJoCo
train static linear policies for locomotion
gradients enable 5-10x sample-complexity improvement over derivative-free method
stability at low rates enables 2-5x sample-complexity improvement over MuJoCo
ContactNets: Learning Discontinuous Contact Dynamics with Smooth, Implicit Representations. S. Pfrommer, M. Halm, and M. Posa.
learned
ground-truth
real-word dataset
Dojo environment
geometry
friction coefficient
ground-truth
learned
Quasi-Newton method utilizes gradients to
learn parameters to 95% accuracy in 20 steps
Fast Contact-Implicit Model-Predictive Control.
S. Le Cleac'h & T. Howell, C. Lee, S. Yang, M. Schwager, Z. Manchester
simulation
push recovery
behavior generation
running policy at 200-500Hz
Global Planning for Contact-Rich Manipulation via Local Smoothing of Quasi-dynamic Contact Models, Tao Pang∗, H.J. Terry Suh∗, Lujie Yang and Russ Tedrake,
RRT using the same smoothed derivatives as Dojo
Differentiable Physics Simulation of Dynamics-Augmented Neural Objects. S. Le Cleac'h, HX. Yu, M. Guo, T. Howell, R. Gao, J. Wu, Z. Manchester, M. Schwager
dynamics-augmented NeRF → complex collision geometries
control parameters | state parameters | model parameters |
---|
optimal control |
motion synthesis | mechanism design |
imitation learning | state estimation | system identification |
movement costs |
---|
model-data mismatch |
Emo Todorov, Optico: A Framework for Model-Based Optimization with MuJoCo Physics, NeurIPS 2019
differentiable simulator
online optimization-based policy
(MPC, etc.)
control inputs
gradients
gradients
offline optimization
(RL, RRT, etc.)
motion plans
trajectories
Taylor Howell
Simon Le Cleac'h
Jan Brüdigam
Zico Kolter
Mac Schwager
Zachary Manchester
Shuo Yang
Chi Yen Lee
Sumeet Singh
Simon Le Cleac'h
Pete Florence
differentiable
simulation
sensor data
point cloud data
noisy pose measurements
geometry, friction, mass, etc.
learned dynamics representation
differentiable
dynamics
differentiable
point cloud
Sumeet Singh
Simon Le Cleac'h
Pete Florence
august 10, 2022
Simon Le Cleac'h
a differentiable physics engine for robotics
Simon Le Cleac'h and Taylor Howell
Taylor Howell
Simon Le Cleac'h
thowell@stanford.edu
simonlc@stanford.edu
contact physics
LCP
implicit complementarity
gradients
samples
subgradient
stability at low rates
variational integrator
interior-point methods
accurate contact dynamics
implicit differentiation
smooth gradients
Discrete mechanics and variational integrators. J. E. Marsden and M. West.
discretize
discretize
Euler-Lagrange
Euler-Lagrange
compare astronaut energy and momentum conservation to MuJoCo
MuJoCo linear
Dojo linear
MuJoCo nonlinear
Dojo nonlinear
no collision violations
correct Coulomb friction
interior-point method
impact → inequalities
friction → second-order cone
cone constraints
nonlinear complementarity problem
custom interior-point solver
→
residual
solution
parameters
sensitivity of solution w.r.t problem data
computation cost of gradient is less than simulation step
differentiate intermediate barrier problems for smooth gradients
Lezioni di analisi infinitesimale. U. Dini.
box push
non-smooth dynamics
gradient comparison
Dojo
randomized smoothing
less expensive to compute compared to finite-difference or stochastic sampling
less expensive to compute compared to finite-difference or stochastic sampling
randomized smoothing
finite difference
Dojo
matrix backward substitution
matrix factorization
matrix factorization
Linear-Time Variational Integrators in Maximal Coordinates. J. Brudigam and Z. Manchester.
Linear-Time Contact and Friction Dynamics in Maximal Coordinates using Variational Integrators.
J. Brudigam and Z. Manchester.
smooth-gradient-based optimization with iterative LQR
stability at low rates enables 2-5x sample-complexity improvement over MuJoCo
train static linear policies for locomotion
gradients enable 5-10x sample-complexity improvement over derivative-free method
stability at low rates enables 2-5x sample-complexity improvement over MuJoCo
ContactNets: Learning Discontinuous Contact Dynamics with Smooth, Implicit Representations. S. Pfrommer, M. Halm, and M. Posa.
learned
ground-truth
real-word dataset
Dojo environment
geometry
friction coefficient
ground-truth
learned
Quasi-Newton method utilizes gradients to
learn parameters to 95% accuracy in 20 steps
Global Planning for Contact-Rich Manipulation via Local Smoothing of Quasi-dynamic Contact Models, Tao Pang∗, H.J. Terry Suh∗, Lujie Yang and Russ Tedrake,
RRT using the same smoothed derivatives as Dojo
Differentiable Physics Simulation of Dynamics-Augmented Neural Objects. S. Le Cleac'h, HX. Yu, M. Guo, T. Howell, R. Gao, J. Wu, Z. Manchester, M. Schwager
dynamics-augmented NeRF → complex collision geometries
Fast Contact-Implicit Model-Predictive Control.
S. Le Cleac'h & T. Howell, C. Lee, S. Yang, M. Schwager, Z. Manchester
simulation
push recovery
behavior generation
running policy at 200-500Hz
nonlinear complementarity problem (NCP)
impact force
nonlinear complementarity problem (NCP)
impact force
slack variable
nonlinear complementarity problem (NCP)
impact force
slack variable
impact force
slack variable
nonlinear complementarity problem (NCP)
linear complementarity problem (LCP)
preserve contact reasoning
→ adapt contact sequence online
preserve contact reasoning
→ adapt contact sequence online
computational gains
→ real-time performance
dynamics evaluation
dynamics gradient
solve LCP problem
differentiate LCP problem
contact-implicit trajectory optimization
contact-implicit MPC
hardware transfer
contact-implicit trajectory optimization
contact-implicit MPC
hardware transfer
contact-implicit trajectory optimization
contact-implicit MPC
hardware transfer
Taylor Howell
Simon Le Cleac'h
Jan Brüdigam
Zico Kolter
Mac Schwager
Zachary Manchester
Shuo Yang
Chi Yen Lee
august 9, 2022
Simon Le Cleac'h
a differentiable physics engine for robotics
Simon Le Cleac'h and Taylor Howell
Taylor Howell
Simon Le Cleac'h
thowell@stanford.edu
simonlc@stanford.edu
Dojo key ideas
minimal vs. maximal coordinates
benefits of differentiability
applications of Dojo
contact physics
LCP
implicit complementarity
gradients
samples
subgradient
stability at low rates
variational integrator
interior-point methods
accurate contact dynamics
implicit differentiation
smooth gradients
Discrete mechanics and variational integrators. J. E. Marsden and M. West.
discretize
discretize
Euler-Lagrange
Euler-Lagrange
compare astronaut energy and momentum conservation to MuJoCo
MuJoCo linear
Dojo linear
MuJoCo nonlinear
Dojo nonlinear
no collision violations
correct Coulomb friction
interior-point method
impact → inequalities
friction → second-order cone
non linear complementarity problem
nonlinear complementarity problem
custom interior-point solver
→
residual
solution
parameters
nonlinear complementarity problem
sensitivity of solution w.r.t problem data
computation cost of gradient is less than simulation step
differentiate intermediate barrier problems for smooth gradients
Lezioni di analisi infinitesimale. U. Dini.
less expensive to compute compared to finite-difference or stochastic sampling
randomized smoothing
finite difference
Dojo
matrix backward substitution
matrix factorization
matrix factorization
box push
non-smooth dynamics
gradient comparison
Dojo
randomized smoothing
less expensive to compute compared to finite-difference or stochastic sampling
Linear-Time Variational Integrators in Maximal Coordinates. J. Brudigam and Z. Manchester.
Linear-Time Contact and Friction Dynamics in Maximal Coordinates using Variational Integrators.
J. Brudigam and Z. Manchester.
smooth-gradient-based optimization with iterative LQR
stability at low rates enables 2-5x sample-complexity improvement over MuJoCo
train static linear policies for locomotion
gradients enable 5-10x sample-complexity improvement over derivative-free method
stability at low rates enables 2-5x sample-complexity improvement over MuJoCo
ContactNets: Learning Discontinuous Contact Dynamics with Smooth, Implicit Representations. S. Pfrommer, M. Halm, and M. Posa.
learned
ground-truth
real-word dataset
Dojo environment
geometry
friction coefficient
ground-truth
learned
Quasi-Newton method utilizes gradients to
learn parameters to 95% accuracy in 20 steps
contact-implicit trajectory optimization
contact-implicit MPC
hardware transfer
contact-implicit trajectory optimization
contact-implicit MPC
hardware transfer
contact-implicit trajectory optimization
contact-implicit MPC
hardware transfer
Fast Contact-Implicit Model-Predictive Control.
S. Le Cleac'h & T. Howell, C. Lee, S. Yang, M. Schwager, Z. Manchester
simulation
push recovery
behavior generation
running policy at 200-500Hz
Differentiable Physics Simulation of Dynamics-Augmented Neural Objects. S. Le Cleac'h, HX. Yu, M. Guo, T. Howell, R. Gao, J. Wu, Z. Manchester, M. Schwager
dynamics-augmented NeRF → complex collision geometries
Taylor Howell
Simon Le Cleac'h
Jan Brüdigam
Zico Kolter
Mac Schwager
Zachary Manchester
Shuo Yang
Chi Yen Lee
march 4, 2022
Simon Le Cleac'h and Taylor Howell
Simon Le Cleac'h and Taylor Howell
Dojo
MuJoCo
Dojo
MuJoCo
approximation
exact
task: push object up to goal
simonlc@stanford.edu (Simon)
thowell@stanford.edu (Taylor)