Planning through Contact by
Smoothing Quasidynamic Models
Pang, Terry, Lu
High-level Story
Previous Story:
Better RRT by Mode Smoothing
New Story, Candidate 1:
Smooth Quasidynamic Systems for More Informative Local Models
New Story, Candidate 2:
Equivalence of Randomized and Analytic Smoothing
High-level: We understood RL solves complex contact problems by randomized smoothing. We establish this is equivalent to analytic smoothing, and show we can do faster computation by leveraging structure.
High-level: Planning through Contact is difficult because gradient-based local models are not useful. We show smoothing and quasidynamic formulations provide more informative local models of dynamics, which we leverage for trajopt / rrt.
High-level: Model-based solutions for RRT has been stuck because of mode considerations. We show that by smoothing out these modes, we can achieve much more scalable performance.
-
More focus on why quasidynamic models and smoothing lead to better locally linear models.
-
Emphasize contribution on smooth formulations of optimization-based quasidynamic models.
-
More focus on structure, gradients, etc.
-
Less focus on specific algorithms (trajopt, rrt, etc.)
-
More focus on how previous algorithms have focused explicitly on modes and how smoothing achieves successful abstraction without combinatorial blowup.
-
Emphasize contribution on algorithm and performance.
-
Less emphasis on quasidynamic models, more emphasis on equivalence between smoothing schemes. More "RL" story.
-
Bit of an "underdog" story - now that we understand what's good about RL, we can achieve better performance with traditional methods by incorporating the lessons.
-
But might need to compare with RL (but this is kind of difficult given that no RL methods solves these problems under a minute.)
-
Less specific algorithms
How we want to improve the paper from here is highly coupled with what story we want to ultimately tell.
Important Questions: How to include Trajopt
Candidate 1 (current version):
RRT + Projection, RRT + Trajopt
Candidate 2:
Just present RRT + Projection
Candidate 3:
Separate section for comparing Trajopt, RRT only exists with Projection.
-
Free from arguing some of the difficult questions that we don't necessarily want to argue. (Russ's question, why does local metric help trajopt. Current Answer: it helps for some systems but doesn't for others.)
-
The contribution might seem quite algorithm-specific (i.e. another RRT paper).
-
Less work to do, we simply attack local trajopt based on how it's hard to deal with global optimization.
-
Feels like we haven't committed ourselves to one of the variants
-
RRT + Trajopt has trends that are slightly harder to explain because of local metric only looks one-step, while trajopt extends multi-step.
-
Comparison between the two variants is quite expected, nothing new here.
-
Shows that the contribution is 'agnostic' of the method chosen, as long as the method relies on some local model of the dynamics. (People have pointed out that smoothing helps traj-opt, but we could focus on how quasi-dynamic and our choices of A and B matrices help.)
-
Free from arguing some of the difficult questions that we don't necessary want to argue. (Russ's question, why does local metric help trajopt)
Experiment Design
Experiments designed according to points we want to make.
Argument 1
Using the Mahalanobis metric for the nearest step is better for growing the tree.
Argument 2
Analytic and Randomized smoothing results in similar performances, with analytic resulting in faster compute time.
Argument 3
Smoothing is necessary compared to using exact gradients.
Progress Updates: Analytic Smoothing
Previous formulation w/ pyramid friction cone.
Log-barrier formulation of the constraint.
t (log-barrier strength) acts as the smoothing parameter, where larger t is makes the approximation sharper.
Formulation into exponential cone, solver now uses MOSEK.
Equivalence of Smoothing Schemes
For simple examples, we can ask: what is the equivalent probability kernel that results in analytical smoothing?
Leaving out some of the constants, the obtained distribution is
some variant of the heavy-tailed Cauchy distribution,
Which can be more generally written in elliptical form
Back to our favorite system of wall against block.
Planar Hand Comparison
Running time vs. Goal for different methods. (note: this evaluation was made on pyramid cone).
Method | Running time | Distance to Goal |
---|---|---|
Analytic | 08.8628s | 0.0811 |
Randomized 100 | 19.0919s | 1.0885 |
Randomized 50 | 14.7371s | 1.8158 |
Randomized 10 | 09.3223s | 20.0578 |
1000 iterations
Projection
Not only theoretically satisfying, but results in less compute time thanks to MOSEK's cone solver being comparable with (but still slower than) Gurobi's QP solver.
Trajopt
The trend seems not so clear anymore. Randomized 10 seems to perform just as well as analytic with the same amount of wall-clock time when used as a subroutine for RRT.
Perhaps it's better to separately show that individual instances of trajopt achieve lower cost, rather than combining with RRT.
TODOs
- SOCP Formulation of the friction cone.
- A matrix.
Circular friction cones
Pyramidal friction cones
Barrier formulation (unconstrained)
This can be modeled with the exponential cone and solved with Mosek.
This is convex, but I'm not sure how to tell Mosek about its convexity...
...So we wrote our own (naive) Newton's method and it's faster than mosek.
Central Path
- We do the centering step once, for a fixed t, e.g. \(t=100\)
- Mosek/Gurobi solves the centering step for many \(t\), from small to large.
Barrier Formulation
log barrier function
The A matrix struggle
- In the bundled dynamics paper, we said
- But in practice, for quasistatic systems, we found that using \(\mathbf{\hat{A}}_t = I\) works pretty well too.
- \(I\) is the gradient when there's no contact.
- Not computing A has computational benefits:
- Derivatives of the contact points and contact normal are NOT needed.
- Derivatives of the contact Jacobians are not needed.
Russ_Update_03_25
By Terry Suh
Russ_Update_03_25
- 73