Jan 21, 2026
Adam Wei
1. Book keeping items
2. Real-world results
3. Motion planning experiments
4. Conditioning and ambient in the multi-task setting
robot teleop
simulation
Open-X
Open-X
Open-X
Variant of Ambient Omni
Cool Task!!
Cool Demo!!
Open-X
Magic Soup++: 27 Datasets
Custom OXE: 48 Datasets
Ambient policy on OOD objects. (2x speed)
Task completion =
  0.1 x [opened drawer]
+ 0.8 x [# obj. cleaned / # obj.]
+ 0.1 x [closed drawer]
Question: How to compute error bars?
\(\sigma=0\)
\(\sigma=1\)
"Clean" Data
\(\sigma=0\)
\(\sigma=1\)
"Corrupt" Data
"Clean" Data
... with some mixing ratio \(\alpha\)
\(\sigma=0\)
\(\sigma=1\)
\(\sigma_{min}\)
\(\sigma > \sigma_{min}\)
"Clean" Data
"Ambient"
\(\sigma=0\)
\(\sigma=1\)
\(\sigma_{max}\)
\(\sigma \leq \sigma_{max}\)
"Clean" Data
"Locality"
\(\sigma=0\)
\(\sigma=1\)
\(\sigma_{max}\)
\(\sigma \leq \sigma_{max}\)
"Clean" Data
"Locality"
\(\sigma > \sigma_{min}\)
"Ambient"
\(\sigma_{min}\)
Even the best policy right now is mediocre...
Best 50 demo policy: 83.0%
150 clean demos: 80.8%
Reweighted
(sample clean 50%)
Unweighted
(sample clean 0.06%)
\(\sigma=0\)
\(\sigma=1\)
\(\sigma_{min}\)
\(\sigma > \sigma_{min}\)
"Clean" Data
"Ambient"
Noise observation w.p. \(p\)
Produce model with noise level of observation
\(\sigma=0\)
\(\sigma=1\)
\(\sigma_{min}\)
\(\sigma > \sigma_{min}\)
"Clean" Data
"Ambient"
Noise observation w.p. \(p\)
Produce model with noise level of observation
Inference time:Â set noise level of observations to 0
Distribution shift: Low-quality, noisy trajectories
High Quality:Â
100 GCS trajectories
Low Quality:Â
5000 RRT trajectories
GCS
Success Rate
(Task-level)
Avg. Jerk^2
(Motion-level)
RRT
GCS+RRT
(Co-train)
GCS+RRT
(Ambient)
50%
Policies evaluated over 100 trials each
100%
7.5k
17k
91%
14.5k
98%
5.5k
Generate good (expensive)Â and bad (cheap)Â motion planning data in 20,000 environments
Goal: generate high-quality, collision-free trajectories in new environments
Set up
Â
Results
Primary Goal: To provide intuition for ambient diffusion in a simple environment
Secondary Goal: General neural motion planner
Clean data:
Corrupt data
Observation space: proprioception, target joint position
Action space: joint position commands
Trajopt
Success Rate
(Task-level)
Avg. Length
(Motion-level)
RRT
Trajopt + RRT
(Co-train)
Trajopt+RRT(Ambient)
30.9%
Policies evaluated over 1000 trials each
?
5.97
?
24.1%
Avg. Acc^2
(Motion-level)
2.75
?
11.24
7.32
38.5%
6.72
3.57