Sept 11, 2025
Adam Wei
Part 1
Sweeps and Performance Improvements
North Star Goal: Train with internet scale data (Open-X, AgiBot, etc)
Stepping Stone Experiments:
\(\sigma_{min}^i = \inf\{\sigma\in[0,1]: c_\theta (x_\sigma, \sigma) > 0.5-\epsilon\}\)
\(\implies \sigma_{min}^i = \inf\{\sigma\in[0,1]: d_\mathrm{TV}(p_\sigma, q_\sigma) < \epsilon\}\)*
* assuming \(c_\theta\) is perfectly trained
Performance is sensitive to classifier and \(\sigma_{min}\) choice!
\(\sigma_{min}^i = \inf\{\sigma\in[0,1]: c_\theta (x_\sigma, \sigma) > 0.5-\epsilon\}\)
\(\implies \sigma_{min}^i = \inf\{\sigma\in[0,1]: d_\mathrm{TV}(p_\sigma, q_\sigma) < \epsilon\}\)*
Part 2
Scaling and Algorithmic Issues
Hypothesis: As sim data increases, ambient diffusion approaches "sim-only" training, which harms performance.
\(|\mathcal{D}_S|=500\)
\(|\mathcal{D}_S|=4000\)
Thought experiment
\(\mathcal{X} = \{-1, 0, 1\}\)
Choosing \(\sigma_{min}\) per dataset
Classifier assigns some \(\sigma_{min}\) to all datapoints from \(q_0\)
\(p_0\) samples 0 and 1 w.p. 0.5
\(q_0\) samples -1 and 0 w.p. 0.5
Choosing \(\sigma_{min}\) per datapoint
Classifier assigns \(\sigma_{min}=0\) to all 0's in \(q_0\) and \(\sigma_{min}=1\) to all -1's
\(\implies\) model sees more 0's during training \(\implies\) heavily biased model!
Choosing \(\sigma_{min}\) per "bucket"
"Soft" Ambient Diffusion
Part 3
Locality and \(\sigma_{max}\)
Ambient Diffusion: "use low-quality data at high noise levels"
Ambient Diffusion Omni: "use OOD with local similarity at low noise levels"
Intuition
Therefore, we can use OOD data to learn local features at low noise levels
Which photo is the cat?
Which photos is the cat?
Which photo is the cat?
Ambient Diffusion Omni: "use OOD with local similarity at low noise levels"
Intuition: If the receptive field at \(\sigma_{max}\) is sufficiently small that you cannot distinguish cats from dogs, then you can use cats to train a generator for dogs
Repeat:
\(\sigma=0\)
\(\sigma_{max}\)
Corrupt Data (\(\sigma\geq\sigma_{min}\))
Clean Data
Corrupt Data (\(\sigma\leq\sigma_{max}\))
\(\sigma_{min}\)
Locality in robotics:
Experiment:
(equivalent to cats vs dogs experiment in ambient diffusion omni)