Certainty Equivalent Perception-Based Control

Sarah Dean and Benjamin Recht

UC Berkeley

L4DC, 8 June 2021

Problem setting

Robust reference tracking with linear dynamics and nonlinear partial observation

$\displaystyle \min_{\pi}~~\mathrm{cost}$
$\text{s.t.}~~x_{t+1} = A x_t + B u_t$
$z_{t} = g(Cx_t)$
$u_{t} = \pi(z_{0:t}, x^\mathrm{ref}_{0:t})$

$\displaystyle\mathrm{cost} = \sup_{\substack{t\geq 0\\\mathbf x^\mathrm{ref} \in \mathcal R\\ \|x_0\|\leq \sigma_0}}\left\|\begin{bmatrix} Q (x_t - x_t^\mathrm{ref})\\ Ru_t \end{bmatrix}\right\|_\infty$

Assumption 1: $A,B,C$ and $Q,R$ are known and well posed

Assumption 2: $\mathcal R$ encodes a bounded radius of operation

Assumption 3: Invertible $h(g(y)) = y$ and $g,h$ continuous

Assumption 4: Noisy training signal $y^\mathrm{train}_{t} = Cx_t+\eta_t$

Transform to linear output feedback problem with $h$

$y_t = h(z_t) = Cx_t$

$\displaystyle \min_{\mathbf K}~~\mathrm{cost}$
$\text{s.t.}~~x_{t+1} = A x_t + B u_t$
$u_t = \mathbf K(y_{0:t}, x^\mathrm{ref}_{0:t})$

$\pi_\star(z_{0:t}, x^\mathrm{ref}_{0:t}) = \mathbf K_\star (h(z_{0:t}), x^\mathrm{ref}_{0:t})$

Certainty equivalent controller $\widehat \pi(z_{0:t}, x^\mathrm{ref}_{0:t}) = \mathbf K_\star (\widehat h(z_{0:t}), x^\mathrm{ref}_{0:t})$
where $\widehat h$ is learned from data
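
To make the reduction concrete, here is a minimal Python sketch of the loop $z_t = g(Cx_t)$, $y_t = h(z_t)$, $u_t = \mathbf K(y_{0:t}, x^\mathrm{ref}_{0:t})$. Everything numeric is an assumption for illustration and not taken from the talk: a double integrator for $A, B, C$, the pair $g = \sinh$, $h = \operatorname{arcsinh}$, and a hand-tuned observer plus state-feedback controller standing in for $\mathbf K_\star$ (it is stabilizing, not optimal). The cost is evaluated along one trajectory only, not as the supremum over all references in $\mathcal R$ and all initial states.

```python
import numpy as np

# Hypothetical double-integrator example; these values are not from the talk.
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[0.0], [0.1]])
C = np.array([[1.0, 0.0]])
Q = np.eye(2)
R = 0.1 * np.eye(1)

g = np.sinh      # assumed nonlinear observation map
h = np.arcsinh   # its exact inverse, so h(g(y)) = y

F = np.array([[4.0, 4.0]])    # state-feedback gain (places poles of A - BF at 0.8)
L = np.array([[1.0], [2.5]])  # observer gain (places poles of A - LC at 0.5)


def rollout(perception, x0, x_ref, T=200):
    """Run the closed loop for T steps and return (trajectory cost, final state)."""
    x = np.array(x0, dtype=float)
    x_hat = np.zeros(2)  # controller's internal state estimate
    cost = 0.0
    for t in range(T):
        z = g(C @ x)                   # nonlinear observation z_t = g(C x_t)
        y = perception(z)              # virtual linear output y_t
        u = -F @ (x_hat - x_ref[t])    # linear tracking controller
        stage = np.concatenate([Q @ (x - x_ref[t]), R @ u])
        cost = max(cost, np.max(np.abs(stage)))  # running sup of the inf-norm
        x_hat = A @ x_hat + B @ u + L @ (y - C @ x_hat)  # observer update
        x = A @ x + B @ u                                # true dynamics
    return cost, x


# Constant reference: hold position 1 with zero velocity.
x_ref = np.tile([1.0, 0.0], (200, 1))
cost, x_final = rollout(h, x0=[0.0, 0.0], x_ref=x_ref)
```

Passing a learned map in place of `h` as the `perception` argument gives the certainty equivalent version of the same loop.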

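[Figure: two block diagrams. First, a block labeled "dynamics & observation" with state $x_t$, input $u_t$, and output $z_t$, in feedback with $\mathbf K$ through $h$ and $y_t$. Second, a block labeled "linear dynamics" with state $x_t$, input $u_t$, and output $y_t$, in feedback with $\mathbf K$.]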
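
The certainty equivalent controller needs an estimate $\widehat h$ built from pairs $(z_t, y^\mathrm{train}_t)$. The sketch below is one possible way to do this, assuming a Nadaraya-Watson kernel regressor with a Gaussian kernel; the estimator, the bandwidth, the noise level, and the choice $g = \sinh$ are all illustrative assumptions and may differ from what the talk uses. It also reports the worst-case error of $\widehat h$ on a grid, which is the kind of pointwise error that enters the closed loop.

```python
import numpy as np

rng = np.random.default_rng(0)

g = np.sinh  # assumed observation map, for illustration only

# Training data: observations z = g(Cx) paired with noisy labels y_train = Cx + eta.
y_true = rng.uniform(-2.0, 2.0, size=2000)          # sampled values of Cx
eta = rng.uniform(-0.05, 0.05, size=y_true.shape)   # bounded noise (assumed level)
y_train = y_true + eta
z_train = g(y_true)


def h_hat(z, bandwidth=0.1):
    """Nadaraya-Watson estimate: kernel-weighted average of labels near z."""
    z = np.atleast_1d(z).astype(float)
    # Gaussian weights between each query point and each training observation.
    w = np.exp(-0.5 * ((z[:, None] - z_train[None, :]) / bandwidth) ** 2)
    return (w @ y_train) / w.sum(axis=1)


# Worst-case error on a grid inside the sampled region, against the true inverse.
z_test = g(np.linspace(-1.5, 1.5, 50))
max_err = np.max(np.abs(h_hat(z_test) - np.arcsinh(z_test)))
```

The grid is kept inside the region covered by training data; outside it the weights vanish and the estimate is not meaningful.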

Motivation

Perception-based optimal control

Problem setting

Main Result (Informal)

Related Work

Necessity of pointwise bounds

Nonparametric regression

Sampling with linear control

Closed-loop guarantees

Main Result

Simulation Experiments

Data-driven perception

Perception errors

Thanks for your attention!
