MODEL FREE
MODEL FREE
MODEL BASED
Model
TD error
PROBLEMS
PROBLEMS
Gaussian Process!
SOLUTION
Model
TD error
This is what we need:
with a GP
s
a
s'
Plot shows deterministic mean, but GP handles stochastic transitions!
s
a
s'
Iterating over parents equals slicing GP at a given height
matplotlib screws the projection of
overlapping surfaces, sorry!
Taking into account uncertainty, it'd look like this