Reinforcement Learning

(Part 2)

MIT 6.421:

Robotic Manipulation

Fall 2023, Lecture 20

Follow live at https://slides.com/d/HoT1aag/live

(or later at https://slides.com/russtedrake/fall23-lec20)

Lecture 20: Reinforcement Learning (part 2)

By russtedrake

MIT Robotic Manipulation Fall 2023 http://manipulation.csail.mit.edu

Roboticist at MIT and TRI

people.csail.mit.edu/russt

Reinforcement Learning

Beware "artificial" discontinuities

Smoothing with stochasticity

Smoothing with stochasticity for Multibody Contact

Do Differentiable Simulators Give Better Policy Gradients?

Randomized smoothing

Lessons from stochastic optimization

Example: The Heaviside function

What about smooth (but stiff) approximations?

First-order estimates can also have high variance

First-order estimates can also have high variance

Is stochasticity essential?

Deterministic smoothing - force at a distance
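
The Heaviside example named in the outline can be sketched numerically. The code below is a minimal illustration, not taken from the slides: it assumes Gaussian smoothing with standard deviation sigma, and all function names are hypothetical. It compares a zeroth-order (function-value-only) estimate of the gradient of the smoothed objective E[H(x + sigma*w)] against a first-order estimate that averages pointwise derivatives. Because the derivative of the Heaviside function is zero almost everywhere, the first-order estimate is zero for any sample, while the smoothed objective has the nonzero analytic gradient pdf(x/sigma)/sigma.

```python
import math
import random


def heaviside(x):
    """Step function: 0 for x < 0, 1 for x >= 0."""
    return 1.0 if x >= 0.0 else 0.0


def heaviside_derivative(x):
    """Pointwise derivative of the step: zero almost everywhere.

    The jump at x = 0 carries no finite derivative, and a sampled
    point lands exactly on it with probability zero, so we return 0.
    """
    return 0.0


def exact_smoothed_gradient(x, sigma):
    """Analytic gradient of E[H(x + sigma*w)], w ~ N(0, 1).

    The smoothed objective is the Gaussian CDF Phi(x / sigma), so its
    derivative is the Gaussian density evaluated at x / sigma, over sigma.
    """
    z = x / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def zeroth_order_gradient(x, sigma, num_samples, rng):
    """Zeroth-order estimate using only function values.

    Averages (H(x + sigma*w) - H(x)) * w / sigma; subtracting the
    baseline H(x) does not change the mean but reduces variance.
    """
    baseline = heaviside(x)
    total = 0.0
    for _ in range(num_samples):
        w = rng.gauss(0.0, 1.0)
        total += (heaviside(x + sigma * w) - baseline) * w / sigma
    return total / num_samples


def first_order_gradient(x, sigma, num_samples, rng):
    """First-order estimate: average of pointwise derivatives."""
    total = 0.0
    for _ in range(num_samples):
        w = rng.gauss(0.0, 1.0)
        total += heaviside_derivative(x + sigma * w)
    return total / num_samples


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed, an arbitrary choice for repeatability
    x, sigma, n = 0.0, 0.1, 20000
    print("analytic    :", exact_smoothed_gradient(x, sigma))
    print("zeroth-order:", zeroth_order_gradient(x, sigma, n, rng))
    print("first-order :", first_order_gradient(x, sigma, n, rng))
```

At x = 0 with sigma = 0.1 the analytic gradient is 1/(0.1*sqrt(2*pi)), roughly 3.99. The zeroth-order estimate approaches that value as the sample count grows, whereas the first-order estimate stays at zero regardless of how many samples are drawn, which is the bias the discontinuity introduces.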
