Q-Learning on Vacuum Cleaner

Progetto di

Francesco (Galt) Faloci

"Interactive intelligent devices, systems and environments"

Q-Learning on Vacuum Cleaner

Q-Learning on Vacuum Cleaner

Q-Learning

Q-Learning on Vacuum Cleaner

Model Oriented

Training

Model

Class Set X

Training Set K

Class Set Y

Q-Learning on Vacuum Cleaner

Model-Free Oriented

Training

Set X

Strategy Set X+1

New Set X+1

Q-Learning on Vacuum Cleaner

Q (Stato, Azione)

Q t-1 (Stato, Azione) + Apprendimento (Stato, Azione)

Q t (Stato, Azione)

Q tnew (Stato, Azione) =
Gain tnew + [max Q (Stato tnew, Azione tnew) - Q t-1 (Stato, Azione) ]

Q-Learning on Vacuum Cleaner

Q-Learning on Vacuum Cleaner

Vacuum Cleaner

https://github.com/aimacode/aima-python

Q-Learning on Vacuum Cleaner

Vacuum Cleaner

Q-Learning on Vacuum Cleaner

Vacuum Cleaner

Q-Learning on Vacuum Cleaner

Vacuum Cleaner

Q-Learning on Vacuum Cleaner

Prova sul campo...

Grazie dell'Attenzione

</END>

Q-Learning on Vacuum Cleaner