Q-Learning on Vacuum Cleaner
Francesco (Galt) Faloci
Q-Learning on Vacuum Cleaner
Q-Learning on Vacuum Cleaner
Q-Learning
Q-Learning on Vacuum Cleaner
Model Oriented
Training
Model
Class Set X
Training Set K
Class Set Y
Q-Learning on Vacuum Cleaner
Model-Free Oriented
Training
Set X
Strategy Set X+1
New Set X+1
Q-Learning on Vacuum Cleaner
Q (Stato, Azione)
Q t-1 (Stato, Azione) + Apprendimento (Stato, Azione)
Q t (Stato, Azione)
Q tnew (Stato, Azione) =
Gain tnew + [max Q (Stato tnew, Azione tnew) - Q t-1 (Stato, Azione) ]
Q-Learning on Vacuum Cleaner
Q-Learning on Vacuum Cleaner
Vacuum Cleaner
Q-Learning on Vacuum Cleaner
Vacuum Cleaner
Q-Learning on Vacuum Cleaner
Vacuum Cleaner
Q-Learning on Vacuum Cleaner
Vacuum Cleaner
Q-Learning on Vacuum Cleaner
Prova sul campo...
Grazie dell'Attenzione
</END>
Q-Learning on Vacuum Cleaner