Antonin RAFFIN - Imad EL HANAFI
Understanding code
SublimeText
Git
Libraries :
MatplotLib
TKinder
http://Gitlab.ensta.fr
Step 1 :
Stupid agents : To understand the code
V value agent : Incremental and Batch
Q value agent : Incremental and Batch
Step 2 :
2D environment
Step 3 :
Temporal Differencing agent
Step 4 :
GUI environments
Tetris simple
2D with walls :
Tetris
Learning curves :
Qvalue : 2D - 20 cells
TD : 2D - 20 cells
Learning curves :
TD : 2D - static walls
TD : 2D -Moving walls
Learning curves on TETRIS simple :
TD : Tetris - 100actions - 3rows
TD : Tetris -100actions - 5rows
Learning curves on TETRIS simple :
TD : Tetris - 100actions - 3rows
TD : Tetris -100actions - 5rows
Learning curves on TETRIS simple :
TD : Tetris - 100actions - Bad Rewards
TD : Tetris -1000actions - 5rows
4 weeks project
Discovering