-
OpenEdu MB-RL: MCTS, AlphaZero, MuZero
Apr 26, 2022
392
0
-
Copy of Title DRLRG
Apr 07, 2022
353
0
-
Title DRLRG
Feb 24, 2022
411
1
-
Copy of К анонсу Practical-RL ШАД
Jan 26, 2022
414
0
-
К анонсу Practical-RL ШАД
Jan 25, 2022
410
0
-
CEM
Jan 20, 2022
368
0
-
Copy of FTIAD RL #5 Policy Gradient
-
RL #2 Dynamic Programming
Nov 16, 2021
457
0
-
FTIAD RL #1 Intro, BC, CEM
Nov 09, 2021
564
0
-
AIRI report Interview
Oct 21, 2021
408
0
-
Exploration vs. Exploitation
Oct 19, 2021
369
0
-
deck
Sep 24, 2021
326
0
-
Intro in RL for Sber (Sk)
Aug 24, 2021
383
0
-
Bespoke shoe making (Russian)
Jun 30, 2021
437
0
-
Bespoke shoe making
Jun 21, 2021
467
0
-
MB-RL: LQR, iLQR, DDP
Apr 21, 2021
493
0
-
PI and VI (OZON)
Feb 17, 2021
531
0
-
Bayesian Exploration for Petroleum Industry
Feb 15, 2021
491
0
-
deck
Feb 03, 2021
415
0
-
Cross Entropy Method (OZON)
Jan 31, 2021
513
0
-
Bandit Algorithms
Jan 11, 2021
358
0
-
exam_presentation
Dec 08, 2020
634
0
-
Serbia_meeting_8_12_20
Dec 08, 2020
485
0
-
MB-RL: MCTS, AlphaZero, MuZero
Dec 04, 2020
522
0
-
Imitation and Inverse RL
Dec 01, 2020
337
0
-
Annual Review
Oct 22, 2020
495
0
-
Distributional RL
Oct 12, 2020
515
1
-
Micro-Learning Temirchev Pavel
Jun 09, 2020
581
0
-
Metamodelling colossal plans
Apr 13, 2020
480
0
-
Adv RL: RL as probabilistic inference
Mar 24, 2020
2,316
1
-
Intro to RL, lecture 2: Q-learning (ISP)
Jan 27, 2020
524
0
-
Intro to RL, lecture 1: Tabular RL (ISP)
Jan 27, 2020
533
0
-
Decomposition of Uncertainty in Bayesian Deep Learning
Oct 30, 2019
520
0
-
Dynamic Mode Decomposition
Oct 16, 2019
561
0
-
Intro to RL, part 1 (pizza-seminar)
Jul 10, 2019
565
0
-
(bayessem) Reinforcement Learning as Probabilistic Inference
May 17, 2019
594
0
-
(datafest) Reinforcement Learning as Probabilistic Inference
Mar 31, 2019
536
0
-
Model-Ensemble Trust Region Policy Optimization
Mar 29, 2018
542
0
-
Approximate Linear Programming for Markov Decision Processes
Oct 03, 2017
531
0
-
Curiosity-driven Exploration by Self-Supervised Prediction
Jun 06, 2017
691
0