cirquit
PhD student with a focus on machine learning, distributed systems and functional programming.
Introduction
1
Introduction
2
Fundamentals
3
Supervised Learning
4
Mammal
Not a mammal
Mammal?
Supervised Learning
5
x1
x2
0
1
0
1
x1 | x2 | ∧ |
---|---|---|
0 | 0 | 0 |
0 | 1 | 0 |
1 | 0 | 0 |
1 | 1 | 1 |
∧ = 0
∧ = 1
0
0
0
1
Supervised Learning
6
x1
x2
0
1
0
1
h < 0
h >= 0
How to learn such a function?
Supervised Learning
7
Supervised Learning
8
< 0: 0 (false)
>= 0: 1 (true)
Supervised Learning
9
0 = false
1 = true
Input:
Randomly choosen weights:
x1
x2
0
1
0
1
Supervised Learning
10
x_1 | x_2 | y | ^y | C |
---|---|---|---|---|
0 | 0 | 0 | 0 | 0 |
0 | 1 | 0 | 1 | -1 |
1 | 0 | 0 | 1 | -1 |
1 | 1 | 1 | 1 | 0 |
x1
x2
0
1
0
1
-1
-1
Supervised Learning
11
-1
0
0
1
Update rule:
Supervised Learning
12
Misclassified:
Update rule:
Learning rate:
Weights:
Supervised Learning
13
x1
x2
0
1
0
1
Supervised Learning
14
x1
x2
0
1
0
XOR
1
Supervised Learning
15
13% #0
0% #1
5% #2
1% #3
67% #4
2% #5
2% #6
3% #7
3% #8
4% #9
Supervised Learning
16
13% #0
0% #1
5% #2
1% #3
67% #4
2% #5
2% #6
3% #7
3% #8
4% #9
Supervised Learning
17
b/w pixel data
13% #0
0% #1
5% #2
1% #3
67% #4
2% #5
2% #6
3% #7
3% #8
4% #9
Supervised Learning
18
Index
b/w
Supervised Learning
19
b/w pixel data
13% #0
0% #1
5% #2
1% #3
67% #4
2% #5
2% #6
3% #7
3% #8
4% #9
Supervised Learning
20
Supervised Learning
21
https://handong1587.github.io/deep_learning/2015/10/09/fun-with-deep-learning.html
Supervised Learning
22
https://github.com/luanfujun/deep-photo-styletransfer
Supervised Learning
23
Normal text
Randomly generated text
Music
https://deepmind.com/blog/wavenet-generative-model-raw-audio/
Unsupervised Learning
24
Feature 1
Feature 2
Feature 1
Feature 2
Unknown structure
Known structure
Unsupervised Learning
25
Feature 1
Feature 2
Unsupervised Learning
26
Feature 1
Feature 2
Unsupervised Learning
27
Feature 1
Feature 2
Unsupervised Learning
28
Feature 1
Feature 2
Unsupervised Learning
29
Feature 1
Feature 2
Unsupervised Learning
30
Feature 1
Feature 2
Unsupervised Learning
31
Feature 1
Feature 2
Unsupervised Learning
32
Unsupervised Learning
33
http://practicalquant.blogspot.de/2013/10/semi-automatic-method-for-grading-a-million-homework-assignments.html
Reinforcement Learning
34
Environment
Agent
Action
State
Reward
Reinforcement Learning
35
Environment
Agent
Action
State
Reward
Reinforcement Learning
36
Environment
Agent
Action
State
Reward
Reinforcement Learning
37
Environment
Agent
Action
State
Reward
Reinforcement Learning
38
Environment
Agent
Action
State
Reward
Reinforcement Learning
39
Environment
Agent
Action
State
Reward
Reinforcement Learning
40
Environment
Agent
Action
State
Reward
Reinforcement Learning
41
Environment
Agent
Action
State
Reward
Reinforcement Learning
42
Environment
Agent
Action
State
Reward
Reinforcement Learning
43
Environment
Agent
Action
State
Reward
Represents the quality of an action in the current state, while continuing to play optimally from that point on
Reinforcement Learning
44
Problem: How to construct such a Q function?
Reinforcement Learning
45
Maximal reward is defined as immediate reward + maximum future reward for next state
Reinforcement Learning
46
Reinforcement Learning
47
Reinforcement Learning
48
NN
Reinforcement Learning
49
Experience Replay
Exploration - Exploitation
Slides adapted from excellent tutorial
https://www.nervanasys.com/demystifying-deep-reinforcement-learning/
Reinforcement Learning
50
https://yanpanlau.github.io/2016/10/11/Torcs-Keras.html
Reinforcement Learning
51
Perceptron
Backpropagation
Neural Nets
Supervised Learning
MNIST Dataset
Linear Separable
Unsupervised Learning
Clustering
K-Means
Distance Measures
Convergence
RL Learning
Policy
Q-Learning
Discount Rate
TORCS
Reinforcement Learning
52
By cirquit