How many 'r' letters are in the word 'strawberry'?
Uhh, like 2?
Linear
Softmax
Output Probabilities
Output Embedding
+
+
Positional Encoding
Input Embedding
Positional Encoding
Add & Norm
Multi-Head Attention
Feed Forward
Masked Multi-Head Attention
Add & Norm
Multi-Head Attention
Multi-Head Attention
Multi-Head Attention
Add & Norm
Feed Forward
Add & Norm
Outputs (shifted right)
Add & Norm
Inputs
Linear
Softmax
Output Probabilities
Output Embedding
+
+
Positional Encoding
Input Embedding
Positional Encoding
Add & Norm
Multi-Head Attention
Feed Forward
Masked Multi-Head Attention
Add & Norm
Multi-Head Attention
Add & Norm
Feed Forward
Add & Norm
Outputs (shifted right)
Add & Norm
Inputs
Linear
Softmax
Output Probabilities
Output Embedding
+
+
Positional Encoding
Input Embedding
Positional Encoding
Add & Norm
Multi-Head Attention
Feed Forward
Masked Multi-Head Attention
Add & Norm
Multi-Head Attention
Add & Norm
Feed Forward
Add & Norm
Outputs (shifted right)
Add & Norm
Inputs
0.51 | 0.12 | 1 | 0 | 0 | 0.45 | 0.50 | 0.29 | ... | 0.77 |
---|
0.98 | 0.32 | 0.63 | 0.92 | 0.17 | 0 | 0.07 | 0.83 | ... | 1 |
---|
0.43 | 1 | 0.95 | 1 | 0.54 | 0.31 | 0.19 | 0 | ... | 0 |
---|
0.53 | 0.52 | 0.51 | 0.92 | 0.78 | 0.71 | 0.99 | 0.84 | ... | 0.91 |
---|
0.82 | 0.91 | 0.56 | 0.59 | 0.99 | 0.42 | 1 | 1 | ... | 0.72 |
---|
0.60 | 0.15 | 0.75 | 0.59 | 0.01 | 0.07 | 0 | 0.27 | ... | 0.33 |
---|
0.75 | 0.61 | 0.98 | 0.02 | 0 | 0.12 | 0.17 | 0.29 | ... | 0.57 |
---|
0.74 | 0.59 | 0.98 | 0.02 | 0.02 | 0.11 | 0.17 | 0.30 | ... | 0.56 |
---|
0.59 | 0.95 | 0 | 0.85 | 1 | 0.67 | 0.15 | 0.72 | ... | 0.25 |
---|
0.60 | 0.95 | 0 | 0.85 | 0.97 | 0.68 | 0.17 | 0.72 | ... | 0.23 |
---|
Linear
Softmax
Output Probabilities
Output Embedding
+
+
Positional Encoding
Input Embedding
Positional Encoding
Add & Norm
Multi-Head Attention
Feed Forward
Masked Multi-Head Attention
Add & Norm
Multi-Head Attention
Add & Norm
Feed Forward
Add & Norm
Outputs (shifted right)
Add & Norm
Inputs
0.75 | 0.61 | 0.98 | 0.02 | 0 | 0.12 | 0.17 | 0.29 | ... | 0.57 |
---|
0.23 | 0.15 | 0.79 | 0.09 | 0 | 0.18 | 0.82 | 0.58 | ... | 0.73 |
---|
0.35 | 0.01 | 0.09 | 0.38 | 0.22 | 0 | 0 | 0.99 | ... | 1 |
---|
sin(0) 0 |
cos(0) 1 |
sin(0) 0 |
cos(0) 1 |
... | ... | ... | ... | ... | ... |
---|
sin(1/1) 0 |
cos(1/1) 0.54 |
sin(1/10) 0.1 |
cos(1/10) 1 |
... | ... | ... | ... | ... | ... |
---|
sin(2/1) 0.91 |
cos(2/1) -0.41 |
sin(2/10) 0.2 |
cos(2/10) 0.98 |
... | ... | ... | ... | ... | ... |
---|
0.74 | 0.59 | 0.98 | 0.02 | 0.02 | 0.11 | 0.17 | 0.30 | ... | 0.56 |
---|
0.23 | 0.15 | 0.79 | 0.09 | 0 | 0.18 | 0.82 | 0.58 | ... | 0.73 |
---|
...
...
...
a0 | b0 | c0 | d0 | e0 | f0 | g0 | h0 | ... | z0 |
---|
x
x
x
x
x
x
x
x
x
a1 | b1 | c1 | d1 | e1 | f1 | g1 | h1 | ... | z1 |
---|
a0a1 | b0b1 | c0c1 | d0d1 | e0e1 | f0f1 | g0g1 | h0h1 | ... | z0z1 |
---|
=
+
+
+
+
+
+
+
+
+
=
final similarity result
21 | -3.2 | 14.2 | 15.7 | -7.1 | 0.1 | 3.5 | -2.9 | ... | 0.2 |
---|
0.994 | 0 | 0.001 | 0.005 | 0 | 0 | 0 | 0 | ... | 0 |
---|
S0 |
---|
Linear
Softmax
Output Probabilities
Output Embedding
+
+
Positional Encoding
Input Embedding
Positional Encoding
Add & Norm
Multi-Head Attention
Feed Forward
Masked Multi-Head Attention
Add & Norm
Multi-Head Attention
Add & Norm
Feed Forward
Add & Norm
Outputs (shifted right)
Add & Norm
Inputs
Linear
Softmax
Output Probabilities
Output Embedding
+
+
Positional Encoding
Input Embedding
Positional Encoding
Add & Norm
Multi-Head Attention
Feed Forward
Masked Multi-Head Attention
Add & Norm
Multi-Head Attention
Add & Norm
Feed Forward
Add & Norm
Outputs (shifted right)
Add & Norm
Inputs
Linear
Softmax
Output Probabilities
Output Embedding
+
+
Positional Encoding
Input Embedding
Positional Encoding
Add & Norm
Multi-Head Attention
Feed Forward
Masked Multi-Head Attention
Add & Norm
Multi-Head Attention
Add & Norm
Feed Forward
Add & Norm
Outputs (shifted right)
Add & Norm
Inputs
0.74 | 0.59 | 0.98 | 0.02 | 0.02 | 0.11 | 0.17 | 0.30 | ... | 0.56 |
---|
1x256
-0.012 | 0.001 | 0.02 | 0 | -0.1 | -0.002 | 0.15 | -0.05 | ... | 0.042 |
---|
1x50000
0% | 0% | 0% | 0% | 0% | 0% | 35% | 0% | ... | 1% |
---|
Linear
Softmax
1x50000
0% | 0% | 0% | 0% | 0% | 0% | 35% | 0% | ... | 1% |
---|
6
5
4
3
2
1
0
7
49999