Chia-Sheng Chen
ASR
Text
Magic
Response
TTS
This Topic
"I don't feel good"
"Why do you say you do not feel good?" *
* generated by an AI created in the 1960s
Attention Is All You Need, NIPS '17, A. Vaswani et al.
Audio
Transformer
MFCC
dim = 39
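A 39-dimensional MFCC vector is commonly built as 13 static coefficients plus their first and second time derivatives (delta and delta-delta). Whether this deck uses that exact recipe is an assumption; the slide only states dim = 39. The sketch below shows just the stacking step. The input values are placeholders rather than real MFCCs, and `delta` and `stack_features` are hypothetical helper names.

```python
def delta(frames):
    """Central-difference delta over time, with the edge frames repeated."""
    padded = [frames[0]] + frames + [frames[-1]]
    return [
        [(nxt - prv) / 2.0 for prv, nxt in zip(padded[t], padded[t + 2])]
        for t in range(len(frames))
    ]


def stack_features(static):
    """Concatenate static, delta, and delta-delta coefficients per frame."""
    d1 = delta(static)
    d2 = delta(d1)
    return [s + a + b for s, a, b in zip(static, d1, d2)]


# Assumed input: 5 frames of 13 static coefficients (placeholder values).
static = [[float(t * 13 + i) for i in range(13)] for t in range(5)]
features = stack_features(static)
print(len(features), len(features[0]))  # frames, feature dimension
```

Real pipelines usually compute deltas with a regression window of several frames; the two-point difference here only keeps the example short.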
Multi-Head attention
d_model = 39
h = 3
d_k = d_v = 13
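The numbers above fit together as d_model = h × d_k: each of the 3 heads projects a 39-dim frame down to 13 dimensions, and concatenating the heads gives 39 again. A minimal sketch of scaled dot-product multi-head attention with these sizes follows, in plain Python so that it runs without dependencies. The weights are random placeholders, the 4-frame input is made up, and all function names are hypothetical; it is not the speaker's implementation.

```python
import math
import random

D_MODEL, H = 39, 3
D_K = D_MODEL // H  # 13, matching d_k = d_v on the slide


def matmul(a, b):
    """Multiply an (n x m) matrix by an (m x p) matrix, both nested lists."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def rand_matrix(rows, cols, rng):
    """Random placeholder weights; a trained model would supply these."""
    return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]


def attention_head(x, w_q, w_k, w_v):
    """One head: softmax(Q K^T / sqrt(d_k)) V, plus the attention weights."""
    q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
    k_t = [list(col) for col in zip(*k)]
    scores = matmul(q, k_t)
    weights = [softmax([s / math.sqrt(D_K) for s in row]) for row in scores]
    return matmul(weights, v), weights


def multi_head_attention(x, heads, w_o):
    """Run every head, concatenate per frame, apply the output projection."""
    outputs = [attention_head(x, *head)[0] for head in heads]
    concat = [sum((out[t] for out in outputs), []) for t in range(len(x))]
    return matmul(concat, w_o)


rng = random.Random(0)
# Each head owns three (39 x 13) projections: W_Q, W_K, W_V.
heads = [tuple(rand_matrix(D_MODEL, D_K, rng) for _ in range(3)) for _ in range(H)]
w_o = rand_matrix(H * D_K, D_MODEL, rng)

x = rand_matrix(4, D_MODEL, rng)  # 4 placeholder frames of 39-dim features
y = multi_head_attention(x, heads, w_o)
print(len(y), len(y[0]))  # frames in, same shape out
```

Setting d_model equal to the MFCC dimension means the audio frames can be fed to the attention layer without an extra embedding projection.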
Hidden Layer
d_hidden = 64
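Assuming the hidden layer here is the position-wise feed-forward block from the Transformer paper, it expands each 39-dim frame to 64 dimensions, applies a ReLU, and projects back to 39. The sketch below uses random placeholder weights and zero biases; `linear` and `feed_forward` are hypothetical names.

```python
import random

D_MODEL, D_HIDDEN = 39, 64  # values from the slide


def linear(x, w, b):
    """Compute x @ w + b for one vector x; w is (inputs x outputs)."""
    return [
        sum(xi * wi for xi, wi in zip(x, col)) + bj
        for col, bj in zip(zip(*w), b)
    ]


def feed_forward(x, w1, b1, w2, b2):
    """Expand to the hidden width, apply ReLU, project back to d_model."""
    hidden = [max(0.0, h) for h in linear(x, w1, b1)]
    return linear(hidden, w2, b2), hidden


rng = random.Random(0)
w1 = [[rng.gauss(0.0, 0.1) for _ in range(D_HIDDEN)] for _ in range(D_MODEL)]
w2 = [[rng.gauss(0.0, 0.1) for _ in range(D_MODEL)] for _ in range(D_HIDDEN)]
b1, b2 = [0.0] * D_HIDDEN, [0.0] * D_MODEL

frame = [rng.gauss(0.0, 1.0) for _ in range(D_MODEL)]  # one placeholder frame
out, hidden = feed_forward(frame, w1, b1, w2, b2)
print(len(hidden), len(out))  # hidden width, output width
```

The same weights are applied to every frame independently, so the layer does not mix information across time; that is left to the attention step.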
By qitar888