Speech Project
Week-4 Progress Report
B02901054 方為
B02901085 徐瑞陽
Training Flow
Feature Extraction
Monophone
Equal align
即日起 簡章 備索 in lexicon.txt
Gmm align
即,起 的 '一' 拉很長 , '日' 是短音
tree
produced by gmm-init-mono?
Triphone
Convert-ali
decision tree <-> model
output is tree and model
same as gmm-init-mono
gmm-mixup
VS
gmm-est mixup
Meaning?
WFST
Viterbi
Expriment Report
# of Iters | # of gaussians | accuracy (%) |
---|---|---|
40 | 500 | 54.96 |
40 | 1000 | 54.64 |
40 | 1500 | 55.79 |
40 | 2000 | 55.1 |
40 | 2500 | 53.98 |
40 | 3000 | 51.22 |
40 | 4000 | 55.9 |
Viterbi Triphone
change # of gaussians
# of Iters | # of gaussians | accuracy (%) |
---|---|---|
40 | 1000 | 54.64 |
70 | 1000 | 55.90 |
100 | 1000 | 55.36 |
Viterbi Triphone
change # of iters
Viterbi Monophone
# of iters | # of gauss | accuracy(%) |
---|---|---|
40 | 1000 | 47.52 |
70 | 1000 | 45.59 |
FST Monophone
# of iters | # of gauss | accuracy(%) |
---|---|---|
40 | 1000 | 34.32 |
FST Triphone
# of iters | # of gauss | accuracy(%) |
---|---|---|
40 | 1000 | 31.86 |
8 hours ...= =" ,
monophone is 2 hours,
but still lower than monophone!?
Problems
we encountered
Gaussian has
too little data...OAO
The only difference
scp file for reading in feature extraction
Is it ok?
Sth weird happened
in new workstation...
SpeechProject-week4
By sunprinces
SpeechProject-week4
- 401