Hi,
I try to train the french common voice data set but the result is very bad.
My command
./DeepSpeech.py --dev_files /data/clips/dev.csv --test_files /data/clips/test.csv --train_files /data/clips/train.csv --train_batch_size 80 --dev_batch_size 80 --test_batch_size 40 --n_hidden 375 --epoch 100 --dropout_rate 0.22 --learning_rate 0.00095 --report_count 100 --use_seq_length False --checkpoint_dir /data/checkpoints --export_dir /data/models --alphabet_config_path /data/alphabet.txt 2>&1 | tee output.log
Result
WER: 1.000000, CER: 23.000000, loss: 2.552169
- src: “je le retire monsieur le président”
- res: “jelâreziâierezi”
WER: 1.000000, CER: 7.000000, loss: 2.598004
- src: “quel aveu”
- res: “lee”
WER: 1.000000, CER: 70.000000, loss: 2.705939
- src: “je suis saisi de deux amendements identiques numéros deux cent quarantecinq et trois cent soixantecinq”
- res: “jeiâieeâeneneâiqerâenqârâneineinziâneinq”
WER: 1.000000, CER: 8.000000, loss: 2.796613
- src: “rouge vif”
- res: “i”
WER: 1.000000, CER: 11.000000, loss: 2.822357
- src: “quatre grands”
- res: “ree”
WER: 1.000000, CER: 11.000000, loss: 2.837523
- src: “ça vous plait”
- res: “all”
WER: 1.000000, CER: 14.000000, loss: 2.878527
- src: “vous l’avez remarqué”
- res: “nlâzezeârq”
One more thing, how to create a vocabulary.txt with this kind of dataset ?
thanks