I am using Deepspeech 0.5.1. My use case is to recognise few voice commands (mainly digits and few other words). I created custom LM for my commands. Here is what I did for it:
Vocabulary.txt (containing the command to be recognised):
one
two
three
four
five
six
seven
eight
nine
yes
no
tell me options
need help
Generate LM
~/terminal/kenlm/build/bin/lmplz --text vocabulary.txt --arpa words.arpa --order 5 --discount_fallback --temp_prefix /tmp/
# Generate lm
~/terminal/kenlm/build/bin/build_binary -T -s words.arpa lm.binary
# Generate trie
~/terminal/repository/DeepSpeech/generate_trie alphabet.txt lm.binary trie
Problem :
it’s working reasonably well, it does recognise the single digits and other sentences in vocabulary.txt. However it does not work well when I speak multiple digits together : e.g. “four nine seven” … It misses one or more words and mostly gives only single word output (though sometime multiple words do come).
Am I doing something wrong? How to improve results.