Thank you lissyx for the quick response.
I am working on a prototype which will need an ASR function for Mandarin.
I am trying to train a model based on the DeepSpeech and this data set
Given an awesome work has been done by yuwu that provides the train materials for the above data set needed by DeepSpeech
http://blog.yuwu.me/wp-content/uploads/2018/07/thchs30-csv.tar.gz
I am reusing these materials (alphabet.txt, vocabulary.txt, words.arpa, lm.binary and the trie) to train the model for a quick testing now.
I was able to train the model to reduce the loss to less than 50 by using the latest master branch of DeepSpeech. But when it is ready to exit the training and do test, it throws the following exception
Error: Can’t parse trie file, invalid header. Try updating your trie file.
I guess the trie from yuwu’s result may be out of date, so I build the generate_trie by following https://github.com/mozilla/DeepSpeech/blob/master/native_client/README.md
And then I use the generate_trie command to generate the new trie based on the above yuwu’s alphabet.txt and lm.binary, the new generated trie is only 9 bytes, I don’t know what is wrong. May be the lm.binary is also out of date, I may need regenerate lm.binary as well. But I have not give that a try.
I am wondering if you guys can give me some advice if that is on the correct direction before I try to regenerate the lm.binary.
Thanks