I’m testing out DeepSpeech with 8khz audio and seeing very poor accuracy.
Anyone have general pointers on how to improve this?
Should I train my own model (if so, what’s the recommended dataset size?)
Are there other approaches to working with DeepSpeech and 8khz audio?