I am interested in creating a node.js application that takes user input and transcribes it. The end goal is to create an application that is completely voice controlled. I was researching utilities for this and I support and appreciate deepspeech by mozilla.
A Few Questions
- I installed deepspeech using npm install deepspeech, is this all I need?
- What is the optimal format and way to receive user audio from the client side- js?
3.On the Server Side whats the best way to recieve the data how would I transcribe the audio and post it back to the user?
This is pretty much the basic functionality for this project that I would like to test out, A homepage that you speak into and get back the transcribed words using deepspeech, any references to helpful tutorials and documentation would be greatly appreciated.