I’m trying to get timing information on the transcribed speech; IE when words were spoken. I was looking over the repo in github and I saw this:
This commit and the discussion around it seem to indicate that this feature has been implemented. It’s my first time looking at deepspeech though, and I’m not sure how to invoke this feature if it actually exists.
Any help would be much appreciated.