Intermediate layer (BiRNN) output for an audio file

How can I extract the intermediate layer (BiRNN) embedding of an audio file from a trained DeepSpeech model?

The easiest way to get started is to add the layer of interest, e.g. layers['layer_5'], to the list passed as the first argument of session.run, and add a corresponding variable on the left-hand side of the = to receive the fetched value.
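To illustrate the idea outside of the DeepSpeech codebase, here is a minimal NumPy sketch (not DeepSpeech's actual code): a forward pass that returns the intermediate activation alongside the final output, analogous to adding layers['layer_5'] to the fetch list of session.run. All shapes and names below are made up for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

def forward(features, w_hidden, w_out):
    """Toy two-layer network returning (logits, embedding).

    Returning the intermediate activation as well mirrors adding an
    extra tensor to session.run's fetch list in DeepSpeech.py.
    """
    embedding = np.tanh(features @ w_hidden)   # intermediate layer output
    logits = embedding @ w_out                 # final layer output
    return logits, embedding

# Toy shapes: 10 time steps of 26 MFCC-like features -> 64-dim embedding.
features = rng.standard_normal((10, 26))
w_hidden = rng.standard_normal((26, 64))
w_out = rng.standard_normal((64, 29))          # 29 output classes, for example

logits, embedding = forward(features, w_hidden, w_out)
print(embedding.shape)  # (10, 64)
print(logits.shape)     # (10, 29)
```

In the real model you would fetch the layer tensor by name rather than recompute anything, so the extra fetch adds essentially no cost to the run.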


Is there code to run inference on an audio file with a DeepSpeech model without using the deepspeech binary?

Sure, have a look at FLAGS.one_shot_infer in the DeepSpeech.py file.
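For reference, that flag takes the path to a WAV file. A hypothetical invocation might look like the following; the checkpoint and alphabet paths are placeholders, and the exact set of required flags varies across DeepSpeech versions, so check the flag definitions in your checkout of DeepSpeech.py:

```shell
# Run single-file inference from a training checkpoint, bypassing the
# native client binary (paths are placeholders).
python DeepSpeech.py \
  --checkpoint_dir /path/to/checkpoint \
  --alphabet_config_path data/alphabet.txt \
  --one_shot_infer /path/to/audio.wav
```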