Batching during inference

Hello,
Does native_client.py support inference on more than one audio file at a time? I'm looking to use my GPUs for inference and to optimize utilization by batching requests from multiple audio files.


No, it does not.

However, such batching is a good idea and would also be useful for server-based STT systems, which could batch all requests arriving within some time window onto the GPU at the same time.
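
A minimal sketch of what such time-window micro-batching could look like on the server side. This is not part of native_client.py; `run_batch_inference` here is a hypothetical stand-in for a batch-capable model call, which DeepSpeech's current API does not provide:

```python
# Hypothetical time-window micro-batcher for a server-based STT system.
import queue
import threading
import time

BATCH_WINDOW_S = 0.05   # collect requests for up to 50 ms
MAX_BATCH_SIZE = 16     # cap the batch to fit GPU memory

request_queue = queue.Queue()

def run_batch_inference(audio_buffers):
    # Placeholder: a real implementation would pad the buffers to equal
    # length and run a single batched forward pass on the GPU.
    return ["<transcript>" for _ in audio_buffers]

def batching_worker():
    while True:
        # Block until the first request arrives, then open the time window.
        batch = [request_queue.get()]
        deadline = time.monotonic() + BATCH_WINDOW_S
        while len(batch) < MAX_BATCH_SIZE:
            remaining = deadline - time.monotonic()
            if remaining <= 0:
                break
            try:
                batch.append(request_queue.get(timeout=remaining))
            except queue.Empty:
                break
        audio_buffers, result_slots = zip(*batch)
        transcripts = run_batch_inference(list(audio_buffers))
        # Hand each transcript back to the caller that submitted it.
        for slot, text in zip(result_slots, transcripts):
            slot.put(text)

def transcribe(audio_buffer):
    """Called by request handlers; blocks until the transcript is ready."""
    result_slot = queue.Queue(maxsize=1)
    request_queue.put((audio_buffer, result_slot))
    return result_slot.get()

threading.Thread(target=batching_worker, daemon=True).start()
```

The window length trades latency for utilization: a longer window collects larger batches and keeps the GPU busier, but every request in the batch waits at least that long before inference starts.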
