Hindi accent using deepspeech

bharath.vadithya · October 10, 2019, 2:01pm

How to train and evaluate on Hindi accent (speech to text).

There is an audio file in Hindi mixed with English (few words that are used common) now i need translate Hindi audio to English as text.
find the sentiment on the transcribed words

lissyx · October 10, 2019, 7:39am

@bharath.vadithya I’m not sure I understand your post here, are you asking for how to do the three items ? Are you exploring alternatives to support Hindi ? Please elaborate.

bharath.vadithya · October 10, 2019, 2:06pm

Hi @lissyx… I have edited the question… i need those two items to complete. yes i am exploring alternatives to support Hindi, so that i an train my model better to understand Hindi and can transcribe to English. Thanks

lissyx · October 10, 2019, 2:19pm

Do you want to analyze emotions ? This is out of the scope of what we do

I’m unsure here, you want to transcribe Hindi, with a few common words of English ?

bharath.vadithya · October 11, 2019, 3:37am

yeah i want to transcribe Hindi, with few common words of English.

bharath.vadithya · October 11, 2019, 3:42am

alright, I understand this !!

lissyx · October 11, 2019, 7:24am

Then you would need a training dataset in Hindi, I guess. And you could add the (expected?) english words in your language model?

bharath.vadithya · October 11, 2019, 9:25am

yeah now i got the training data… where i can add it … (I am new to this area)

lissyx · October 11, 2019, 12:29pm

Can you read the documentation ? This should be covered quite correctly.

bharath.vadithya · October 11, 2019, 12:30pm

where it is ?? the gihub one !!

lissyx · October 11, 2019, 12:45pm

github.com

mozilla/DeepSpeech/blob/master/TRAINING.rst

Training Your Own Model
=======================

Prerequisites for training a model
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^


* `Python 3.6 <https://www.python.org/>`_
* `Git Large File Storage <https://git-lfs.github.com/>`_
* Mac or Linux environment

Getting the training code
^^^^^^^^^^^^^^^^^^^^^^^^^

Install `Git Large File Storage <https://git-lfs.github.com/>`_ either manually or through a package-manager if available on your system. Then clone the DeepSpeech repository normally:

.. code-block:: bash

   git clone https://github.com/mozilla/DeepSpeech

This file has been truncated. show original

bharath.vadithya · October 11, 2019, 12:49pm

what about windows ??? actually i don’t have linux or mac

lissyx · October 11, 2019, 12:50pm

It may work but we don’t use that. It looks like TensorFlow GPU is supported there. That’s the best I can tell.

bharath.vadithya · October 11, 2019, 12:51pm

thanks @lissyx

bharath.vadithya · October 14, 2019, 5:35am

is there any ds_ctcdecoder for windows platform… the wheel file which is present in the github command line is linux supported.

lissyx · October 14, 2019, 7:18am

No, we don’t have that (yet?).

bharath.vadithya · October 14, 2019, 7:21am

github.com

mozilla/DeepSpeech/blob/master/TRAINING.rst#installing-python-dependencies

Training Your Own Model
=======================

Prerequisites for training a model
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^


* `Python 3.6 <https://www.python.org/>`_
* `Git Large File Storage <https://git-lfs.github.com/>`_
* Mac or Linux environment

Getting the training code
^^^^^^^^^^^^^^^^^^^^^^^^^

Install `Git Large File Storage <https://git-lfs.github.com/>`_ either manually or through a package-manager if available on your system. Then clone the DeepSpeech repository normally:

.. code-block:: bash

   git clone https://github.com/mozilla/DeepSpeech

This file has been truncated. show original

but the above link says that if i can prefer … i can build the wheel file from source… here is the guide link iam following below

github.com

mozilla/DeepSpeech/blob/master/native_client/README.rst


Building DeepSpeech Binaries
============================

If you'd like to build the DeepSpeech binaries yourself, you'll need the following pre-requisites downloaded and installed:


* `Mozilla's TensorFlow ``r1.14`` branch <https://github.com/mozilla/tensorflow/tree/r1.14>`_
* `General TensorFlow requirements <https://www.tensorflow.org/install/install_sources>`_
* `libsox <https://sourceforge.net/projects/sox/>`_

It is required to use our fork of TensorFlow since it includes fixes for common problems encountered when building the native client files.

If you'd like to build the language bindings or the decoder package, you'll also need:


* `SWIG >= 3.0.12 <http://www.swig.org/>`_
* `node-pre-gyp <https://github.com/mapbox/node-pre-gyp>`_ (for Node.JS bindings only)

Dependencies

This file has been truncated. show original

lissyx · October 14, 2019, 9:57am

I am unsure what you are referring to here. We don’t support building the decoder for Windows, period.

bharath.vadithya · November 14, 2019, 8:44am

hi @lissyx … i am trying to train my own model training. i got some problem and unable to figure out.

i guess some path error, but still i got this

‘[’ ‘!’ -f DeepSpeech.py ‘]’
python3 -u DeepSpeech.py --train_files minigir/miniger-train.csv --dev_files minigir/miniger-train.csv --test_files minigir/miniger-train.csv --train_batch_size 48 --dev_batch_size 40 --test_batch_size 40 --n_hidden 1024 --epochs 64 --early_stop True --es_steps 6 --es_mean_th 0.1 --es_std_th 0.1 --dropout_rate 0.30 s
Traceback (most recent call last):
File “/home/metlife-vad/.local/lib/python3.7/site-packages/pandas/core/indexes/base.py”, line 4736, in get_value
return libindex.get_value_box(s, key)
File “pandas/_libs/index.pyx”, line 51, in pandas._libs.index.get_value_box
File “pandas/_libs/index.pyx”, line 47, in pandas._libs.index.get_value_at
File “pandas/_libs/util.pxd”, line 98, in pandas._libs.util.get_value_at
File “pandas/_libs/util.pxd”, line 83, in pandas._libs.util.validate_indexer
TypeError: ‘str’ object cannot be interpreted as an integer

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File “/home/metlife-vad/DeepSpeech/util/text.py”, line 85, in text_to_char_array
transcript = np.asarray(alphabet.encode(series[‘transcript’]))
File “/home/metlife-vad/.local/lib/python3.7/site-packages/pandas/core/series.py”, line 1071, in getitem
result = self.index.get_value(self, key)
File “/home/metlife-vad/.local/lib/python3.7/site-packages/pandas/core/indexes/base.py”, line 4744, in get_value
raise e1
File “/home/metlife-vad/.local/lib/python3.7/site-packages/pandas/core/indexes/base.py”, line 4730, in get_value
return self._engine.get_value(s, k, tz=getattr(series.dtype, “tz”, None))
File “pandas/_libs/index.pyx”, line 80, in pandas._libs.index.IndexEngine.get_value
File “pandas/_libs/index.pyx”, line 88, in pandas._libs.index.IndexEngine.get_value
File “pandas/_libs/index.pyx”, line 131, in pandas._libs.index.IndexEngine.get_loc
File “pandas/_libs/hashtable_class_helper.pxi”, line 1607, in pandas._libs.hashtable.PyObjectHashTable.get_item
File “pandas/_libs/hashtable_class_helper.pxi”, line 1614, in pandas._libs.hashtable.PyObjectHashTable.get_item
KeyError: ‘transcript’

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File “DeepSpeech.py”, line 931, in
absl.app.run(main)
File “/home/metlife-vad/.local/lib/python3.7/site-packages/absl/app.py”, line 299, in run
_run_main(main, args)
File “/home/metlife-vad/.local/lib/python3.7/site-packages/absl/app.py”, line 250, in _run_main
sys.exit(main(argv))
File “DeepSpeech.py”, line 915, in main
train()
File “DeepSpeech.py”, line 435, in train
train_phase=True)
File “/home/metlife-vad/DeepSpeech/util/feeding.py”, line 101, in create_dataset
df[‘transcript’] = df.apply(text_to_char_array, alphabet=Config.alphabet, result_type=‘reduce’, axis=1)
File “/home/metlife-vad/.local/lib/python3.7/site-packages/pandas/core/frame.py”, line 6928, in apply
return op.get_result()
File “/home/metlife-vad/.local/lib/python3.7/site-packages/pandas/core/apply.py”, line 186, in get_result
return self.apply_standard()
File “/home/metlife-vad/.local/lib/python3.7/site-packages/pandas/core/apply.py”, line 292, in apply_standard
self.apply_series_generator()
File “/home/metlife-vad/.local/lib/python3.7/site-packages/pandas/core/apply.py”, line 321, in apply_series_generator
results[i] = self.f(v)
File “/home/metlife-vad/.local/lib/python3.7/site-packages/pandas/core/apply.py”, line 112, in f
return func(x, *args, **kwds)
File “/home/metlife-vad/DeepSpeech/util/text.py”, line 91, in text_to_char_array
raise ValueError(‘While processing: {}\n{}’.format(series[‘wav_filename’], e))
ValueError: (“While processing: /home/metlife-vad/DeepSpeech/minigir/wav/tmp.wav\n’transcript’”, ‘occurred at index 0’)

lissyx · November 14, 2019, 8:44am

@bharath.vadithya I’m sorry but please use code formatting, this is unreadable.