Hi,
My ds-ctcdecoder
version is 0.6.0a15
I am running Debian GNU/Linux 9.11 (stretch) x86_64
I am using DeepSpeech version 0.6.0-alpha.15
I’ve been trying to train DeepSpeech on my own dataset.
Most of my work is available here.
It has the train/test csv splits, alphabet, vocab, trie, etc.
I generated the alphabet and vocab from data/tarteel/quran.json
using bin/generate_(alphabet|vocabulary).py
.
The binary and arpa files were generated using:
lmplz --order 5 --text vocabulary.txt --arpa words.arpa
build_binary -q 8 trie words.arpa lm.binary
../native_client/generate_trie alphabet.txt lm.binary quran.trie
native_client
was compiled in a different directory than the one I was working in, but with the same DeepSpeech version.
When running the following command
python3 -u "$PATH_TO_DEEPSPEECHPY" \
--log_dir "$LOG_DIR" \
--summary_dir "$SUMMARY_DIR" \
--alphabet_config_path "$ALPHABET_PATH" \
--checkpoint_dir "$CHECKPOINT_DIR" \
--train_files "$TRAIN_CSV_FILE" \
--dev_files "$DEV_CSV_FILE" \
--test_files "$TEST_CSV_FILE" \
--export_dir "$EXPORT_DIR" \
--lm_binary_path "$LM_BINARY_PATH" \
--lm_trie_path "$LM_TRIE_PATH" \
--lm_alpha 1.5 \
--dropout_rate 0.30 \
--train_batch_size 1 \
--dev_batch_size 1 \
--test_batch_size 1 \
--n_hidden 2048 \
--epochs 35 \
--early_stop true \
--es_steps 6 \
--es_mean_th 0.1 \
--es_std_th 0.1 \
--learning_rate 0.00095 \
"$@"
I get the following stack trace:
I STARTING Optimization
Epoch 0 | Training | Elapsed Time: 0:00:00 | Steps: 0 | Loss: 0.000000 Fatal Python error: Segmentation fault
Thread 0x00007f84dda86700 (most recent call first):
File "/home/anas/.virtualenvs/ds-env/lib/python3.5/site-packages/pandas/core/dtypes/common.py", line 1789 in is_extension_array_dtype
File "/home/anas/.virtualenvs/ds-env/lib/python3.5/site-packages/pandas/core/internals/blocks.py", line 3255 in get_block_type
File "/home/anas/.virtualenvs/ds-env/lib/python3.5/site-packages/pandas/core/internals/blocks.py", line 3284 in make_block
File "/home/anas/.virtualenvs/ds-env/lib/python3.5/site-packages/pandas/core/internals/managers.py", line 1518 in __init__
File "/home/anas/.virtualenvs/ds-env/lib/python3.5/site-packages/pandas/core/series.py", line 321 in __init__
File "/home/anas/.virtualenvs/ds-env/lib/python3.5/site-packages/pandas/core/frame.py", line 909 in iterrows
File "/home/anas/tarteel_ws/ml-infra/models/deepspeech/util/feeding.py", line 104 in generate_values
File "/home/anas/.virtualenvs/ds-env/lib/python3.5/site-packages/tensorflow/python/data/ops/dataset_ops.py", line 547 in generator_py_func
File "/home/anas/.virtualenvs/ds-env/lib/python3.5/site-packages/tensorflow/python/ops/script_ops.py", line 217 in __call__
Thread 0xFatal Python error: Bus error
./train_deepspeech.sh: line 97: 16523 Bus error
Any suggestions on why this is happening?