The command I am running:
./DeepSpeech.py --train_files …
--dev_files …/dev.csv --test_files …/test.csv
--checkpoint_dir … --export_dir … --epochs 20 --train_batch_size 64 --dev_batch_size 64 --test_batch_size 64 --early_stop False --summary_dir … --es_steps 50 --learning_rate 0.000075 --dropout_rate 0.20
The model is overfitting: the validation error drops to 70 at best and then stops improving, while the training error continues to decrease. I am thinking of reducing the hidden layer size from 2048 to 1800. I have already tried different learning rates and dropout rates, and they make no difference.
I am training on a p2.xlarge on the Common Voice Dataset.
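To make the symptom concrete, here is a minimal sketch of how the point where the validation loss stops improving could be located from per-epoch values. The helper `best_epoch`, its `patience` parameter, and the loss numbers are all hypothetical and for illustration only; they are not part of DeepSpeech and not measurements from my run.

```python
def best_epoch(val_losses, patience=3):
    """Return (index, loss) of the best validation epoch.

    The scan stops once `patience` consecutive epochs fail to
    improve on the best loss seen so far.
    """
    best_idx, best_loss = 0, float("inf")
    stale = 0  # epochs since the last improvement
    for i, loss in enumerate(val_losses):
        if loss < best_loss:
            best_idx, best_loss = i, loss
            stale = 0
        else:
            stale += 1
            if stale >= patience:
                break
    return best_idx, best_loss


# Illustrative numbers only: validation loss bottoms out and then rises.
val = [125.0, 100.0, 82.0, 72.0, 70.0, 71.0, 73.0, 76.0]
idx, loss = best_epoch(val)
print(idx, loss)
```

With a pattern like this, the checkpoint from the best validation epoch is the one worth keeping, since later epochs only improve the training error.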