The first 20 lines of the supplied vocab.txt file in the DeepSpeech/data/lm folder are:
a
a''s
a''t
a'a
a'ad
a'ade
a'ain't
a'al
a'am
a'an
a'ana
a'andy
a'ane
a'ant
a'arf
a'b'c'd
a'b'cd
a'b'd'd'c'a
a'b'ilin
What is the purpose of the apostrophes here? More precisely, what should each line of the vocabulary file contain? Should each line simply contain one word that appears in the language corpus, or is some special formatting required?
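
To make the question concrete: if the answer is "one distinct word per line, with apostrophes kept as ordinary characters", I would expect to be able to produce such a file with something like the minimal sketch below. The function name `build_vocab` and the tokenization rule (lowercase letters plus apostrophes) are my own assumptions for illustration; they are not taken from DeepSpeech.

```python
import re


def build_vocab(corpus_lines):
    """Return the sorted list of distinct words found in the corpus.

    Assumption (hypothetical rule, not from DeepSpeech): a word is a run
    of the letters a-z and apostrophes, taken after lowercasing the text.
    """
    words = set()
    for line in corpus_lines:
        for token in re.findall(r"[a-z']+", line.lower()):
            # Skip tokens that consist only of apostrophes.
            if any(ch.isalpha() for ch in token):
                words.add(token)
    return sorted(words)


if __name__ == "__main__":
    corpus = ["A cat ain't a dog", "the dog's bone"]
    # Write one word per line, the layout the listing above appears to use.
    print("\n".join(build_vocab(corpus)))
```

If the real file needs anything beyond this (for example a special meaning for the apostrophes), that is the part I am asking about.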