Hi,
I have a question around dataset to train a model from scratch.
Does dataset of acoustic model have to be subset of dataset of language model?
I think there are a few scenarios.
- Dataset of acoustic model is different from dataset of language model.
- Dataset of acoustic model is subset of dataset of language model.
- Dataset of acoustic model is same as dataset of language model.
I mean acoustic model can have wav audio and sentence like “I love dogs.”
Language model can have “I love dogs. I love cats too”.
Which is recommended?
Thanks in advance.