Hi all,
Has the idea of adding a domain-specific feature to the sentences on the Common Voice project structure already been discussed here?
In certain areas a technical language is used (e.g. computer science, medicine, finance, law, etc.). Interesting new applications could be developed on the basis of an open source data pool also for these areas.
The sentences collected so far, however, are predominantly simple conversational language with geographical indications.
If, for example, one (or more) "context tags" could be added to a sentence via the Sentence Collector, there would be many new possibilities for later developing domain-specific applications with the data obtained.
Of course, it would also be useful or necessary to have this "context tag" for the speakers available for selection or filtering in the Common Voice Project. On the one hand, technical terms can often only be spoken correctly by speakers with a corresponding professional background. On the other hand, these speakers may not want to torture themselves with countless sentences in which the capital of a country has to be named ;-), but rather come up with the sentences that are important to them in order to support the project.
What do you think of the idea?