Volker Strom


2010

Resources for Speech Synthesis of Viennese Varieties
Michael Pucher | Friedrich Neubarth | Volker Strom | Sylvia Moosmüller | Gregor Hofer | Christian Kranzler | Gudrun Schuchmann | Dietmar Schabus
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

This paper describes our work on developing corpora of three varieties of Viennese for unit selection speech synthesis. The synthetic voices for the Viennese varieties, implemented with Multisyn, the open-domain unit selection speech synthesis engine of Festival, will also be released within Festival. The paper focuses on two questions: how we selected appropriate speakers, and how we obtained the text sources needed for recording these non-standard varieties. Regarding the first question, it turned out that working with a ‘prototypical’ professional speaker was far preferable to striving for authenticity. In addition, we briefly outline the differences between the Austrian standard and its dialectal varieties and describe how we solved certain technical problems related to these differences. In particular, the specific set of phones applicable to each variety had to be determined by applying various constraints. Since such a set does not serve any descriptive purpose but rather influences the quality of speech synthesis, its careful design was an important task.