Original title:
Implementation of 1D mathematical model of vocal cavities into TTS synthesizer – preliminary study
Authors:
Radolf, Vojtěch ; Horák, Petr Document type: Papers Conference/Event: Interaction and Feedbacks 2012 /19./, Praha (CZ), 2012-11-27 / 2012-11-28
Year:
2012
Language:
eng Abstract:
Simplified 1D mathematical models of the human vocal tract were modified for using them in Text-To-Speech systems so that they help to simulate emotional speech. The geometry (area function) of the models for all Czech vowels was modified using the inverse task optimization procedure so that the computed formant frequencies match the measured formant frequencies of utterances of professional speaker. Output acoustic pressure signal generated from the models in wav format sounded satisfactorily for all the vowels and fundamental frequencies varied in an octave range from 77 Hz to 156 Hz. Neverthelles more testing procedures are needed to verify reliability and quickness of the model as well as intelligibility of generated utterances especially in formant TTS system and linear predictive TTS system.
Keywords:
biomechanics of voice; prosody modeling; synthetic speech Project no.: CEZ:AV0Z20760514 (CEP), CEZ:AV0Z20670512 (CEP), GPP101/12/P579 (CEP) Funding provider: GA ČR Host item entry: Interaction and Feedbacks 2012, ISBN 978-80-87012-43-7
Institution: Institute of Thermomechanics AS ČR
(web)
Document availability information: Fulltext is available at the institute of the Academy of Sciences. Original record: http://hdl.handle.net/11104/0213969