Original title: Implementation of 1D mathematical model of vocal cavities into TTS synthesizer – preliminary study
Authors: Radolf, Vojtěch ; Horák, Petr
Document type: Papers
Conference/Event: Interaction and Feedbacks 2012 /19./, Praha (CZ), 2012-11-27 / 2012-11-28
Year: 2012
Language: eng
Abstract: Simplified 1D mathematical models of the human vocal tract were modified for using them in Text-To-Speech systems so that they help to simulate emotional speech. The geometry (area function) of the models for all Czech vowels was modified using the inverse task optimization procedure so that the computed formant frequencies match the measured formant frequencies of utterances of professional speaker. Output acoustic pressure signal generated from the models in wav format sounded satisfactorily for all the vowels and fundamental frequencies varied in an octave range from 77 Hz to 156 Hz. Neverthelles more testing procedures are needed to verify reliability and quickness of the model as well as intelligibility of generated utterances especially in formant TTS system and linear predictive TTS system.
Keywords: biomechanics of voice; prosody modeling; synthetic speech
Project no.: CEZ:AV0Z20760514 (CEP), CEZ:AV0Z20670512 (CEP), GPP101/12/P579 (CEP)
Funding provider: GA ČR
Host item entry: Interaction and Feedbacks 2012, ISBN 978-80-87012-43-7

Institution: Institute of Thermomechanics AS ČR (web)
Document availability information: Fulltext is available at the institute of the Academy of Sciences.
Original record: http://hdl.handle.net/11104/0213969

Permalink: http://www.nusl.cz/ntk/nusl-135432


The record appears in these collections:
Research > Institutes ASCR > Institute of Thermomechanics
Conference materials > Papers
 Record created 2013-01-04, last modified 2021-11-24


No fulltext
  • Export as DC, NUŠL, RIS
  • Share