|
Zpracování řeči
Proceedings of the Workshop on Speech Processing is a periodic publication collecting the contributions presented at the Czech-German Workshop organized every year in September in Prague This proceedings volume includes30 papers by 55 authors. Papers are devoted to phonetics and prosody, construction of dialogs, speech analysis, synthesis and enhancement, speaker and speech recognition and voice conversion.
|
| |
| |
|
Algoritmy potlačení šumu v řeči zkreslené telekomunikační sítí
Koula, Ivan ; Esposito, A.
This paper aims to provide an evaluation of the effectiveness of three different speech enhancement algorithms. The evaluation of their efficiency was based on the hit rate recognition by computer speech recognizer. These speech enhancement algorithms are based on the method of spectral subtraction and differ by noise spectrum estimation system. The first algorithm estimates noise spectrum on the basis of its statistical characters. Next two algorithms estimate noise spectrum by nonlinear adaptive models.
|
| |
|
Možnosti modelování prozodie TTS systému Epos s použitím MBROLA rozhraní
Horák, Petr ; Chaloupka, Zdeněk
This paper deals with prosody modelling possibilities of the Epos Text-To Speech (TTS) sytem. The aim is to build the TTS system with an absolute duration modelling of the separate sounds and the possibility of using MBROLA compatible Czech voices. The Epos is a very flexible language independent TTS system. It can be widely configured without the need of recompilation. The use of editable rules for every step during the synthesis makes it easy to monitor the progress of the synthesis and to apply changes.
|
| |
|
Použití RLPC inventářů systému Festival v Eposu
Chaloupka, Zdeněk ; Horák, Petr
The aim of this paper is to describe a possibility of the new voices implementation into the Epos text-to-speech (TTS) system. We implemented voices from the Festival TTS system. This system synthesizes text from speech units, which are stored in an inventory file as Residual Linear Prediction Coding (RLPC) coefficients. The inventory file provides every information needed for the text synthesis. The text is synthesized in the MROLA format, thus a phoneme length (and a prosody) can be determined directly.
|