999C1a:GA102/05/0278 - Search Results - Digital Repository

guest :: login Digital Repository
		Search		Submit		Help		About

Home > Search Results: 999C1a:GA102/05/0278

Search:

Search Tips :: Advanced Search

Search collections:

Sort by:	Display results:	Output format:

	Lze použít automatické rozpoznávání řeči k hodnocení kvality řeči? Nouza, Jan ; Vích, Robert ; Vondra, Martin In the contribution several case studies are presented in which automatic speech recognition was tested as a means for evaluating of speech quality, either human or synthetic. Usually, speech quality is measured by subjective listening tests. Our aim is to investigate, whether these tests, which request considerable amount of human time and experience, could be replaced or supplemented by techniques based on ASR. Detailed record
	Konverze pohádkových hlasů pro TTS systém s kepstrálním popisem Přibil, Jiří ; Přibilová, Anna Our recent research in improvement of text-to-speech (TTS) synthesis was aimed at storytelling speaking style in addition to its multi-voice realization and expression of emotional states. Storytelling speaking style is suitable for applications aimed at children as well as applications aimed at blind people. In this contribution the experiments with the storytelling voice conversion performed on the short sentences of stories in Slovak and Czech are described. Detailed record
	Návrh vhodných prozodických modelů pro dialogové systémy Horák, Petr This paper deals with the improving of the synthetic prosody modeling especially with the improving of the intonation modeling. A mathematical model of the pitch contour modeling can significantly limit the complexity of intonation rules creation and increase the naturalness of resulting synthetic speech. The linear prediction intonation model implemented in TTS system Epos uses excitation by rules and provides in conjunction with a triphone time domain inventories more naturalness synthetic speech. Detailed record
	Současný stav vývoje českého TTS systému EPOS Chaloupka, Zdeněk ; Horák, Petr This contribution is focused on the current state of the Epos Text-To-Speech (TTS) system. Recently, a MBROLA-like synthesis interface has been developed. This interface synthesizes speech phone by phone, so the length and prosody points of each phone are specified. Several problems were encountered while performing MBROLA-like synthesis. These problems are associated with the speech inventory, which is primarily designed for the time domain Pitch-Synchronous OverLap-and-Add (PSOLA) synthesis. Detailed record
	Úvodní kurs počítačového zpracování řeči pro studenty bakalářského studia Nouza, Jan The article presents concept of a one-semester course, prepared and taught by the author during his stay at ETH in Zurich in 2006. Detailed record
	Experimenty se vzájemnou záměnou řečových stylů s použitím lineárního mapování průběhu FO v časové oblasti Přibil, Jiří ; Přibilová, Anna In this paper the experiments with the speaking styles transposition performed on the speech utterances produced by the TTS system with basic prosody are desribed. Speaking styles prototypes derived from five emotional states were obtained on the sentences with the same information content. The problem with different frame lenght between the prototype and the target utterance was solved by linear time scale mapping. The results were evluated by the listening tests of the resynthetised utterances. Detailed record
	Možnosti modelování prozodie TTS systému Epos s použitím MBROLA rozhraní Horák, Petr ; Chaloupka, Zdeněk This paper deals with prosody modelling possibilities of the Epos Text-To Speech (TTS) sytem. The aim is to build the TTS system with an absolute duration modelling of the separate sounds and the possibility of using MBROLA compatible Czech voices. The Epos is a very flexible language independent TTS system. It can be widely configured without the need of recompilation. The use of editable rules for every step during the synthesis makes it easy to monitor the progress of the synthesis and to apply changes. Detailed record
	Použití RLPC inventářů systému Festival v Eposu Chaloupka, Zdeněk ; Horák, Petr The aim of this paper is to describe a possibility of the new voices implementation into the Epos text-to-speech (TTS) system. We implemented voices from the Festival TTS system. This system synthesizes text from speech units, which are stored in an inventory file as Residual Linear Prediction Coding (RLPC) coefficients. The inventory file provides every information needed for the text synthesis. The text is synthesized in the MROLA format, thus a phoneme length (and a prosody) can be determined directly. Detailed record
	Hodnocení změny hlasu v TTS systému s kepstrálním popisem Přibil, Jiří ; Přibilová, Anna Voice conversion, i.e. modification of a speech signal to sound as if spoken by a different speaker, finds its use in speech synthesis with a new voice without necessity of a new database. It is very useful in special aids for blind and partially sighted people, e.g. Braille notetaker based on the Pocket PC. This paper evaluates voice conversion implemented in the Czech and Slovak text-to-speech (TTS) system based on cepstral description of speech database using the source-filter speech model. Detailed record
	Frequency scale mapping methods for voice conversion Přibilová, Anna The research report compares several approcaches to frequency scale mapping for voice conversion. All the presented methods may be used for transformation of voice characteristics between male and female or childisch. The advantage of the proposed non-linear methods is possibility of their implementation in the text-to-speech systems with cepstral description because of the unchanged number of points of the modified speech spectrum envelope in FFT-based methods. Detailed record

Interested in being notified about new results for this query?
Subscribe to the RSS feed.

Digital Repository :: :: :: ::
Powered by v1.1.2
Maintained by

This site is also available in the following languages:
Česky English