National Repository of Grey Literature (Národní úložiště šedé literatury): 44 records found, showing records 1-10.
Speech Processing
Vích, Robert
Speech Processing is a periodic publication collecting the contributions presented at the Czech-German Workshops on Speech Processing organized every year in September in Prague. The papers are devoted to speech analysis, synthesis, recognition, enhancement and phonetics.
Composite cepstral models for speech synthesis
Vondra, Martin ; Smékal, Z.
The composite cepstral model realizes the approximate inverse cepstral transformation. Its transfer function is realized as an IIR digital filter, in which the delay blocks are substituted by FIR filters, whose transfer functions are given by the Z-transform of the real cepstrum. In the contribution, different implementations of the composite cepstral model using the 1st, 2nd and cascade canonic forms are examined and evaluated according to computational and storage requirements.
Microprosody analysis
Přibil, Jiří ; Vích, Robert
In the contribution, statistical and spectral analyses of the microintonation component are performed for several speakers. The results are used to design a FIR digital filter that suppresses the microintonation signal before the virtual melody contour is decomposed into the sentence melody and the word melody.
Czech triphone synthesis of female voice
Horák, Petr ; Hesounová, Alžběta
The new triphone inventory of a female voice for a TTS system has been finished. The motivation for its creation was that no female voice synthesis for Czech was at our disposal, although it is needed in various applications. The corpus of texts used for labelling the new inventory consisted of 550 sentences. The texts were read by a professional female speaker who was instructed to pronounce the sentences with, ideally, monotonous prosody and at a constant speech rate.
Differences in speech processing of male, female and child voices
Přibil, Jiří
This paper describes the differences in speech signal processing of male, female and child voices. Some problems with spectral smoothing of high-pitched female and child voices are discussed. Attention is also paid to the correct setting of input parameters in speech spectrogram calculation.
NNLab - platform base for text-to-speech synthesis
Santarius, J. ; Tučková, Jana
Prosody modelling in synthetic speech is one of the possible applications of artificial neural nets (ANN). The training and testing files for prosody modelling by ANN must be large; therefore it is useful to automate the process. The modular program system NNLab is a powerful, easy-to-operate ANN tool for prosody modelling of synthetic speech. At the center of the NNL environment is a database that contains data characterizing the ANN. The system is implemented in MATLAB V5.2 with NN Toolbox V2.0.
Spectral properties of Czech vowels in spontaneous speech (preliminary analysis)
Dohalská, Marie ; Duběda, T. ; Bartošová, H. ; Mejvaldová, J.
For the study of vowel spectra in spontaneous speech, we used a dialogue of two educated speakers. The data were compared 1) to laboratory sentences pronounced by the same speaker, 2) to reference data. Both hypotheses - 1) there is more centralization in spontaneous speech, 2) laboratory sentences are closer to the reference values - were confirmed. The influence of stress turned out to be inconsiderable, but the difference in length is significant. The differences are also due to regional background.
Speech Processing
Vích, Robert
Speech Processing is a periodic publication collecting the contributions presented at the Czech-German Workshops on Speech Processing organized every year in September in Prague. The papers are devoted to speech analysis, synthesis, recognition and phonetics.
FIR vocal tract model
Vích, Robert
In the paper, a new parametric speech modelling approach based on homomorphic signal processing using spectral analysis is presented. The exponential function in the cepstral vocal tract model is approximated by a finite Maclaurin expansion, which is implemented by a FIR digital filter. The cepstral coefficients are used as the coefficients of another FIR digital filter, which replaces the delay blocks in the first FIR digital filter.
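The structure described in this abstract can be illustrated with a minimal sketch. It is not the author's implementation. It assumes that c is already a causal (minimum-phase) cepstrum sequence, so that the vocal tract model is H(z) = exp(C(z)) with C(z) = c[0] + c[1] z^-1 + ..., and it truncates the Maclaurin series at a chosen order. The function names, the example coefficients and the order are hypothetical. Each power C(z)^k corresponds to passing the signal through the cepstral FIR filter k times, which is what replacing the delay blocks of an outer FIR filter with taps 1/k! amounts to.

```python
from math import exp, factorial


def convolve(a, b):
    """Multiply two polynomials in z^-1 (full linear convolution)."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out


def maclaurin_impulse_response(c, order, length):
    """Impulse response of sum_{k=0}^{order} C(z)^k / k!, first `length` samples.

    The outer sum is an FIR structure with taps 1/k!; the inner filter C(z),
    whose taps are the cepstral coefficients, stands in for each delay block.
    """
    h = [0.0] * length
    power = [1.0]  # C(z)^0
    for k in range(order + 1):
        for n, value in enumerate(power[:length]):
            h[n] += value / factorial(k)
        # C(z) is causal, so samples beyond `length` never affect earlier ones.
        power = convolve(power, c)[:length]
    return h


def exact_impulse_response(c, length):
    """Reference: impulse response of exp(C(z)) via the standard recursion
    n*h[n] = sum_{k=1}^{n} k*c[k]*h[n-k], with h[0] = exp(c[0])."""
    h = [exp(c[0])] + [0.0] * (length - 1)
    for n in range(1, length):
        acc = 0.0
        for k in range(1, min(n, len(c) - 1) + 1):
            acc += k * c[k] * h[n - k]
        h[n] = acc / n
    return h


# Hypothetical cepstral coefficients, chosen only for illustration.
c = [0.1, 0.5, -0.2, 0.1]
approx = maclaurin_impulse_response(c, order=12, length=16)
exact = exact_impulse_response(c, length=16)
max_error = max(abs(a - e) for a, e in zip(approx, exact))
print(f"max abs error of the truncated expansion: {max_error:.2e}")
```

In this sketch the approximation error shrinks quickly as the order grows, because the k-th term is scaled by 1/k!. The order needed for real speech cepstra depends on their dynamic range and is not stated in the abstract.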
New design of combined inventory for Czech text-to-speech synthesis
Hesounová, Alžběta
A new inventory for Czech text-to-speech synthesis is currently being developed. Its core consists of triphone segments, with about 1850 segments in total. Apart from triphones, the inventory will also contain separate segments for vowel bodies and for sentence-initial and sentence-final consonants. Special attention is given to consonants in clusters, which are treated with respect to the neighbouring speech sounds. The new system will work at a 16 kHz sampling frequency.
