National Repository of Grey Literature 6 records found  Search took 0.00 seconds. 
NNLab - platform base for text-to-speech synthesis
Santarius, J. ; Tučková, Jana
Prosody modelling in synthetic speech is one of the possible applications of artificial neural nets (ANN). The training and testing files for prosody modelling by ANN must be large; therefore it is useful to automatize the process. The modular program system NNLab is a powerful tool for an easy-to-operate ANN system for prosody modelling of synthetic speech. In the center of NNL environment is a database that contains data that characterize the ANN. The system is done in MATLAB, V5.2, NN Toolbox V2.O.
Spectral properties of Czech vowels in spontaneous speech (preliminary analysis)
Dohalská, Marie ; Duběda, T. ; Bartošová, H. ; Mejvaldová, J.
For the study of vowel spectra in spontaneous speech, we used a dialogue of two educated speakers. The data were compared 1) to laboratory sentences pronouced by the same speaker, 2) to reference data. Both hypotheses - 1) there is more centralization in spontaneous speech, 2) laboratory sentences are closer to the reference values - were confirmed. The influence of stress turned out to be inconsiderable, but the difference of length is significant. The differences are also due to regional background.
Speech Processing
Vích, Robert
Speech Processing is a periodic publication collecting the contributions presented at the Czech-German Workshops on Speech Processing organized every year in September in Prague. The papers are devoted to speech analysis, synthesis, recognition and phonetics.
FIR vocal tract model
Vích, Robert
In the paper a new parametric speech modelling approach based on homomorphic signal processing using spectral analysis is presented. The exponential function in the cepstral vocal tract model is approximated by a finite MacLaurin expansion, which is implemented by a FIR digital filter. The cepstral coefficients are used as coefficients of an another FIR digital filter, which is introduced in the FIR digital filter instead of the delay blocks.
New design of combined inventory for Czech text-to-speech synthesis
Hesounová, Alžběta
A new inventory for Czech text-to-speech synthesis is currently developed. Its core consists of triphone segments, the number of all segments being about 1850. Apart from triphones, the inventory will also contain separate segments for vowel bodies and sentence-initial and sentence-final consonants. A special attention is given to consonants in clusters that are treated with respect to the neighbouring speech sounds. The new system is going to work on 16kHz sampling frequency.
Speech model quality comparison based on the spectrogram method
Přibil, Jiří
The spectrogram is a useful tool for quality comparison of the synthetic speech. In the contribution the advantages and disadvantages of this visual comparison method are discussed.

Interested in being notified about new results for this query?
Subscribe to the RSS feed.