|
NNLab - platform base for text-to-speech synthesis
Santarius, J. ; Tučková, Jana
Prosody modelling in synthetic speech is one of the possible applications of artificial neural nets (ANN). The training and testing files for prosody modelling by ANN must be large; therefore it is useful to automatize the process. The modular program system NNLab is a powerful tool for an easy-to-operate ANN system for prosody modelling of synthetic speech. In the center of NNL environment is a database that contains data that characterize the ANN. The system is done in MATLAB, V5.2, NN Toolbox V2.O.
|
| |
|
Speech Processing
Vích, Robert
Speech Processing is a periodic publication collecting the contributions presented at the Czech-German Workshops on Speech Processing organized every year in September in Prague. The papers are devoted to speech analysis, synthesis, recognition and phonetics.
|
|
FIR vocal tract model
Vích, Robert
In the paper a new parametric speech modelling approach based on homomorphic signal processing using spectral analysis is presented. The exponential function in the cepstral vocal tract model is approximated by a finite MacLaurin expansion, which is implemented by a FIR digital filter. The cepstral coefficients are used as coefficients of an another FIR digital filter, which is introduced in the FIR digital filter instead of the delay blocks.
|
|
New design of combined inventory for Czech text-to-speech synthesis
Hesounová, Alžběta
A new inventory for Czech text-to-speech synthesis is currently developed. Its core consists of triphone segments, the number of all segments being about 1850. Apart from triphones, the inventory will also contain separate segments for vowel bodies and sentence-initial and sentence-final consonants. A special attention is given to consonants in clusters that are treated with respect to the neighbouring speech sounds. The new system is going to work on 16kHz sampling frequency.
|
| |