National Repository of Grey Literature 12 records found  1 - 10next  jump to record: Search took 0.00 seconds. 
Text to Audio Alignment
Šikula, Vojtěch ; Beneš, Karel (referee) ; Szőke, Igor (advisor)
This bachelors thesis is dealing with text to audio alignment. I present here works which are dealing with same problem. For evaluation have been used data from MGB Challenge 2015. Technique used here is using phoneme transcription and its alignment with transcript. Alignment was done with different models. The best results have been achieved by intersection of two alignments from models from good records.
Voice Activity Detection
Břenek, Roman ; Grézl, František (referee) ; Matějka, Pavel (advisor)
This thesis describes techniques for voice activity detection in audio recordings. It is necessary to  correctly classify all non-speech segments and recognize speech with noisy background.  The whole process of voice activity detection (VAD) is described in this thesis, i.e. digitizing audio  signal, feature extraction, training of the system, post-processing and final evaluation. There are  three different systems compared within the thesis . The first one is based on phoneme recognition using neural network, the other two are variations of Gaussian Mixture Models (GMM). Each system was tested on three data sets - Tactical Speaker Identification Speech Corpus (TSID), Ham Radio (HR) and Rich Transcription Evaluation (RT05-RT07). The best results of each system are compared with the results of the third side.
Text to Audio Alignment
Šuba, Adam ; Hradiš, Michal (referee) ; Szőke, Igor (advisor)
This bachelor thesis studies a tool for automatic text to audio alignment at the level of single phonemes and graphemes. It also discusses possible techniques used in alignment and possible limitations and difficulties that need to be taken into account. Studied tool uses approach based on grapheme-to-phoneme conversion using joint-sequence models. Data used in experiments are TV broadcast recordings from Multi-Genre Broadcast Challenge 2015.
Voice Conversion
Hodaň, David ; Novotný, Ondřej (referee) ; Černocký, Jan (advisor)
Voice conversion is the process of transformation of speech parameters belonging to one speaker in such a way that his/her speech sounds as spoken by someone else. This thesis presents a short summary of several techniques currently used for conversion. First, the theory of voice creation with an emphasis on key atributes that characterize and identify a speaker’s voice is described. Methods for voice modification are discussed, together with the advantages and pitfalls that predetermine the use-cases for suitable application of these methods. A high-level overview of how speech is transformed between the source and the target speakers is presented. This description is subsequently used to design a voice conversion system that is aimed to demonstrate one of the possible approaches to the conversion problem. The process of conversion consists of two phases: training and synthesis. As part of this project, a computer program for voice conversion based on the MATLAB programming environment has been developed. Its design, implementation and results are discussed.
Modification of Speech Rate
Kovářík, Aleš ; Schwarz, Petr (referee) ; Szőke, Igor (advisor)
This diploma thesis discusses modification of a speech rate. The PSOLA (Pitch Synchronous OverLap Add) method was used for the rate modification. This algorithm works in time domain. Another method -- phase vocoder, which works in frequency domain is also presented in an overview. This thesis extends the PSOLA method with a phoneme recognition, which allows for better understandability of the speech output by considering characteristics of the phonemes beeing pronounced. To examine this proposed method, an application connecting PSOLA and a phoneme recognizer was developed.
Voice Conversion
Lukáč, Peter ; Glembek, Ondřej (referee) ; Černocký, Jan (advisor)
Predmetom tejto práce je konverzia hlasu. Konverzia hlasu predstavuje preberanie reči jedného rečníka, ktorého nazývame zdrojový rečník a transformovanie tejto reči na reč ktorá znie ako reč druhého rečníka, ktorého nazývame cieľový rečník. Toto je dosiahnuté pomocou systému pre konverziu hlasu, ktorý je popísaný v tejto práci. Ako framework pre analýzu a syntézu reči používame STRAIGHT, ktorý bol dominantne používaný vo Voice Conversion Challenge 2016. Náš system pre konverziu hlasu je založený na konverzii spectra použitím doprednej neurónovej siete a paralelného trénovania.
Voice Conversion
Lukáč, Peter ; Glembek, Ondřej (referee) ; Černocký, Jan (advisor)
Predmetom tejto práce je konverzia hlasu. Konverzia hlasu predstavuje preberanie reči jedného rečníka, ktorého nazývame zdrojový rečník a transformovanie tejto reči na reč ktorá znie ako reč druhého rečníka, ktorého nazývame cieľový rečník. Toto je dosiahnuté pomocou systému pre konverziu hlasu, ktorý je popísaný v tejto práci. Ako framework pre analýzu a syntézu reči používame STRAIGHT, ktorý bol dominantne používaný vo Voice Conversion Challenge 2016. Náš system pre konverziu hlasu je založený na konverzii spectra použitím doprednej neurónovej siete a paralelného trénovania.
Text to Audio Alignment
Šuba, Adam ; Hradiš, Michal (referee) ; Szőke, Igor (advisor)
This bachelor thesis studies a tool for automatic text to audio alignment at the level of single phonemes and graphemes. It also discusses possible techniques used in alignment and possible limitations and difficulties that need to be taken into account. Studied tool uses approach based on grapheme-to-phoneme conversion using joint-sequence models. Data used in experiments are TV broadcast recordings from Multi-Genre Broadcast Challenge 2015.
Text to Audio Alignment
Šikula, Vojtěch ; Beneš, Karel (referee) ; Szőke, Igor (advisor)
This bachelors thesis is dealing with text to audio alignment. I present here works which are dealing with same problem. For evaluation have been used data from MGB Challenge 2015. Technique used here is using phoneme transcription and its alignment with transcript. Alignment was done with different models. The best results have been achieved by intersection of two alignments from models from good records.
Voice Conversion
Hodaň, David ; Novotný, Ondřej (referee) ; Černocký, Jan (advisor)
Voice conversion is the process of transformation of speech parameters belonging to one speaker in such a way that his/her speech sounds as spoken by someone else. This thesis presents a short summary of several techniques currently used for conversion. First, the theory of voice creation with an emphasis on key atributes that characterize and identify a speaker’s voice is described. Methods for voice modification are discussed, together with the advantages and pitfalls that predetermine the use-cases for suitable application of these methods. A high-level overview of how speech is transformed between the source and the target speakers is presented. This description is subsequently used to design a voice conversion system that is aimed to demonstrate one of the possible approaches to the conversion problem. The process of conversion consists of two phases: training and synthesis. As part of this project, a computer program for voice conversion based on the MATLAB programming environment has been developed. Its design, implementation and results are discussed.

National Repository of Grey Literature : 12 records found   1 - 10next  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.