keywords:"Fonémový Rozpoznávač" - Search Results - Digital Repository

guest :: login Digital Repository
		Search		Submit		Help		About

Home > Search Results: keywords:"Fonémový Rozpoznávač"

Search:



Search Tips :: Simple Search

Search collections:

Sort by:	Display results:	Output format:

	Text to Audio Alignment Šikula, Vojtěch ; Beneš, Karel (referee) ; Szőke, Igor (advisor) This bachelors thesis is dealing with text to audio alignment. I present here works which are dealing with same problem. For evaluation have been used data from MGB Challenge 2015. Technique used here is using phoneme transcription and its alignment with transcript. Alignment was done with different models. The best results have been achieved by intersection of two alignments from models from good records. Detailed record
	Voice Activity Detection Břenek, Roman ; Grézl, František (referee) ; Matějka, Pavel (advisor) This thesis describes techniques for voice activity detection in audio recordings. It is necessary to correctly classify all non-speech segments and recognize speech with noisy background. The whole process of voice activity detection (VAD) is described in this thesis, i.e. digitizing audio signal, feature extraction, training of the system, post-processing and final evaluation. There are three different systems compared within the thesis . The first one is based on phoneme recognition using neural network, the other two are variations of Gaussian Mixture Models (GMM). Each system was tested on three data sets - Tactical Speaker Identification Speech Corpus (TSID), Ham Radio (HR) and Rich Transcription Evaluation (RT05-RT07). The best results of each system are compared with the results of the third side. Detailed record
	Text to Audio Alignment Šuba, Adam ; Hradiš, Michal (referee) ; Szőke, Igor (advisor) This bachelor thesis studies a tool for automatic text to audio alignment at the level of single phonemes and graphemes. It also discusses possible techniques used in alignment and possible limitations and difficulties that need to be taken into account. Studied tool uses approach based on grapheme-to-phoneme conversion using joint-sequence models. Data used in experiments are TV broadcast recordings from Multi-Genre Broadcast Challenge 2015. Detailed record
	Voice Conversion Hodaň, David ; Novotný, Ondřej (referee) ; Černocký, Jan (advisor) Voice conversion is the process of transformation of speech parameters belonging to one speaker in such a way that his/her speech sounds as spoken by someone else. This thesis presents a short summary of several techniques currently used for conversion. First, the theory of voice creation with an emphasis on key atributes that characterize and identify a speaker’s voice is described. Methods for voice modification are discussed, together with the advantages and pitfalls that predetermine the use-cases for suitable application of these methods. A high-level overview of how speech is transformed between the source and the target speakers is presented. This description is subsequently used to design a voice conversion system that is aimed to demonstrate one of the possible approaches to the conversion problem. The process of conversion consists of two phases: training and synthesis. As part of this project, a computer program for voice conversion based on the MATLAB programming environment has been developed. Its design, implementation and results are discussed. Detailed record
	Modification of Speech Rate Kovářík, Aleš ; Schwarz, Petr (referee) ; Szőke, Igor (advisor) This diploma thesis discusses modification of a speech rate. The PSOLA (Pitch Synchronous OverLap Add) method was used for the rate modification. This algorithm works in time domain. Another method -- phase vocoder, which works in frequency domain is also presented in an overview. This thesis extends the PSOLA method with a phoneme recognition, which allows for better understandability of the speech output by considering characteristics of the phonemes beeing pronounced. To examine this proposed method, an application connecting PSOLA and a phoneme recognizer was developed. Detailed record
	Voice Conversion Lukáč, Peter ; Glembek, Ondřej (referee) ; Černocký, Jan (advisor) Predmetom tejto práce je konverzia hlasu. Konverzia hlasu predstavuje preberanie reči jedného rečníka, ktorého nazývame zdrojový rečník a transformovanie tejto reči na reč ktorá znie ako reč druhého rečníka, ktorého nazývame cieľový rečník. Toto je dosiahnuté pomocou systému pre konverziu hlasu, ktorý je popísaný v tejto práci. Ako framework pre analýzu a syntézu reči používame STRAIGHT, ktorý bol dominantne používaný vo Voice Conversion Challenge 2016. Náš system pre konverziu hlasu je založený na konverzii spectra použitím doprednej neurónovej siete a paralelného trénovania. Detailed record
	Voice Conversion Lukáč, Peter ; Glembek, Ondřej (referee) ; Černocký, Jan (advisor) Predmetom tejto práce je konverzia hlasu. Konverzia hlasu predstavuje preberanie reči jedného rečníka, ktorého nazývame zdrojový rečník a transformovanie tejto reči na reč ktorá znie ako reč druhého rečníka, ktorého nazývame cieľový rečník. Toto je dosiahnuté pomocou systému pre konverziu hlasu, ktorý je popísaný v tejto práci. Ako framework pre analýzu a syntézu reči používame STRAIGHT, ktorý bol dominantne používaný vo Voice Conversion Challenge 2016. Náš system pre konverziu hlasu je založený na konverzii spectra použitím doprednej neurónovej siete a paralelného trénovania. Detailed record
	Text to Audio Alignment Šuba, Adam ; Hradiš, Michal (referee) ; Szőke, Igor (advisor) This bachelor thesis studies a tool for automatic text to audio alignment at the level of single phonemes and graphemes. It also discusses possible techniques used in alignment and possible limitations and difficulties that need to be taken into account. Studied tool uses approach based on grapheme-to-phoneme conversion using joint-sequence models. Data used in experiments are TV broadcast recordings from Multi-Genre Broadcast Challenge 2015. Detailed record
	Text to Audio Alignment Šikula, Vojtěch ; Beneš, Karel (referee) ; Szőke, Igor (advisor) This bachelors thesis is dealing with text to audio alignment. I present here works which are dealing with same problem. For evaluation have been used data from MGB Challenge 2015. Technique used here is using phoneme transcription and its alignment with transcript. Alignment was done with different models. The best results have been achieved by intersection of two alignments from models from good records. Detailed record
	Voice Conversion Hodaň, David ; Novotný, Ondřej (referee) ; Černocký, Jan (advisor) Voice conversion is the process of transformation of speech parameters belonging to one speaker in such a way that his/her speech sounds as spoken by someone else. This thesis presents a short summary of several techniques currently used for conversion. First, the theory of voice creation with an emphasis on key atributes that characterize and identify a speaker’s voice is described. Methods for voice modification are discussed, together with the advantages and pitfalls that predetermine the use-cases for suitable application of these methods. A high-level overview of how speech is transformed between the source and the target speakers is presented. This description is subsequently used to design a voice conversion system that is aimed to demonstrate one of the possible approaches to the conversion problem. The process of conversion consists of two phases: training and synthesis. As part of this project, a computer program for voice conversion based on the MATLAB programming environment has been developed. Its design, implementation and results are discussed. Detailed record

Interested in being notified about new results for this query?
Subscribe to the RSS feed.

Digital Repository :: :: :: ::
Powered by v1.1.2
Maintained by

This site is also available in the following languages:
Česky English