National Repository of Grey Literature 20 records found  1 - 10next  jump to record: Search took 0.01 seconds. 
Speaker recognition
Kašpar, Ladislav ; Atassi, Hicham (referee) ; Sysel, Petr (advisor)
My bachelor thesis is devoted to the problem of speaker recognition. It includes the basic theory on this topic. The theory focuses on the calculation of parameters for speaker recognition and description of the procedure for speaker recognition. An application for speaker recognition has been written in Matlab. It uses techniques as frequency formants, cepstral coefficients and segmentation of the signal as the main parameters.
Speaker Recognition in Skype Calls
Kaňok, Tomáš ; Glembek, Ondřej (referee) ; Schwarz, Petr (advisor)
This diploma thesis is concerned with machine identification and verification of speaker, it's theory and applications. It evaluates existing implementation of the subject by the Speech@FIT group. It also considers plugins for the Skype program. Then a functioning plugin is proposed which makes possible identification of the speaker. It is implemented and evaluated here. Towards the end of this thesis suggestions of future development are presented.
Intersession Variability Compensation in Language and Speaker Identification
Hubeika, Valiantsina ; Burget, Lukáš (referee) ; Matějka, Pavel (advisor)
Variabilita kanálu a hovoru je velmi důležitým problémem v úloze rozpoznávání mluvčího. V současné době je ve velkém množství vědeckých článků uvedeno několik technik pro kompenzaci vlivu kanálu. Kompenzace vlivu kanálu může být implementována jak v doméně modelu, tak i v doménách příznaků i skóre. Relativně nová výkoná technika je takzvaná eigenchannel adaptace pro GMM (Gaussian Mixture Models). Mevýhodou této metody je nemožnost její aplikace na jiné klasifikátory, jako napřílad takzvané SVM (Support Vector Machines), GMM s různým počtem Gausových komponent nebo v rozpoznávání řeči s použitím skrytých markovových modelů (HMM). Řešením může být aproximace této metody, eigenchannel adaptace v doméně příznaků. Obě tyto techniky, eigenchannel adaptace v doméně modelu a doméně příznaků v systémech rozpoznávání mluvčího, jsou uvedeny v této práci. Po dosažení dobrých výsledků v rozpoznávání mluvčího, byl přínos těchto technik zkoumán pro akustický systém rozpoznávání jazyka zahrnující 14 jazyků. V této úloze má nežádoucí vliv nejen variabilita kanálu, ale i variabilita mluvčího. Výsledky jsou prezentovány na datech definovaných pro evaluaci rozpoznávání mluvčího z roku 2006 a evaluaci rozpoznávání jazyka v roce 2007, obě organizované Amerických Národním Institutem pro Standard a Technologie (NIST)
Speaker Recognition on Mobile Phone
Pešán, Jan ; Glembek, Ondřej (referee) ; Černocký, Jan (advisor)
Tato práce se zaměřuje na implementaci počítačového systému rozpoznávání řečníka do prostředí mobilního telefonu. Je zde popsán princip, funkce, a implementace rozpoznávače na mobilním telefonu Nokia N900.
Speakers identification using ALIZE library
Skulínek, Zdeněk ; Burget, Radim (referee) ; Přinosil, Jiří (advisor)
Recently computing power has been significantly increasing and speaker identification systems have been drawing to the foreground among others. Many libraries have been developed, no matter if open--source or proprietary. One of them is for example ALIZE that was developed at Avignon university. This library offers general concept for staff with biometric values, statistic computations and complex framework for intended application. The goal of this thesis is to test this library, namely by constructing a practical application for speaker identification. This thesis describes successful assembly of that application. Application is platform independent and it is also tested on three basic platforms. These platforms are attached as files into VirtualBox virtual machine. This thesis takes notice of practical experiences with ALIZE library and also describes it's structure and architecture. There are 50 patterns of 9 speakers used for testing in average length 20 seconds. Application recognizes speakers very reliable in these given patterns. Therefore real neccesary length test for speaker recognition was accomplished at the end of this thesis.
Multiplatform Application for Speaker Verification
Görig, Jan ; Matějka, Pavel (referee) ; Glembek, Ondřej (advisor)
Bachelor thesis considers speaker recognition without knowledge of spoken message. There are described current feature extraction methods and their evaluation using Gaussian mixture model. The practical output of this work is application for visualization of the recognition process. Developed application is cross platform and it uses Qt and BSAPI libraries.
Modelling Prosodic Dynamics for Speaker Recognition
Jančík, Zdeněk ; Fapšo, Michal (referee) ; Matějka, Pavel (advisor)
Most current automatic speaker recognition system extract speaker-depend features by looking at short-term spectral information. This approach ignores long-term information. I explored approach that use the fundamental frequency and energy trajectories for each speaker. This approach models prosody dynamics on single fonemes or syllables. It is known from literature that prosodic systems do not work as well the acoustic one but it improve the system when fusing. I verified this assumption by fusing my results with state of the art acoustic system from BUT. Data from standard evaluation campaigns organized by National Institute of Standarts and Technology are used for all experiments.
Use of glottalization as a factor enabling speaker identification
Skákal, Ladislav ; Skarnitzl, Radek (advisor) ; Bořil, Tomáš (referee)
While handling the task of speaker identification, forensic phoneticians use a combination of various parameters contained in different levels of speech signal. The main aim of the present thesis is to explore whether glottalization in Czech may be considered as a potentially useful parameter in this sense. In our research, we focus on the rate of prevocalic glottalization at word boundaries and we distinguish between different realisations of glottalization: canonical glottal stop and its hypoarticulated form - creaky voice. The studied material consists of repeated recordings of three male and four female speakers and contains both read text and spontaneous speech. The results do not indicate that the same speaker would use glottalization differently in the first and second recording, but a difference in glottalization is found between speakers. From the forensic phonetics point of view, this finding seems to be useful. Marginally, some other factors which are not directly connected with the speaker (height of following vowel, lexical factors and speech rate) were examined, but no influence on glottalization was found. Keywords: glottal stop, glottalization, forensic phonetics, speaker identification
Ways of exploiting fundamental frequency for speaker identification
Hývlová, Dita ; Skarnitzl, Radek (advisor) ; Šturm, Pavel (referee)
The present Master's thesis deals with the forensic use of fundamental frequency characteristics, specifically with F0 mean values and indicators of variability. Phoneticians who specialise in the forensic analysis of speech generally believe that F0 does not hold much potential as a parameter useful for speaker identification, mainly because it is easily influenced by extrinsic factors (e.g. the speaker's emotional state, interfering noise, transmission channel or even the speaker's own effort to mask his voice), which cause high intra-individual variability. Despite these facts, however, the forensic use of F0 offers a number of advantages, namely straightforward extraction from the speech signal and lower susceptibility to varying lexical content - unlike, for example, vowel formants. This thesis investigates the recordings of 8 male speakers made in two different speech styles (spontaneous and read) and compares the respective indicators of F0 stability and variability, in particular those that are robust in varying external conditions: that is, the baseline for mean values and the 10.-90. percentile range for variability indicators. Apart from that, we take into account phenomena such as the creaky voice, which are idiosyncratic and contribute to easier speaker discrimination. Key words:...
Use of glottalization as a factor enabling speaker identification
Skákal, Ladislav ; Skarnitzl, Radek (advisor) ; Bořil, Tomáš (referee)
While handling the task of speaker identification, forensic phoneticians use a combination of various parameters contained in different levels of speech signal. The main aim of the present thesis is to explore whether glottalization in Czech may be considered as a potentially useful parameter in this sense. In our research, we focus on the rate of prevocalic glottalization at word boundaries and we distinguish between different realisations of glottalization: canonical glottal stop and its hypoarticulated form - creaky voice. The studied material consists of repeated recordings of three male and four female speakers and contains both read text and spontaneous speech. The results do not indicate that the same speaker would use glottalization differently in the first and second recording, but a difference in glottalization is found between speakers. From the forensic phonetics point of view, this finding seems to be useful. Marginally, some other factors which are not directly connected with the speaker (height of following vowel, lexical factors and speech rate) were examined, but no influence on glottalization was found. Keywords: glottal stop, glottalization, forensic phonetics, speaker identification

National Repository of Grey Literature : 20 records found   1 - 10next  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.