keywords:"Pražský závislostní korpus" - Search Results - Digital Repository

guest :: login Digital Repository
		Search		Submit		Help		About

Home > Search Results: keywords:"Pražský závislostní korpus"

Search:

Search Tips :: Advanced Search

Search collections:

Sort by:	Display results:	Output format:

	Czech-English Translation Petrželka, Jiří ; Schmidt, Marek (referee) ; Smrž, Pavel (advisor) Tato diplomová práce popisuje principy statistického strojového překladu a demonstruje, jak sestavit systém pro statistický strojový překlad Moses. V přípravné fázi jsou prozkoumány volně dostupné bilingvní česko-anglické korpusy. Empirická analýza časové náročnosti vícevláknových nástrojů pro zarovnání slov demonstruje, že MGIZA++ může dosáhnout až pětinásobného zrychlení, zatímco PGIZA++ až osminásobného zrychlení (v porovnání s GIZA++). Jsou otestovány tři způsoby morfologického pre-processingu českých trénovacích dat za použití jednoduchých nefaktorových modelů. Zatímco jednoduchá lemmatizace může snížit BLEU, sofistikovanější přístupy většinou BLEU zvyšují. Positivní efekty morfologického pre-processingu se vytrácejí s růstem velikosti korpusu. Vztah mezi dalšími charakteristikami korpusu (velikost, žánr, další data) a výsledným BLEU je empiricky měřen. Koncový systém je natrénován na korpusu CzEng 0.9 a vyhodnocen na testovacím vzorku z workshopu WMT 2010. Detailed record
	Discourse Connectives in Czech.(From Centre to Periphery) Rysová, Magdaléna ; Hajičová, Eva (advisor) ; Malá, Markéta (referee) ; Tárnyiková, Jarmila (referee) Magdaléna Rysová Discourse Connectives in Czech (From Centre to Periphery) Abstract The thesis focuses on description and analysis of discourse connectives in Czech in broader sense, i.e. by which language means it is possible to express sense relation within a text. The thesis is not limited to any parts of speech (like conjunctions or structuring particles) but it tries to find and describe all language means in Czech with the ability to connect two pieces or units of a text into one coherent complex. The thesis investigates discourse connectives in Czech with respect to the so called secondary connectives (i.e. mainly multiword phrases like to je důvod, proč - that is the reason why; kvůli těmto skutečnostem - due to these facts etc., in opposition to primary connectives like však - however, nebo - or, a - and, ale - but, proto - therefore etc.). Discourse connectives are (in general terms) understood as language expressions that signal semantico-pragmatic relations within a text. However, there are many theories that significantly differ in the concrete description of these expressions. Therefore, there is not a generally accepted and universal definition of discourse connectives and their description and characteristics is still a matter of linguistic discussion. The aim of this thesis is to contribute... Detailed record
	Discourse Connectives in Czech.(From Centre to Periphery) Rysová, Magdaléna ; Hajičová, Eva (advisor) ; Malá, Markéta (referee) ; Tárnyiková, Jarmila (referee) Magdaléna Rysová Discourse Connectives in Czech (From Centre to Periphery) Abstract The thesis focuses on description and analysis of discourse connectives in Czech in broader sense, i.e. by which language means it is possible to express sense relation within a text. The thesis is not limited to any parts of speech (like conjunctions or structuring particles) but it tries to find and describe all language means in Czech with the ability to connect two pieces or units of a text into one coherent complex. The thesis investigates discourse connectives in Czech with respect to the so called secondary connectives (i.e. mainly multiword phrases like to je důvod, proč - that is the reason why; kvůli těmto skutečnostem - due to these facts etc., in opposition to primary connectives like však - however, nebo - or, a - and, ale - but, proto - therefore etc.). Discourse connectives are (in general terms) understood as language expressions that signal semantico-pragmatic relations within a text. However, there are many theories that significantly differ in the concrete description of these expressions. Therefore, there is not a generally accepted and universal definition of discourse connectives and their description and characteristics is still a matter of linguistic discussion. The aim of this thesis is to contribute... Detailed record
	Czech-English Translation Petrželka, Jiří ; Schmidt, Marek (referee) ; Smrž, Pavel (advisor) Tato diplomová práce popisuje principy statistického strojového překladu a demonstruje, jak sestavit systém pro statistický strojový překlad Moses. V přípravné fázi jsou prozkoumány volně dostupné bilingvní česko-anglické korpusy. Empirická analýza časové náročnosti vícevláknových nástrojů pro zarovnání slov demonstruje, že MGIZA++ může dosáhnout až pětinásobného zrychlení, zatímco PGIZA++ až osminásobného zrychlení (v porovnání s GIZA++). Jsou otestovány tři způsoby morfologického pre-processingu českých trénovacích dat za použití jednoduchých nefaktorových modelů. Zatímco jednoduchá lemmatizace může snížit BLEU, sofistikovanější přístupy většinou BLEU zvyšují. Positivní efekty morfologického pre-processingu se vytrácejí s růstem velikosti korpusu. Vztah mezi dalšími charakteristikami korpusu (velikost, žánr, další data) a výsledným BLEU je empiricky měřen. Koncový systém je natrénován na korpusu CzEng 0.9 a vyhodnocen na testovacím vzorku z workshopu WMT 2010. Detailed record

Interested in being notified about new results for this query?
Subscribe to the RSS feed.

Digital Repository :: :: :: ::
Powered by v1.1.2
Maintained by

This site is also available in the following languages:
Česky English