| |
| |
| |
| |
|
Od korpusu jako otevřeného zdroje pro bádání ke komerčním produktům
Šimandl, Josef
The development of corpora is sketched, from large collections of texts without tagging through tagged corpora to machines that operate above tagged corpora and produce data presented as data about language, such as Word Sketches (TM). The article remarks that every corpus is merely a representation of texts and that the quality of representation is to be examined. The unavoidable question in research is how is the corpus built and how, under what principles, the service software operates. Both in case we explore a corpus with distortions, where texts appear in a way nobody has written them so (digits and their environment uses to be phenomena of that sort), and in case we are not allowed to have an insight "below the bonnet" or to change working parameters, we hardly may speak about doing scholarly research.
|
| |
|
The Czech word prý/prej: possibilities for functional and semantic differentiation
Hoffmannová, Jana ; Kolářová, I.
Ways of quoting, reproducing, paraphrasing the speech of somebody else or one´s own speech; prý as a neutral signal of quoting, as a modal particle signalling the speaker´s uncertainty, doubts, etc.of the quoted utterance; prý as a textual transition between direct and indirect speech. Data from Czech National Corpus enable to study combinations of prý with other modal particles, to analyze syntactic and stylistic distribution of prý (its specific functions in narratives, in press interviews, etc.).
|
| |
| |