National Repository of Grey Literature: 107 records found, showing records 56 - 65
Syntax in methods for information retrieval
Straková, Jana
Title: Information Retrieval Using Syntax Information
Author: Bc. Jana Kravalová
Department: Institute of Formal and Applied Linguistics
Supervisor: Mgr. Pavel Pecina, Ph.D.
Supervisor's e-mail address: pecina@ufal.mff.cuni.cz
Abstract: In recent years, the application of language modeling in information retrieval has been studied quite extensively. Although language models of any type can be used with this approach, only traditional n-gram models based on surface word order have been employed and described in published experiments (often only unigram language models). The goal of this thesis is to design, implement, and evaluate (on Czech data) a method which would extend a language model with syntactic information, automatically obtained from documents and queries. We attempt to incorporate syntactic information into language models and experimentally compare this approach with unigram and bigram models based on surface word order. We also empirically compare methods for smoothing, stemming and lemmatization, and the effectiveness of using stopwords and pseudo-relevance feedback. We analyze these retrieval methods and describe their performance in detail.
Keywords: information retrieval, language modelling, dependency syntax, smoothing
Searching relevant articles in extensive collections
Vojt, Ján ; Novák, Jiří (advisor) ; Bartoš, Tomáš (referee)
Searching text in articles is usually implemented with fulltext search. Using more advanced techniques, however, it is possible to achieve significantly better results. The subject of this work is to create a universal library for searching extensive collections, specialized in the Czech language. The library makes use of tools capable of working with morphology while considering the importance of words. It also conducts an experiment with word pairs, which adds context to the search process. The success rate of this experiment is tested on an extensive collection of data. The resulting library is a unique tool for processing extensive collections of Czech text, and at the same time it is ready for further extension with new languages and methods.
Ways of Dissemination, Usage and Impact Tracking of Electronic Theses and Dissertations (ETDs)
Kettler, Meinhard
The digital transformation has had a tremendous impact on graduate research workflows and output. Most theses are submitted as ETDs, although the share varies by country and by subject. Universities worldwide are running institutional repositories to showcase new graduate research, as well as recently digitized material. The presentation will highlight studies on the usage of dissertations and theses in repositories and give insights into ProQuest’s unique dissertations and theses analytics.
Fulltext: idr-1035_1 (PDF)
Slides: idr-1035_2 (PDF); idr-1035_3 (PDF)
Video: idr-1035_4 (MP4)
Online Subject Searching of Dissertations
Bratková, Eva
This paper evaluates searching for doctoral dissertations by topic in various online systems. The situation in the Czech Republic is introduced, including the problems involved in completing successful topical searches for dissertations – is it possible to find all relevant materials, or is it sufficient just to find something? The Czech situation is then compared with how systems abroad, particularly in the United States, are being implemented with new access routes for dissertations in the form of linked open data, in which controlled vocabularies of subject terms figure prominently. The paper also discusses how selected European systems whose dissertations are already presented in the WorldCat database will cope with the following challenge: “... Over time, these references [for topic entities] will be replaced with persistent URIs to... Linked Data resources”.
Fulltext: idr-1034_3 (PDF); idr-1034_4 (PDF)
Slides: idr-1034_1 (PDF); idr-1034_2 (PDF)
Video: idr-1034_5 (MP4)
Library for Support of ReReSearch System Development
Heller, Stanislav ; Otrusina, Lubomír (referee) ; Šperka, Svatopluk (advisor)
Development of the ReReSearch system is currently slowed significantly by mutual incompatibility of system modules, by developers repeating already known mistakes, and by poor communication between developers in general. To solve this problem, a component was needed that would implement and unify frequently performed tasks in the development of the ReReSearch system and thereby save the developers' time. The result of this effort is "rrslib", a Python library intended to help everyone who works on parts of the ReReSearch project: database, data extractors, web-based agents, crawlers, XML processing, etc. The library should make development of the ReReSearch system more consistent, faster, and more reliable.
