Translated title: Entity retrieval on Wikipedia in the scope of the gikiCLEF track
Authors: Duarte Torres, Sergio Raul ; Pecina, Pavel (advisor) ; Žabokrtský, Zdeněk (referee)
Document type: Master’s theses
Year: 2009
Language: eng
Abstract: This thesis presents a system to retrieve entities specified by a question or description given in natural language, this description indicates the entity type and the properties that the entities need to satisfy. This task is analogous to the one proposed in the GikiCLEF 2009 track. The system is fed with the Spanish Wikipedia Collection of 2008 and every entity is represented by a Wikipage. We propose three novel methods to perform query expansion in the problem of entity retrieval. We also introduce a novel method to employ the English Yago and DBpedia semantic resources to determine the target named entity type; this method is used to improve previous approaches in which the target NE type is based solely on Wikipedia categories. We show that our system obtains promising results when we evaluate its performance in the GikiCLEF 2009 topic list and compare the results with the other participants of the track.

Institution: Charles University Faculties (theses) (web)
Document availability information: Available in the Charles University Digital Repository.
Original record: http://hdl.handle.net/20.500.11956/23299

Permalink: http://www.nusl.cz/ntk/nusl-495594


The record appears in these collections:
Universities and colleges > Public universities > Charles University > Charles University Faculties (theses)
Academic theses (ETDs) > Master’s theses
 Record created 2022-05-08, last modified 2022-05-09


No fulltext
  • Export as DC, NUŠL, RIS
  • Share