Original title: Rychlý a trénovatelný tokenizér pro přirozené jazyky
Translated title: Rychlý a trénovatelný tokenizér pro přirozené jazyky
Authors: Maršík, Jiří ; Bojar, Ondřej (advisor) ; Spousta, Miroslav (referee)
Document type: Bachelor's theses
Year: 2011
Language: eng
Abstract: [eng] [cze]

Keywords: maximum entropy; segmentaion; text preprocessing; tokenization; maximální entropie; předzpracování textu; segmentace; tokenizace

Institution: Charles University Faculties (theses) (web)
Document availability information: Available in the Charles University Digital Repository.
Original record: http://hdl.handle.net/20.500.11956/50274

Permalink: http://www.nusl.cz/ntk/nusl-314563


The record appears in these collections:
Universities and colleges > Public universities > Charles University > Charles University Faculties (theses)
Academic theses (ETDs) > Bachelor's theses
 Record created 2017-05-09, last modified 2022-03-04


No fulltext
  • Export as DC, NUŠL, RIS
  • Share