Original title: Čištění, extrakce textu a převod webových stránek do vertikálního formátu
Translated title: Cleaning, extraction of text and transformation of web pages into vertical format
Authors: Švaňa, Miloš ; Otrusina, Lubomír (referee) ; Dytrych, Jaroslav (advisor)
Document type: Bachelor's theses
Year: 2016
Language: cze
Publisher: Vysoké učení technické v Brně. Fakulta informačních technologií
Abstract: [cze] [eng]

Keywords: Boilerpipe; CommonCrawl; Justext; natural language processing.; text classification; text extraction; Vertcalization; web; Boilerpipe; CommonCrawl; extrakcia textu; Justext; klasifikácia textu; spracovanie prirodzeného jazyka.; Vertikalizácia; web

Institution: Brno University of Technology (web)
Document availability information: Fulltext is available in the Brno University of Technology Digital Library.
Original record: http://hdl.handle.net/11012/62205

Permalink: http://www.nusl.cz/ntk/nusl-586965


The record appears in these collections:
Universities and colleges > Public universities > Brno University of Technology
Academic theses (ETDs) > Bachelor's theses
 Record created 2024-04-02, last modified 2024-04-03


No fulltext
  • Export as DC, NUŠL, RIS
  • Share