Original title: Rozšíření Apache Tika o extrakci textu ze souborů průmyslových formátů
Translated title: Extension of Apache Tika with Industrial File Formats Text Extraction
Authors: Rešetár, René ; Burget, Radek (referee) ; Rychlý, Marek (advisor)
Document type: Bachelor's theses
Year: 2021
Language: cze
Publisher: Vysoké učení technické v Brně. Fakulta informačních technologií
Abstract: [cze] [eng]

Keywords: .arff; Apache Tika; control laboratories; csv; data extraction; data integrity; farmaceutic industry; Java; JSON; laboratories; Maven; MIME-types; non-paper laboratories; pdf; Service Provider; software; structured data; SVP; table extraction; weka; xlsx; .arff; Apache Tika; bez papierové laboratórium; csv; extrakcia dát; extrakcia tabuliek; farmaceutický priemysel; integrita dát; Java; JSON; kontrolné laboratória; laboratória; Maven; MIME-typy; pdf; Service Provider; software; SVP; weka; xlsx; štruktúrované dáta

Institution: Brno University of Technology (web)
Document availability information: Fulltext is available in the Brno University of Technology Digital Library.
Original record: http://hdl.handle.net/11012/199350

Permalink: http://www.nusl.cz/ntk/nusl-444746


The record appears in these collections:
Universities and colleges > Public universities > Brno University of Technology
Academic theses (ETDs) > Bachelor's theses
 Record created 2021-06-27, last modified 2022-09-04


No fulltext
  • Export as DC, NUŠL, RIS
  • Share