Original title:
A Machine for Automatic Subject Indexing Using ToC
Authors:
Pokorný, Jan
Document type:
Papers
Conference/Event:
ELAG 2018, Praha (CZ), 2018-06-04 / 2018-06-07
Year:
2018
Language:
eng
Abstract:
The technology developed at the National Library of Technology can extract a document’s table of contents (TOC), generate relevant keywords, and suggest terms for various classification schemes (UDC, DDC, LCC, Conspectus). It can fully or substantially automate the process of generating subject access, unify it across libraries, and significantly increase accuracy and relevance compared to subject assignments by non-specialist catalogers. The higher quality of the subject access terms is often visible in the subject facets generated by discovery systems and in the advanced search forms of library OPACs.
Keywords:
automatic classification; identification description; index analysis; optical character processing; subject description; automatická klasifikace; identifikační popis; indexní analýza; optické zpracování znaků; věcný popis
Rights: This work is protected under the Copyright Act No. 121/2000 Coll.; License: Creative Commons Attribution-NonCommercial-NoDerivs 4.0