Original title: Semi-Supervised Approach To Train Captcha Letter Position Detetor
Authors: Bostik, Ondrej
Document type: Papers
Language: eng
Publisher: Vysoké učení technické v Brně, Fakulta elektrotechniky a komunikačních technologií
Abstract: Common Optical Character Recognition (OCR) methods benefit from the fact, that the text is distributedin images in a predictable pattern. This is not the situation with CAPTCHA systems. UtilizingOCR algorithms to overcome common web anti-abuse CAPTCHA systems is therefore a challengingtask. To train a system to overcome any CAPTCHA scheme, an attacker needs a huge dataset ofannotated images. And for some methods, the attacker needs not only the right answers but also anexact position of the character in the CAPTCHA image.Annotate the positions of the object in an image is a time-consuming task. In this paper, we proposea system, which can help to annotate the position of CAPTCHA character with minimal humaninteraction. After annotating a small sample of targeted CAPTCHA images, a YOLO-based regiondetection deep network is used to search for the characters’ locations.
Keywords: CAPTCHA; Deep learning; MATLAB; OCR; semi-supervised learning; YOLO v2
Host item entry: Proceedings I of the 27st Conference STUDENT EEICT 2021: General papers, ISBN 978-80-214-5942-7

Institution: Brno University of Technology (web)
Document availability information: Fulltext is available in the Brno University of Technology Digital Library.
Original record: http://hdl.handle.net/11012/200796

Permalink: http://www.nusl.cz/ntk/nusl-447841


The record appears in these collections:
Universities and colleges > Public universities > Brno University of Technology
Conference materials > Papers
 Record created 2021-07-25, last modified 2021-08-22


No fulltext
  • Export as DC, NUŠL, RIS
  • Share