Original title:
Semi-Supervised Approach To Train Captcha Letter Position Detetor
Authors:
Bostik, Ondrej Document type: Papers
Language:
eng Publisher:
Vysoké učení technické v Brně, Fakulta elektrotechniky a komunikačních technologií Abstract:
Common Optical Character Recognition (OCR) methods benefit from the fact, that the text is distributedin images in a predictable pattern. This is not the situation with CAPTCHA systems. UtilizingOCR algorithms to overcome common web anti-abuse CAPTCHA systems is therefore a challengingtask. To train a system to overcome any CAPTCHA scheme, an attacker needs a huge dataset ofannotated images. And for some methods, the attacker needs not only the right answers but also anexact position of the character in the CAPTCHA image.Annotate the positions of the object in an image is a time-consuming task. In this paper, we proposea system, which can help to annotate the position of CAPTCHA character with minimal humaninteraction. After annotating a small sample of targeted CAPTCHA images, a YOLO-based regiondetection deep network is used to search for the characters’ locations.
Keywords:
CAPTCHA; Deep learning; MATLAB; OCR; semi-supervised learning; YOLO v2 Host item entry: Proceedings I of the 27st Conference STUDENT EEICT 2021: General papers, ISBN 978-80-214-5942-7
Institution: Brno University of Technology
(web)
Document availability information: Fulltext is available in the Brno University of Technology Digital Library. Original record: http://hdl.handle.net/11012/200796