System for printed text recognition using Tesseract library
DOI:
https://doi.org/10.24867/19IH04PopovicKeywords:
Printed text recognition, Tesseract, camera, Python, machine visionAbstract
Paper presents one solution of the system that allows recognition of the printed text on the image using digital image processing, i.e. machine vision. The goal was to create an application by writing the program code in Python and by using the Tesseract library, which allows printed text recognition, after which a file containing the recognized text is formed. Sensitivity of the proposed solution was analyzed against the type of acquisition using camera, i.e. character size, as well as which processing techniques best affect the desired result.
References
[1] Chen, X., Jin, L., Zhu, Y., Luo, C., & Wang, T. (2021). Text recognition in the wild: A survey. ACM Computing Surveys (CSUR), 54(2), 1-35.
[2] Gonzalez, R. C., & Woods, R. E. (2018). Digital Image Processing, Hoboken.
[3] Sonka, M., Hlavac, V., & Boyle, R. (2014). Image processing, analysis, and machine vision. Cengage Learning.
[4] Smith, R. (2007) An overview of the Tesseract OCR engine. ICDAR 2007, Vol. 2, pp. 629-633. IEEE.
[2] Gonzalez, R. C., & Woods, R. E. (2018). Digital Image Processing, Hoboken.
[3] Sonka, M., Hlavac, V., & Boyle, R. (2014). Image processing, analysis, and machine vision. Cengage Learning.
[4] Smith, R. (2007) An overview of the Tesseract OCR engine. ICDAR 2007, Vol. 2, pp. 629-633. IEEE.
Downloads
Published
2022-10-05
Issue
Section
Mechatronics