Open Access Open Access  Restricted Access Subscription or Fee Access

Optical/Handwritten Data Recognition

Rewant Pandey, Shikhar Yadav

Abstract


For many years, optical character recognition (OCR) has been a hot topic. It's the process of breaking down a document image into its individual characters. Despite decades of intensive research, producing OCR with human-like skills is still a work in progress. The industries have long used localization and recognition of written characters for a number of applications. When working in a sterile environment, such as scanners or static settings, most optical character recognition (OCR) systems function well. The demand for OCR in the natural environment is increasing as technology advances. The most notable example is the recognition and localization of text outside, when the settings are far from ideal for machine vision applications. The main goal of this project is to create an Web based application that can recognise text content in JPEG photographs, PDF files, or photos shot in real time and convert them into an editable content document for several languages. For the same reason, this study focuses on the translation of written materials into speech. Due to the difficulty of the task, industrial and university researchers have turned their attention to Optical Character Recognition. The number of academic laboratories and businesses engaged in character recognition research has risen considerably in recent years. Optical character recognition (OCR) allows for a variety of automation applications. The goal of this research is to discover and recognise words in natural photographs. The targeted challenge is substantially more difficult than interpreting text from scanned documents. The goal of this study is to summarise the existing research in the topic of OCR. It gives an overview of many aspects of OCR and explores relevant solutions for fixing OCR concerns. The project had an accuracy rate of more than 80% in character recognition and more than 60% in handwritten character recognition. This document describes the project's stages of development, main hurdles, and some of the project's most noteworthy findings.


Keywords


Optical Character Recognition (OCR), text-to-speech synthesizer, handwritten text recognition, Image Processing, OpenCV.

Full Text:

PDF

References


Divya, Peketi, Mahesh Varma, and Uma Ratna Mouli. "Web based optical character recognition application using flask and tesseract." Materials Today: Proceedings (2021).

Pradeep, J., E. Srinivasan, and S. Himavathi. "Diagonal based feature extraction for handwritten character recognition system using neural network." 2011 3rd international conference on electronics computer technology. Vol. 4. IEEE, 2011.

Muhammad'Arif Mohamad, Dewi Nasien, Haswadi Hassan, and Habibollah Haron. "A review on feature extraction and feature selection for handwritten character recognition." International Journal of Advanced Computer Science and Applications 6.2 (2015): 204-212.

Ayyaz, Muhammad Naeem, Imran Javed, and Waqar Mahmood. "Handwritten character recognition using multiclass svm classification with hybrid feature extraction." Pakistan Journal of Engineering and Applied Sciences (2016).

Vaidya, Swapnil A., and Balaji R. Bombade. "A novel approach of handwritten character recognition using positional feature extraction." International Journal of Computer Science and Mobile Computing 2.6 (2013): 179-186.

SAMOSIR, WILLIAM KAMDESU. Text detection and text extraction on images with Tesseract OCR. Diss. Universitas Katholik Soegijapranata Semarang, 2021.

Revathi, A. S., and Nishi A. Modi. "Comparative Analysis of Text Extraction from Color Images using Tesseract and OpenCV." 2021 8th International Conference on Computing for Sustainable Global Development (INDIACom). IEEE, 2021.

Dome, Saurabh, and Asha P. Sathe. "Optical Character Recognition using Tesseract and Classification." 2021 International Conference on Emerging Smart Computing and Informatics (ESCI). IEEE, 2021.

Salar, Siddharth. "Automate Identification and Recognition of Handwritten Text from an Image." Turkish Journal of Computer and Mathematics Education (TURCOMAT) 12.3 (2021): 3800-3808.

Zhou, Xinyu, et al. "East: an efficient and accurate scene text detector." Proceedings of the IEEE conference on Computer Vision and Pattern Recognition. 2017.


Refbacks

  • There are currently no refbacks.


Copyright (c) 2021 Trends in Opto-Electro and Optical Communications