Open Access Open Access  Restricted Access Subscription or Fee Access

Text Line Segmentation for Kannada Language Using Enhanced Horizantol Projection Profile Method

Shakunthala B S, Ullas H.S., Pillai C.S. C.S., M Suresh

Abstract


Handwritten character image is taken as dataset for this method. Segmentation is crucial in the Human Character Recognition System for extracting text lines, words, and characters from handwritten Kannada documents. In the proposed system, segmenting text lines, word, characters are done based on enhanced horizantol projection profile approach. The algorithm will be used for finding the height and width of the entire handwritten word The horizontal projection profile approach is evaluated using photographs from a handwritten Kannada manuscript with 100 data set dimension. The system achieved an average segmentation rate of 96.38% on fully unconstrained handwritten Kannada documents.


Keywords


Skew detection and correction, Preprocessing, Noise removal, Binarization, Text line segmentation, Feature extraction.

Full Text:

PDF

References


M. A. Radwan, M. I. Khalil, and H. M. Abbas, ``Neural networks pipeline for offline machine printed Arabic OCR,'' Neural Process. Lett., vol. 48, no. 2, pp. 769_787, Oct. 2018.

P. Thompson, R. T. Batista-Navarro, G. Kontonatsios, J. Carter, E. Toon, J. McNaught, C. Timmermann, M. Worboys, and S. Ananiadou, ``Text mining the history of medicine,'' PLoS ONE, vol.11, no. 1, pp. 1_33, Jan. 2016.

R. Zanibbi and D. Blostein, ``Recognition and retrieval of mathematical expressions,'' Int. J. Document Anal. Recognit., vol. 15, no. 4, pp. 331_357, Dec. 2012, doi: 10.1007/s10032-011-0174-4.

C.Wolf, J.-M. Jolion, and F. Chassaing, ``Text localization, enhancement and binarization in multimedia documents,'' in Proc. Object Recognit. Supported User Interact. Service Robots, vol. 2, 2002, pp. 1037_1040.

S. Mori, C. Y. Suen, and K. Yamamoto, “Historical review of ocr research and development,” Proceedings of the IEEE, vol. 80, no. 7, pp. 1029– 1058, 1992. 25.[2] A. Amin, “Off-line arabic character recognition: the state of the art,” Pattern recognition, vol. 31, no. 5,pp. 517–530, 1998.

D. Jayaram, C. Reddy, K. Prasad, and M. S. Das, “An overview of optical character recognition systems research on telugu language,” International Journal of Science and Advanced Technology, vol. 2, no. 9, pp. 23–29, 2012.

S. Karthik and K. S. Murthy , “Deep belief network based approach to recognize handwritten kannada characters using distributed average of gradients,” Cluster Computing, pp. 1–9, 2018.

A. Mowlaei and K. Faez, Recognition of Isolated Handwritten Persian/Arabic characters and Numerals Using Support Vector Machines, Proceedings of XIII Workshop on Nueral Networks for Signal Processing, pp. 547-554, 2003.

Hanmandlu M, A V Nath, A.C Mishra and V.K Madasu, Fuzzy Model Based Recognition of Handwritten Hindi Numerals using Bacterial Forgaging , 6th IEEE/ACIS International Conference on Computer and Information Science(ICIS 2007), Computer Society, 2007.

Ying Wen, Yue Lu, Pengfei Sh. Handwritten Bangla numeral recognition system and its application to postal automation, Pattern Recognition, 40(1), pp 99-107, 2007.

Banashree N P, R Vasanta, OCR for script identification of Hindi(Devanagari) Numerals using Feature sub selection by means of End-Point with Neuro- Memetic Model, International Journal of Intelligent Systems and Technologies 2, pp 206-210, 2008.

S.V. Rajashekararadhya and Dr. P. Vanaja Ranjan, Efficient zone based feature extraction algorithm for handwritten numeral recognition of four popular south Indian scripts, Journal of Theoretical and Applied Information Technology, pp. 1171-1180,2005-06.

U. Pal, T. Wakabayashi, N. Sharma and F. Kimura, Handwritten Numeral Recognition of Six Popular Indian Scripts. In Proc. 9th International Conference on Document Analysis and Recognition. Curitiba, Brazil, September 24-26, pp. 749-753, 2007.

Benne R.G.,Dhandra B.V and Mallikarjun Hangarge, Tri-scripts handwritten numeral recognition: a novel approach:, Advances in Computational Research, Volume 1, Issue 2, pp 47-51, 2009.

Dinesh Acharya U, N V Subbareddy and Krishnamoorthy, Multilevel Classifier in Recognition of Handwritten Kannada Numeral, Proceedings of World Academy of Science, Engineering And Technology, Vol. 32, pp 308- 313,2008.

G. G. Rajput and Mallikarjun Hangarge, Recognition of Isolated Kannada Numeral Based on Image Fusion Method. PReMI 2007, LNCS 4815, pp. 153– 160, 2007


Refbacks

  • There are currently no refbacks.


Copyright (c) 2023 Journal of Electronic Design Technology