Publication Type:

Journal Article

Source:

Indian Journal of Science and Technology, Indian Society for Education and Environment, Volume 8, Issue 24, Number 24 (2015)

URL:

https://www.scopus.com/inward/record.uri?eid=2-s2.0-84944446067&partnerID=40&md5=3522f0b39a89d3ebf3bf3f1dbafaa6d4

Abstract:

In this information age, sources of information such as historic documents, books, and manuscripts are digitized and made available worldwide over the internet as scanned copies. These scanned images, available in either colour or black and white for comfortable viewing, contain valuable information. Optical Character Recognition (OCR) technology makes it possible to search for keywords in these digital copies. This paper presents a new method for building an OCR system for the Telugu script, focusing mainly on the character recognition module. Features extracted through Discrete Wavelet Transform (DWT), Projection Profile (PP), and Singular Value Decomposition (SVD) are evaluated using k-Nearest Neighbour (k-NN) and Support Vector Machine (SVM) classifiers. The best results are obtained from the DWT features with the SVM classifier.
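The three feature types named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not state the wavelet family, decomposition level, image size, or classifier settings, so the Haar wavelet, the single decomposition level, the 4x4 toy glyphs, and every function name below are assumptions made for illustration. To keep the sketch dependency-free, only a nearest-neighbour classifier is shown; an SVM would be trained on the same feature vectors.

```python
import numpy as np

def projection_profile(img):
    """Projection Profile (PP): row sums followed by column sums of the glyph."""
    a = np.asarray(img, dtype=float)
    return np.concatenate([a.sum(axis=1), a.sum(axis=0)])

def svd_features(img, k=2):
    """SVD features: the k largest singular values of the glyph image."""
    s = np.linalg.svd(np.asarray(img, dtype=float), compute_uv=False)
    return s[:k]

def haar_dwt2_approx(img):
    """One-level 2-D Haar DWT, approximation (LL) sub-band only.

    Assumes even height and width. The Haar wavelet is an assumption;
    the abstract does not say which wavelet the paper uses.
    """
    a = np.asarray(img, dtype=float)
    return (a[0::2, 0::2] + a[0::2, 1::2] + a[1::2, 0::2] + a[1::2, 1::2]) / 2.0

def knn_predict(train_X, train_y, x, k=1):
    """k-NN by Euclidean distance with a majority vote among the k nearest."""
    d = np.linalg.norm(train_X - x, axis=1)
    nearest = np.argsort(d)[:k]
    labels, counts = np.unique(train_y[nearest], return_counts=True)
    return labels[np.argmax(counts)]

# Toy 4x4 "glyphs" (hypothetical stand-ins for segmented character images).
vbar = np.zeros((4, 4)); vbar[:, 1] = 1   # vertical stroke
hbar = np.zeros((4, 4)); hbar[1, :] = 1   # horizontal stroke

train_X = np.stack([projection_profile(vbar), projection_profile(hbar)])
train_y = np.array(["v", "h"])

noisy = vbar.copy(); noisy[0, 0] = 1      # vertical stroke with one stray pixel
print(knn_predict(train_X, train_y, projection_profile(noisy)))
```

In practice each feature vector (the flattened DWT sub-band, the projection profile, or the singular values) would be computed per segmented character and passed to the classifier, which is how the abstract's comparison of feature sets against k-NN and SVM could be reproduced.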

Cite this Research Publication

J. Jyothi, Manjusha, K., M. Kumar, A., and Soman, K. P., “Innovative feature sets for machine learning based Telugu character recognition”, Indian Journal of Science and Technology, vol. 8, no. 24, 2015.