<p>Pre-processing of document images is one of the most intensive operations for pre-printed document images. The recognition of text in pre-printed documents is most sensitive to graphical components coexisting with it. In this paper we address the problem of detection and removal of graphical components like logos, emblems and other symbolic entities, which leads to an error free document processing in the subsequent stages of Optical Character Recognition. The detection of graphical entities is performed by employing Zernike moments and histogram of gradient features, followed by which the line detection and removal is accomplished by masking the image with a vertical line structuring element by computation of region covered by convex hull within the area by structuring element in the image. The detection of line structuring element also addresses the problem of characters overlapping with lines leading to retention of the character during erosion of lines from the image. The experimental outcomes produced by emblem detection of algorithm are appreciable with accuracy of around 97% for the emblem detection and 92% accurate outcomes in case of line detection and removal. © Research India Publications.</p>
cited By 0
S. N. Rani, Vineeth, P., and Ajith, D., “Detection and removal of graphical components in pre-printed documents”, International Journal of Applied Engineering Research, vol. 11, pp. 4849-4856, 2016.