Fringe map based text line segmentation of printed Telugu document images

dc.contributor.author Koppula, Vijaya Kumar
dc.contributor.author Negi, Atul
dc.date.accessioned 2022-03-27T05:53:16Z
dc.date.available 2022-03-27T05:53:16Z
dc.date.issued 2011-12-02
dc.description.abstract Text line segmentation is a crucial step that can greatly influence the accuracy of an OCR system. One of the major obstacles to building high-accuracy OCR systems for Indic scripts has been the text line segmentation problem. For Telugu script in particular, this problem has yet to be adequately addressed by research. The common methods for Roman script are not applicable because of the inherent complexity of the Telugu script. Previous approaches to Telugu OCR in the literature take a simplified view of the problem, leading to errors in line segmentation. The problem is compounded in old documents that were typeset manually and have non-uniform print quality. In this work we propose a new method based on the fringe map concept. In a fringe map, each pixel of the binary image is associated with a fringe number that denotes the distance to the nearest black pixel. We use this fringe value information to segment text lines. First, we locate peak fringe numbers (PFNs). PFNs that do not lie between lines are filtered out. The PFNs between adjacent lines are used to construct a region. The segmenting path between the adjacent lines is then found by joining the filtered PFNs of that region. © 2011 IEEE.
dc.identifier.citation Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
dc.identifier.issn 1520-5363
dc.identifier.uri 10.1109/ICDAR.2011.260
dc.identifier.uri http://ieeexplore.ieee.org/document/6065519/
dc.identifier.uri https://dspace.uohyd.ac.in/handle/1/8612
dc.subject Fringe Maps
dc.subject Indic scripts
dc.subject Telugu OCR
dc.subject Text line segmentation
dc.title Fringe map based text line segmentation of printed Telugu document images
dc.type Conference Proceeding. Conference Paper
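
The fringe map described in the abstract can be illustrated with a short sketch. This is not the authors' implementation: the abstract does not state which distance metric is used, so city-block (4-connected) distance is assumed here, and the function name `fringe_map` and the convention that 1 marks a black pixel are hypothetical choices made for illustration.

```python
from collections import deque


def fringe_map(image):
    """Return, for each pixel, the city-block distance to the nearest black pixel.

    `image` is a list of equal-length rows where 1 is black (ink) and 0 is white.
    Assumption: 4-connected distance; the source does not specify the metric.
    If the image has no black pixel, every entry stays None.
    """
    rows, cols = len(image), len(image[0])
    dist = [[None] * cols for _ in range(rows)]
    queue = deque()

    # Every black pixel is a source with fringe number 0.
    for r in range(rows):
        for c in range(cols):
            if image[r][c] == 1:
                dist[r][c] = 0
                queue.append((r, c))

    # Multi-source breadth-first search: the first time a white pixel is
    # reached, the path length equals its distance to the nearest black pixel.
    while queue:
        r, c = queue.popleft()
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < rows and 0 <= nc < cols and dist[nr][nc] is None:
                dist[nr][nc] = dist[r][c] + 1
                queue.append((nr, nc))
    return dist


# Two toy "text lines" (rows 0 and 4) separated by three white rows.
page = [
    [1, 1, 1],
    [0, 0, 0],
    [0, 0, 0],
    [0, 0, 0],
    [1, 1, 1],
]
fringe = fringe_map(page)
```

In this toy page the fringe numbers rise from the ink rows toward the middle white row, which is where peak values of the kind the abstract calls PFNs would be expected between two adjacent lines.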