Telugu Word Segmentation Using Fringe Maps

dc.contributor.author Devarapalli, Koteswara Rao
dc.contributor.author Negi, Atul
dc.date.accessioned 2022-03-27T05:52:44Z
dc.date.available 2022-03-27T05:52:44Z
dc.date.issued 2019-01-01
dc.description.abstract In this paper, we propose a word segmentation method that is based on fringe maps on Telugu script. Our objective is to create a data set of word images for enabling direct training for recognition on those. The standard methods employed for the task of word segmentation in Telugu OCR systems are projection profiles and run-length smearing. However those methods have their limitations. In this work a different application of fringe maps is shown for line segmentation into words. Fringes were previously applied successfully for carrying out classification and line segmentation. Telugu script, which has consonant modifiers that are usually placed below or below-right to the base consonants. This kind of orthographic property leads to characters that may touch each other. One way to deal with touched characters is to make use of segmentation free methods, which do not need prior segmentation of word images into characters or connected components. The novelty of our method is that we analyze fringe maps of document images to find an appropriate fringe value threshold and apply it for word segmentation of Telugu documents. Encouraging results are observed with our fringe value threshold based word segmentation. We observe that choosing higher threshold fringe values leads to under-segmentation of words, whereas lower values cause over-segmentation of words. Our word segmentation approach is successfully compared with the widely used projection profiles based word segmentation method.
dc.identifier.citation Communications in Computer and Information Science. v.1020
dc.identifier.issn 18650929
dc.identifier.uri 10.1007/978-981-13-9361-7_8
dc.identifier.uri http://link.springer.com/10.1007/978-981-13-9361-7_8
dc.identifier.uri https://dspace.uohyd.ac.in/handle/1/8558
dc.subject Akshara
dc.subject Fringe distance
dc.subject Telugu OCR
dc.subject Word segmentation
dc.title Telugu Word Segmentation Using Fringe Maps
dc.type Book Series. Conference Paper
dspace.entity.type
Files
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: