Robust text line, word and character extraction from telugu document image

Koppula, Vijaya Kumar; Atul, Negi; Garain, Utpal

Robust text line, word and character extraction from telugu document image

dc.contributor.author	Koppula, Vijaya Kumar
dc.contributor.author	Atul, Negi
dc.contributor.author	Garain, Utpal
dc.date.accessioned	2022-03-27T05:53:28Z
dc.date.available	2022-03-27T05:53:28Z
dc.date.issued	2009-12-01
dc.description.abstract	Designing an OCR system for Indian languages in general is more complex than those of European languages due the linguistic complexity. Efforts are on the way for the development of efficient OCR systems for Indian languages, especially for Telugu, a popular South Indian language. In this paper, we proposed a method for reliable extraction of text line, word and character from document images of Telugu scripts. In the text line segmentation, first we establish the relationship between the connected components and then cluster the connected components of a line using vertical spatial relation and nearest neighbor algorithm. In word segmentation, the space between two adjacent characters is computed and clustered into word space and character space. Consonant and vowel modifiers are segregated from the word image and segment the characters. © 2009 IEEE.
dc.identifier.citation	2009 2nd International Conference on Emerging Trends in Engineering and Technology, ICETET 2009
dc.identifier.uri	10.1109/ICETET.2009.196
dc.identifier.uri	http://ieeexplore.ieee.org/document/5395511/
dc.identifier.uri	https://dspace.uohyd.ac.in/handle/1/8631
dc.subject	Cluster
dc.subject	Connected component
dc.subject	Consonant and vowel modifiers
dc.subject	Vertical spatial relation
dc.title	Robust text line, word and character extraction from telugu document image
dc.type	Conference Proceeding. Conference Paper
dspace.entity.type

Files

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Plain Text
Description:

Download

Collections

Computer and Information Sciences - Publications