Understanding paper documents

dc.contributor.author Agarwal, Arun
dc.date.accessioned 2022-03-27T05:52:33Z
dc.date.available 2022-03-27T05:52:33Z
dc.date.issued 1993-06-01
dc.description.abstract We describe the organization and several components of an automated document processing system that begins with digitized images of documents and produces representations at higher levels. Such representations include: the visual sketch (connected components extracted from the background), physical layout (spatial extents of blocks corresponding to text and graphics), logical layout (grouping of strings into words and phrases), and block primitives (e.g., recognised characters and words in text blocks, and recognised hand-drawn line drawings, i.e., schematic electronic circuits). We describe algorithms for deriving several of the representations and the interaction of the different modules. The methods are illustrated with examples. © 1993 the Indian Academy of Sciences.
dc.identifier.citation Sadhana. v.18(2)
dc.identifier.issn 02562499
dc.identifier.uri 10.1007/BF02742657
dc.identifier.uri http://link.springer.com/10.1007/BF02742657
dc.identifier.uri https://dspace.uohyd.ac.in/handle/1/8539
dc.subject Connected component analysis
dc.subject segmentation
dc.subject text and graphic recognition
dc.subject thinning
dc.title Understanding paper documents
dc.type Journal. Article
dspace.entity.type
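
The abstract's lowest-level representation, the visual sketch, is built from connected components extracted from the background. The article's own algorithm is not reproduced in this record, so the following is only a minimal sketch of generic connected component analysis, assuming a binarized page given as a list of rows with 1 for ink and 0 for background, and assuming 8-connectivity. The function name `connected_components` and the sample `page` are hypothetical, chosen for illustration.

```python
from collections import deque


def connected_components(image):
    """Find 8-connected foreground (1) regions of a binary image.

    Returns one bounding box per region as (top, left, bottom, right),
    inclusive, in row-major order of discovery.
    """
    rows = len(image)
    cols = len(image[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if image[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from an unvisited ink pixel.
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and image[ny][nx] == 1
                                and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


# Hypothetical 4x5 page fragment containing two separate ink blobs.
page = [
    [1, 1, 0, 0, 0],
    [1, 0, 0, 1, 0],
    [0, 0, 0, 1, 1],
    [0, 0, 0, 0, 0],
]
boxes = connected_components(page)
```

Bounding boxes of this kind are one plausible input to the physical layout stage, where nearby components could be grouped into text or graphics blocks; how the article itself performs that grouping is not stated in the abstract.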