Script-agnostic Reflow of Text in Document Images

Panjwani, S and Uppal, A and Cutrell, E (2011) Script-agnostic Reflow of Text in Document Images. In: Proceedings of the 13th International Conference on Human Computer Interaction with Mobile Devices and Services.

Full text not available from this repository.

Abstract

Reading text from document images can be difficult on mobile devices due to the limited screen width available on them. While there exist solutions for reflowing Latin-script texts on such devices, these solutions do not work well for images of other scripts or combinations of scripts, since they rely on script-specific characteristics or OCR. We present a technique that reflows text in document images in a manner that is agnostic to the script used to compose them. Our technique achieved over 95% segmentation accuracy for a corpus of 139 images containing text in 4 genetically-distant languages: English, Hindi, Kannada and Arabic. A preliminary user study with a prototype implementation of the technique provided evidence of some of its usability benefits.

Affiliation: Indian School of Business
ISB Creators: Uppal, A (ORCiD: UNSPECIFIED)
Item Type: Conference or Workshop Item (Paper)
Additional Information: The research article was published by the author with the affiliation of IIT Delhi.
Uncontrolled Keywords: document images, reflow, script-agnostic, segmentation
Subjects: Applied Statistics and Computing
Depositing User: Mohan Dass
Date Deposited: 08 Apr 2019 10:07
Last Modified: 08 Apr 2019 10:07
URI: http://eprints.exchange.isb.edu/id/eprint/767
Publisher URL: http://doi.acm.org/10.1145/2037373.2037419