Article,

DEVNAGARI DOCUMENT SEGMENTATION USING HISTOGRAM APPROACH

V. Dongre, and V. Mankar.
International Journal of Computer Science, Engineering and Information Technology (IJCSEIT), 1 (3): 46-53 (August 2011)
DOI: 10.5121/ijcseit.2011.1305

Full text

Abstract

Document segmentation is one of the critical phases in machine recognition of any language. Correct segmentation of individual symbols decides the accuracy of character recognition technique. It is used to decompose image of a sequence of characters into sub images of individual symbols by segmenting lines and words. Devnagari is the most popular script in India. It is used for writing Hindi, Marathi, Sanskrit and Nepali languages. Moreover, Hindi is the third most popular language in the world. Devnagari documents consist of vowels, consonants and various modifiers. Hence proper segmentation of Devnagari word is challenging. A simple histogram based approach to segment Devnagari documents is proposed in this paper. Various challenges in segmentation of Devnagari script are also discussed.

BibTeX key: noauthororeditor
entry type: article
year: 2011
month: August
journal: International Journal of Computer Science, Engineering and Information Technology (IJCSEIT)
number: 3
pages: 46-53
volume: 1
language: English
issn: 2231-3117 Online ; 2231-3605 Print
DOI: 10.5121/ijcseit.2011.1305
Document: http://airccse.org/journal/ijcseit/papers/0811ijcseit05.pdf

BibSonomy

DEVNAGARI DOCUMENT SEGMENTATION USING HISTOGRAM APPROACH

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on