A robust algorithm for text string separation from mixed text/graphics
images
L. Fletcher, and R. Kasturi. IEEE Transactions on Pattern Analysis and Machine Intelligence, 10 (6):
910-918(November 1988)
DOI: 10.1109/34.9112
Abstract
The development and implementation of an algorithm for automated text
string separation that is relatively independent of changes in text
font style and size and of string orientation are described. It is
intended for use in an automated system for document analysis. The
principal parts of the algorithm are the generation of connected
components and the application of the Hough transform in order to
group components into logical character strings that can then be
separated from the graphics. The algorithm outputs two images, one
containing text strings and the other graphics. These images can
then be processed by suitable character recognition and graphics
recognition systems. The performance of the algorithm, both in terms
of its effectiveness and computational efficiency, was evaluated
using several test images and showed superior performance compared
to other techniques
%0 Journal Article
%1 FletcherNov1988
%A Fletcher, L.A.
%A Kasturi, R.
%D 1988
%J IEEE Transactions on Pattern Analysis and Machine Intelligence
%K analysis, character computer computerised computerized document graphics graphics, images, mixed pattern picture processing, recognition, separation string text text/graphics transform, transformsHough
%N 6
%P 910-918
%R 10.1109/34.9112
%T A robust algorithm for text string separation from mixed text/graphics
images
%V 10
%X The development and implementation of an algorithm for automated text
string separation that is relatively independent of changes in text
font style and size and of string orientation are described. It is
intended for use in an automated system for document analysis. The
principal parts of the algorithm are the generation of connected
components and the application of the Hough transform in order to
group components into logical character strings that can then be
separated from the graphics. The algorithm outputs two images, one
containing text strings and the other graphics. These images can
then be processed by suitable character recognition and graphics
recognition systems. The performance of the algorithm, both in terms
of its effectiveness and computational efficiency, was evaluated
using several test images and showed superior performance compared
to other techniques
@article{FletcherNov1988,
abstract = {The development and implementation of an algorithm for automated text
string separation that is relatively independent of changes in text
font style and size and of string orientation are described. It is
intended for use in an automated system for document analysis. The
principal parts of the algorithm are the generation of connected
components and the application of the Hough transform in order to
group components into logical character strings that can then be
separated from the graphics. The algorithm outputs two images, one
containing text strings and the other graphics. These images can
then be processed by suitable character recognition and graphics
recognition systems. The performance of the algorithm, both in terms
of its effectiveness and computational efficiency, was evaluated
using several test images and showed superior performance compared
to other techniques},
added-at = {2011-03-27T19:47:06.000+0200},
author = {Fletcher, L.A. and Kasturi, R.},
biburl = {https://www.bibsonomy.org/bibtex/20baf98dc4f9cbff3a9f331ec76899d80/cocus},
doi = {10.1109/34.9112},
file = {:./fletcher1988.pdf:PDF},
interhash = {f1a10c02fa4b335e7a2ed207dd3ce1ab},
intrahash = {0baf98dc4f9cbff3a9f331ec76899d80},
issn = {0162-8828},
journal = {{IEEE} Transactions on Pattern Analysis and Machine Intelligence},
keywords = {analysis, character computer computerised computerized document graphics graphics, images, mixed pattern picture processing, recognition, separation string text text/graphics transform, transformsHough},
month = nov,
number = 6,
pages = {910-918},
timestamp = {2011-03-27T19:47:07.000+0200},
title = {A robust algorithm for text string separation from mixed text/graphics
images},
volume = 10,
year = 1988
}