entry of dret:
(0)
This publication has not been reviewed yet.
rating distribution
average user rating
?
The average rating is computed over all reviews. However, some of them may be invisible to you due to the visibility setting chosen by the reviewers.
From Legacy Documents to XML: A Conversion Framework
by:In: Proceedings of the 9th European Conference on Digital Libraries, Vol. 3652 Vienna, Austria:
Springer-Verlag
(September 2005)
, p. 92-103.
Abstract
We present an integrated framework for the document conversion from legacy formats to XML format. We describe the LegDoC project, aimed at automating the conversion of layout annotations layout-oriented formats like PDF, PS and HTML to semantic-oriented annotations. A toolkit of different components covers complementary techniques the logical document analysis and semantic annotations with the methods of machine learning. We use a real case conversion project as a driving example to exemplify different techniques implemented in the project.
Description
dret'd bibliography


publication