
Learning table extraction from examples

A. Tengli, Y. Yang, and N. Ma. Proceedings of the 20th international conference on Computational Linguistics, Stroudsburg, PA, USA, Association for Computational Linguistics, (2004)
DOI: 10.3115/1220355.1220497

Abstract

Information extraction from tables in web pages is a challenging problem due to the diverse nature of table formats and the vocabulary variants in attribute names. This paper presents a new approach to automated table extraction that exploits formatting cues in semi-structured HTML tables, learns lexical variants from training examples and uses a vector space model to deal with non-exact matches among labels. We conducted experiments with this method on a set of tables collected from 157 university web sites, and obtained the information extraction performance of 91.4% in the F1-measure, showing the effectiveness of the combined use of structural table parsing and example-based label learning.
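
Illustrative sketch (not taken from the paper): the abstract mentions a vector space model for non-exact matches among labels. One common way to realize that idea is to turn each label into a bag-of-words vector and compare vectors by cosine similarity. Everything below is an assumption made for illustration: the function names, the raw term-count weighting, the 0.5 threshold, and the example variant lists are hypothetical and are not the authors' implementation or data.

```python
import math
import re
from collections import Counter


def tokenize(label):
    """Lowercase a label and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", label.lower())


def cosine(a, b):
    """Cosine similarity between two term-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def best_attribute(label, variants, threshold=0.5):
    """Return (attribute, score) for the closest known variant,
    or (None, score) if nothing reaches the threshold.

    `variants` maps a canonical attribute name to example labels,
    standing in for lexical variants learned from training tables.
    """
    query = Counter(tokenize(label))
    best_name, best_score = None, 0.0
    for name, examples in variants.items():
        for example in examples:
            score = cosine(query, Counter(tokenize(example)))
            if score > best_score:
                best_name, best_score = name, score
    if best_score >= threshold:
        return best_name, best_score
    return None, best_score


# Hypothetical variant lists, invented for this example only.
VARIANTS = {
    "tuition": ["tuition", "tuition and fees"],
    "room_board": ["room and board", "housing and meals"],
}

if __name__ == "__main__":
    for header in ["Tuition & Fees", "Room & Board (per year)", "Parking permit"]:
        print(header, "->", best_attribute(header, VARIANTS))
```

With these assumed variants, a header such as "Tuition & Fees" maps to `tuition` even though it matches no stored label exactly, while an unrelated header such as "Parking permit" falls below the threshold and stays unmatched.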

Links and resources

BibTeX key: tengli2004learning
entry type: inproceedings
address: Stroudsburg, PA, USA
booktitle: Proceedings of the 20th international conference on Computational Linguistics
year: 2004
publisher: Association for Computational Linguistics
series: COLING '04
timestamp: 2012-09-20 02:54:59
username: porta
intrahash: 9b4568ebe3e9995185a37e5de2846053
location: Geneva, Switzerland
acmid: 1220497
interhash: c4d46d3f1fed4c9d8830181b9c02d73c
articleno: 987
groups: public
DOI: 10.3115/1220355.1220497
url: http://dx.doi.org/10.3115/1220355.1220497

