
Adaptive information extraction from text by rule induction and generalisation

F. Ciravegna. Proceedings of the 17th International Joint Conference on Artificial Intelligence - Volume 2, pages 1251--1256. San Francisco, CA, USA, Morgan Kaufmann Publishers Inc., (2001)

Abstract

(LP)² is a covering algorithm for adaptive Information Extraction from text (IE). It induces symbolic rules that insert SGML tags into texts by learning from examples found in a user-defined tagged corpus. Training is performed in two steps: initially a set of tagging rules is learned; then additional rules are induced to correct mistakes and imprecision in tagging. Induction is performed by bottom-up generalization of examples in the training corpus. Shallow knowledge about Natural Language Processing (NLP) is used in the generalization process. The algorithm has a considerable success story. From a scientific point of view, experiments report excellent results with respect to the current state of the art on two publicly available corpora. From an application point of view, a successful industrial IE tool has been based on (LP)². Real world applications have been developed and licenses have been released to external companies for building other applications. This paper presents (LP)², experimental results and applications, and discusses the role of shallow NLP in rule induction.
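As a rough illustration of what "covering" and "bottom-up generalization" mean in the abstract, the following Python sketch learns toy tagging rules from a tagged token sequence. It is not the paper's implementation: the fixed window size, the word-or-wildcard rule representation, the error threshold and all function names are assumptions made for this example. It covers only the first training step (learning tagging rules), not the later correction rules, and it uses no shallow NLP knowledge.

```python
from itertools import combinations

# Hypothetical rule representation: a tuple of conditions over a window of
# tokens around a candidate tag position. Each condition is a required word,
# or None meaning "any word".
WINDOW = 2  # assumed number of tokens inspected on each side of the position


def window(tokens, pos):
    """Return the WINDOW tokens before and after position `pos`, padded with ""."""
    padded = [""] * WINDOW + list(tokens) + [""] * WINDOW
    return tuple(padded[pos:pos + 2 * WINDOW])


def matches(rule, tokens, pos):
    """True if every non-wildcard condition equals the token in the window."""
    return all(c is None or c == w for c, w in zip(rule, window(tokens, pos)))


def generalisations(seed):
    """Bottom-up: relax the seed rule by wildcarding subsets of its conditions.

    At least one condition is always kept, so no rule matches everywhere.
    """
    n = len(seed)
    for k in range(n):  # k = number of dropped conditions; k = 0 is the seed
        for drop in combinations(range(n), k):
            yield tuple(None if i in drop else c for i, c in enumerate(seed))


def induce(tokens, positives, max_errors=0):
    """Covering loop: add rules until every positive position is covered."""
    positives = set(positives)
    positions = range(len(tokens) + 1)
    uncovered, rules = set(positives), []
    while uncovered:
        seed_pos = min(uncovered)
        seed = window(tokens, seed_pos)
        # Fall back to the fully specific seed if no generalisation does better.
        best, best_cover = seed, {seed_pos}
        for rule in generalisations(seed):
            hits = {p for p in positions if matches(rule, tokens, p)}
            errors = hits - positives
            cover = hits & uncovered
            if len(errors) <= max_errors and len(cover) > len(best_cover):
                best, best_cover = rule, cover
        rules.append(best)
        uncovered -= best_cover  # covered examples are removed from the pool
    return rules


if __name__ == "__main__":
    text = "the seminar starts at 3 pm and the talk starts at 5 pm".split()
    # Positions 4 and 11 are where a start-time tag would be inserted.
    print(induce(text, {4, 11}))
```

In this toy corpus both tagged positions share the context "starts at _ pm", so a single generalised rule with a wildcard in place of the number is enough to cover them.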

Description

Adaptive information extraction from text by rule induction and generalisation

Links and resources

BibTeX key: Ciravegna:2001:AIE:1642194.1642261
entry type: inproceedings
address: San Francisco, CA, USA
booktitle: Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
year: 2001
pages: 1251--1256
publisher: Morgan Kaufmann Publishers Inc.
series: IJCAI'01
location: Seattle, WA, USA
acmid: 1642261
isbn: 1-55860-812-5, 978-1-558-60812-2
numpages: 6
Document: http://eprints.aktors.org/118/01/IJCAI01.pdf


Cite this publication

@inproceedings{Ciravegna:2001:AIE:1642194.1642261,
  abstract = {(LP)$^2$ is a covering algorithm for adaptive Information Extraction from text (IE). It induces symbolic rules that insert SGML tags into texts by learning from examples found in a user-defined tagged corpus. Training is performed in two steps: initially a set of tagging rules is learned; then additional rules are induced to correct mistakes and imprecision in tagging. Induction is performed by bottom-up generalization of examples in the training corpus. Shallow knowledge about Natural Language Processing (NLP) is used in the generalization process. The algorithm has a considerable success story. From a scientific point of view, experiments report excellent results with respect to the current state of the art on two publicly available corpora. From an application point of view, a successful industrial IE tool has been based on (LP)$^2$. Real world applications have been developed and licenses have been released to external companies for building other applications. This paper presents (LP)$^2$, experimental results and applications, and discusses the role of shallow NLP in rule induction.},
  acmid = {1642261},
  added-at = {2012-10-11T17:00:22.000+0200},
  address = {San Francisco, CA, USA},
  author = {Ciravegna, Fabio},
  biburl = {https://www.bibsonomy.org/bibtex/25eb346593c6330ee742947824e75e710/jil},
  booktitle = {Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2},
  description = {Adaptive information extraction from text by rule induction and generalisation},
  interhash = {8e97c7bdb4db3c8144c32849b12b9714},
  intrahash = {5eb346593c6330ee742947824e75e710},
  isbn = {1-55860-812-5, 978-1-558-60812-2},
  keywords = {extraction induction information learning pattern},
  location = {Seattle, WA, USA},
  numpages = {6},
  pages = {1251--1256},
  publisher = {Morgan Kaufmann Publishers Inc.},
  series = {IJCAI'01},
  timestamp = {2013-11-23T20:11:51.000+0100},
  title = {Adaptive information extraction from text by rule induction and generalisation},
  url = {http://eprints.aktors.org/118/01/IJCAI01.pdf},
  year = 2001
}
