
ALLRIGHT: Automatic Ontology Instantiation from Tabular Web Documents

K. Shchekotykhin, D. Jannach, G. Friedrich, and O. Kozeruk. Proceedings of the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference (ISWC/ASWC2007), Busan, South Korea, volume 4825 of LNCS, pages 463--476. Berlin, Heidelberg, Springer Verlag (November 2007)

Abstract

The process of instantiating an ontology with high-quality and up-to-date instance information manually is both time-consuming and prone to error. Automatic ontology instantiation from Web sources is one of the possible solutions to this problem and aims at the computer-supported population of an ontology through the exploitation of (redundant) information available on the Web. In this paper we present AllRight, a comprehensive ontology instantiating system. In particular, the techniques implemented in AllRight are designed for application scenarios, in which the desired instance information is given in the form of tables and for which existing Information Extraction approaches based on statistical or natural language processing methods are not directly applicable. Within AllRight, we have therefore developed new techniques for dealing with tabular instance data and combined these techniques with existing methods. The system supports all necessary steps for ontology instantiation, i.e. web crawling, name extraction, document clustering as well as fact extraction and validation. AllRight has been successfully evaluated in the popular domains of digital cameras and notebooks leading to about eighty percent accuracy of the extracted facts given only a very limited amount of seed knowledge.

Links and resources

BibTeX key: Shchekotykhin/2007/ALLRIGHT:
entry type: inproceedings
address: Berlin, Heidelberg
booktitle: Proceedings of the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference (ISWC/ASWC2007), Busan, South Korea
year: 2007
month: November
pages: 463--476
publisher: Springer Verlag
series: LNCS
volume: 4825
crossref: http://data.semanticweb.org/conference/iswc-aswc/2007/proceedings
Document: http://iswc2007.semanticweb.org/papers/463.pdf


Cite this publication

%0 Conference Paper
%1 Shchekotykhin/2007/ALLRIGHT:
%A Shchekotykhin, Kostyantyn
%A Jannach, Dietmar
%A Friedrich, Gerhard
%A Kozeruk, Olga
%B Proceedings of the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference (ISWC/ASWC2007), Busan, South Korea
%C Berlin, Heidelberg
%D 2007
%E Aberer, Karl
%E Choi, Key-Sun
%E Noy, Natasha
%E Allemang, Dean
%E Lee, Kyung-Il
%E Nixon, Lyndon J B
%E Golbeck, Jennifer
%E Mika, Peter
%E Maynard, Diana
%E Schreiber, Guus
%E Cudré-Mauroux, Philippe
%I Springer Verlag
%K 2007 automatic document information_extraction instantiation iswc natural_language_processing ontology ontology_(computer_science) research_15 semantic_web web web_annotation
%P 463--476
%T ALLRIGHT: Automatic Ontology Instantiation from Tabular Web Documents
%U http://iswc2007.semanticweb.org/papers/463.pdf
%V 4825
%X The process of instantiating an ontology with high-quality and up-to-date instance information manually is both time-consuming and prone to error. Automatic ontology instantiation from Web sources is one of the possible solutions to this problem and aims at the computer-supported population of an ontology through the exploitation of (redundant) information available on the Web. In this paper we present AllRight, a comprehensive ontology instantiating system. In particular, the techniques implemented in AllRight are designed for application scenarios, in which the desired instance information is given in the form of tables and for which existing Information Extraction approaches based on statistical or natural language processing methods are not directly applicable. Within AllRight, we have therefore developed new techniques for dealing with tabular instance data and combined these techniques with existing methods. The system supports all necessary steps for ontology instantiation, i.e. web crawling, name extraction, document clustering as well as fact extraction and validation. AllRight has been successfully evaluated in the popular domains of digital cameras and notebooks leading to about eighty percent accuracy of the extracted facts given only a very limited amount of seed knowledge.

@inproceedings{Shchekotykhin/2007/ALLRIGHT:,
  abstract = {The process of instantiating an ontology with high-quality and up-to-date instance information manually is both time-consuming and prone to error. Automatic ontology instantiation from Web sources is one of the possible solutions to this problem and aims at the computer-supported population of an ontology through the exploitation of (redundant) information available on the Web. In this paper we present AllRight, a comprehensive ontology instantiating system. In particular, the techniques implemented in AllRight are designed for application scenarios, in which the desired instance information is given in the form of tables and for which existing Information Extraction approaches based on statistical or natural language processing methods are not directly applicable. Within AllRight, we have therefore developed new techniques for dealing with tabular instance data and combined these techniques with existing methods. The system supports all necessary steps for ontology instantiation, i.e. web crawling, name extraction, document clustering as well as fact extraction and validation. AllRight has been successfully evaluated in the popular domains of digital cameras and notebooks leading to about eighty percent accuracy of the extracted facts given only a very limited amount of seed knowledge.},
  added-at = {2007-11-07T19:13:58.000+0100},
  address = {Berlin, Heidelberg},
  author = {Shchekotykhin, Kostyantyn and Jannach, Dietmar and Friedrich, Gerhard and Kozeruk, Olga},
  biburl = {https://www.bibsonomy.org/bibtex/24c435d53c4fef1fe04a99efc9ce21f74/iswc2007},
  booktitle = {Proceedings of the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference (ISWC/ASWC2007), Busan, South Korea},
  crossref = {http://data.semanticweb.org/conference/iswc-aswc/2007/proceedings},
  editor = {Aberer, Karl and Choi, Key-Sun and Noy, Natasha and Allemang, Dean and Lee, Kyung-Il and Nixon, Lyndon J B and Golbeck, Jennifer and Mika, Peter and Maynard, Diana and Schreiber, Guus and Cudré-Mauroux, Philippe},
  interhash = {83a9a6bd4e131e9e0fda01c4c40085c2},
  intrahash = {4c435d53c4fef1fe04a99efc9ce21f74},
  keywords = {2007 automatic document information_extraction instantiation iswc natural_language_processing ontology ontology_(computer_science) research_15 semantic_web web web_annotation},
  month = {November},
  pages = {463--476},
  publisher = {Springer Verlag},
  series = {LNCS},
  timestamp = {2007-11-07T19:20:50.000+0100},
  title = {ALLRIGHT: Automatic Ontology Instantiation from Tabular Web Documents},
  url = {http://iswc2007.semanticweb.org/papers/463.pdf},
  volume = 4825,
  year = 2007
}

BibSonomy