Learning to construct knowledge bases from the World Wide Web

M. Craven, D. DiPasquo, D. Freitag, A. McCallum, T. Mitchell, K. Nigam, and S. Slattery. Artificial Intelligence, 118 (1–2): 69–113 (2000)
DOI: 10.1016/S0004-3702(00)00004-7

Abstract

The World Wide Web is a vast source of information accessible to computers, but understandable only to humans. The goal of the research described here is to automatically create a computer understandable knowledge base whose content mirrors that of the World Wide Web. Such a knowledge base would enable much more effective retrieval of Web information, and promote new uses of the Web to support knowledge-based inference and problem solving. Our approach is to develop a trainable information extraction system that takes two inputs. The first is an ontology that defines the classes (e.g., company, person, employee, product) and relations (e.g., employed_by, produced_by) of interest when creating the knowledge base. The second is a set of training data consisting of labeled regions of hypertext that represent instances of these classes and relations. Given these inputs, the system learns to extract information from other pages and hyperlinks on the Web. This article describes our general approach, several machine learning algorithms for this task, and promising initial results with a prototype system that has created a knowledge base describing university people, courses, and research projects.

Description

Learning to construct knowledge bases from the World Wide Web 10.1016/S0004-3702(00)00004-7 : Artificial Intelligence | ScienceDirect.com

Links and resources

BibTeX key: craven2000learning
entry type: article
year: 2000
journal: Artificial Intelligence
number: 1–2
pages: 69–113
volume: 118
issn: 0004-3702
DOI: 10.1016/S0004-3702(00)00004-7
url: http://www.sciencedirect.com/science/article/pii/S0004370200000047

Cite this publication

@article{craven2000learning,
  abstract = {The World Wide Web is a vast source of information accessible to computers, but understandable only to humans. The goal of the research described here is to automatically create a computer understandable knowledge base whose content mirrors that of the World Wide Web. Such a knowledge base would enable much more effective retrieval of Web information, and promote new uses of the Web to support knowledge-based inference and problem solving. Our approach is to develop a trainable information extraction system that takes two inputs. The first is an ontology that defines the classes (e.g., company, person, employee, product) and relations (e.g., employed_by, produced_by) of interest when creating the knowledge base. The second is a set of training data consisting of labeled regions of hypertext that represent instances of these classes and relations. Given these inputs, the system learns to extract information from other pages and hyperlinks on the Web. This article describes our general approach, several machine learning algorithms for this task, and promising initial results with a prototype system that has created a knowledge base describing university people, courses, and research projects.},
  added-at = {2012-02-03T07:28:21.000+0100},
  author = {Craven, Mark and DiPasquo, Dan and Freitag, Dayne and McCallum, Andrew and Mitchell, Tom and Nigam, Kamal and Slattery, Seán},
  biburl = {https://www.bibsonomy.org/bibtex/25a061b694a475d34557c7e0a9ff9854b/dbenz},
  description = {Learning to construct knowledge bases from the World Wide Web 10.1016/S0004-3702(00)00004-7 : Artificial Intelligence | ScienceDirect.com},
  doi = {10.1016/S0004-3702(00)00004-7},
  interhash = {68683ddac8974e9b3867c4b076a2b52f},
  intrahash = {5a061b694a475d34557c7e0a9ff9854b},
  issn = {0004-3702},
  journal = {Artificial Intelligence},
  keywords = {ba bachelor2011bachmann classification learning naivebayes semantics website_classification},
  number = {1--2},
  pages = {69--113},
  timestamp = {2013-07-31T15:39:42.000+0200},
  title = {Learning to construct knowledge bases from the World Wide Web},
  url = {http://www.sciencedirect.com/science/article/pii/S0004370200000047},
  volume = 118,
  year = 2000
}
