Creating a Dead Poets Society: Extracting a Social Network of Historical Persons from the Web
G. Geleijnse, und J. Korst. Proceedings of the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference (ISWC/ASWC2007), Busan, South Korea, Volume 4825 von LNCS, Seite 155--168. Berlin, Heidelberg, Springer Verlag, (November 2007)
Zusammenfassung
We present a simple method to extract information from search engine snippets. Although the techniques presented are domain independent, this work focuses on extracting biographical information of historical persons from multiple unstructured sources on the Web. We first similarly find a list of persons and their periods of life by querying the periods and scanning the retrieved snippets for person names. Subsequently, we find biographical informationfor the persons extracted. In order to get insight in the mutual relations among the persons identified, we create a social network using co-occurrences on the Web. Although we use uncontrolled and unstructured Web sources, the information extracted is reliable. Moreover we show that Web Information Extraction can be used to create both informative and enjoyable applications.
%0 Conference Paper
%1 Geleijnse/2007/Creating
%A Geleijnse, Gijs
%A Korst, Jan
%B Proceedings of the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference (ISWC/ASWC2007), Busan, South Korea
%C Berlin, Heidelberg
%D 2007
%E Aberer, Karl
%E Choi, Key-Sun
%E Noy, Natasha
%E Allemang, Dean
%E Lee, Kyung-Il
%E Nixon, Lyndon J B
%E Golbeck, Jennifer
%E Mika, Peter
%E Maynard, Diana
%E Schreiber, Guus
%E Cudré-Mauroux, Philippe
%I Springer Verlag
%K 2007 dead information_extraction iswc natural_language_processing network ontology_(computer_science) person poet research_15 semantic_web social society web
%P 155--168
%T Creating a Dead Poets Society: Extracting a Social Network of Historical Persons from the Web
%U http://iswc2007.semanticweb.org/papers/155.pdf
%V 4825
%X We present a simple method to extract information from search engine snippets. Although the techniques presented are domain independent, this work focuses on extracting biographical information of historical persons from multiple unstructured sources on the Web. We first similarly find a list of persons and their periods of life by querying the periods and scanning the retrieved snippets for person names. Subsequently, we find biographical informationfor the persons extracted. In order to get insight in the mutual relations among the persons identified, we create a social network using co-occurrences on the Web. Although we use uncontrolled and unstructured Web sources, the information extracted is reliable. Moreover we show that Web Information Extraction can be used to create both informative and enjoyable applications.
@inproceedings{Geleijnse/2007/Creating,
abstract = {We present a simple method to extract information from search engine snippets. Although the techniques presented are domain independent, this work focuses on extracting biographical information of historical persons from multiple unstructured sources on the Web. We first similarly find a list of persons and their periods of life by querying the periods and scanning the retrieved snippets for person names. Subsequently, we find biographical informationfor the persons extracted. In order to get insight in the mutual relations among the persons identified, we create a social network using co-occurrences on the Web. Although we use uncontrolled and unstructured Web sources, the information extracted is reliable. Moreover we show that Web Information Extraction can be used to create both informative and enjoyable applications.},
added-at = {2007-11-07T19:13:58.000+0100},
address = {Berlin, Heidelberg},
author = {Geleijnse, Gijs and Korst, Jan},
biburl = {https://www.bibsonomy.org/bibtex/2bc1ab4d450673d892a7b7edbbae635bf/iswc2007},
booktitle = {Proceedings of the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference (ISWC/ASWC2007), Busan, South Korea},
crossref = {http://data.semanticweb.org/conference/iswc-aswc/2007/proceedings},
editor = {Aberer, Karl and Choi, Key-Sun and Noy, Natasha and Allemang, Dean and Lee, Kyung-Il and Nixon, Lyndon J B and Golbeck, Jennifer and Mika, Peter and Maynard, Diana and Schreiber, Guus and Cudré-Mauroux, Philippe},
interhash = {ce6c966178954244ca9345389eabdc34},
intrahash = {bc1ab4d450673d892a7b7edbbae635bf},
keywords = {2007 dead information_extraction iswc natural_language_processing network ontology_(computer_science) person poet research_15 semantic_web social society web},
month = {November},
pages = {155--168},
publisher = {Springer Verlag},
series = {LNCS},
timestamp = {2007-11-07T19:20:53.000+0100},
title = {Creating a Dead Poets Society: Extracting a Social Network of Historical Persons from the Web},
url = {http://iswc2007.semanticweb.org/papers/155.pdf},
volume = 4825,
year = 2007
}