| Authors: |
Lei Zhang
and QiaoLing Liu
and Jie Zhang
and Haofen Wang
and Yue Pan
and Yong Yu
|
| Editors: |
Karl Aberer
and Key-Sun Choi
and Natasha Noy
and Dean Allemang
and Kyung-Il Lee
and Lyndon J B Nixon
and Jennifer Golbeck
and Peter Mika
and Diana Maynard
and Guus Schreiber
and Philippe Cudré-Mauroux
|
| URL: |
http://iswc2007.semanticweb.org/papers/645.pdf |
| Tags: |
2007
application_software
approach
data_management
datum
hybrid
ir
iswc
query
research_13
scalable
semantic
semantic_web
web
|
| Abstract: |
As an extension to the current Web, Semantic Web will not only contain structured data with machine understandable semantics but also textual information. While structured queries can be used to find information more precisely on the Semantic Web, keyword searches are still needed to help exploit textual information. It thus becomes very important that we can combine precise structured queries with imprecise keyword searches to have a hybrid query capability. In addition, due to the huge volume of information on the Semantic Web, the hybrid query must be processed in a very scalable way. In this paper, we define such a hybrid query capability that combines unary tree-shaped structured queries with keyword searches. We show how existing information retrieval (IR) index structures and functions can be reused to index semantic web data and its textual information, and how the hybrid query is evaluated on the index structure using IR engines in an efficient and scalable manner. We implemented this IR approach in an engine called Semplore. Comprehensive experiments on its performance show that it is a promising approach. It leads us to believe that it may be possible to evolve current web search engines to query and search the Semantic Web. Finally, we breifly describe how Semplore is used for searching Wikipedia and an IBM customer's product information. |
@inproceedings{Zhang/2007/Semplore:,
title = {Semplore: An IR Approach to Scalable Hybrid Query of Semantic Web Data},
address = {Berlin, Heidelberg},
author = {Lei Zhang and QiaoLing Liu and Jie Zhang and Haofen Wang and Yue Pan and Yong Yu},
booktitle = {Proceedings of the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference (ISWC/ASWC2007), Busan, South Korea},
crossref = {http://data.semanticweb.org/conference/iswc-aswc/2007/proceedings},
editor = {Karl Aberer and Key-Sun Choi and Natasha Noy and Dean Allemang and Kyung-Il Lee and Lyndon J B Nixon and Jennifer Golbeck and Peter Mika and Diana Maynard and Guus Schreiber and Philippe Cudré-Mauroux},
month = {November},
pages = {645--658},
publisher = {Springer Verlag},
series = {LNCS},
url = {http://iswc2007.semanticweb.org/papers/645.pdf},
volume = {4825},
year = {2007},
abstract = {As an extension to the current Web, Semantic Web will not only contain structured data with machine understandable semantics but also textual information. While structured queries can be used to find information more precisely on the Semantic Web, keyword searches are still needed to help exploit textual information. It thus becomes very important that we can combine precise structured queries with imprecise keyword searches to have a hybrid query capability. In addition, due to the huge volume of information on the Semantic Web, the hybrid query must be processed in a very scalable way. In this paper, we define such a hybrid query capability that combines unary tree-shaped structured queries with keyword searches. We show how existing information retrieval (IR) index structures and functions can be reused to index semantic web data and its textual information, and how the hybrid query is evaluated on the index structure using IR engines in an efficient and scalable manner. We implemented this IR approach in an engine called Semplore. Comprehensive experiments on its performance show that it is a promising approach. It leads us to believe that it may be possible to evolve current web search engines to query and search the Semantic Web. Finally, we breifly describe how Semplore is used for searching Wikipedia and an IBM customer's product information.},
keywords = {2007 application_software approach data_management datum hybrid ir iswc query research_13 scalable semantic semantic_web web }
}