The term “Semantic Search” is certainly not new. However, it has taken on a new dimension and implications in both search and social engines today. In addition, it has had a strong impact on targeted semantic advertising.
DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. DBpedia allows you to ask sophisticated queries against Wikipedia, and to link other data sets on the Web to Wikipedia data. We hope this will make it easier for the amazing amount of information in Wikipedia to be used in new and interesting ways, and that it might inspire new mechanisms for navigating, linking and improving the encyclopaedia itself.
Katta is a scalable, failure tolerant, distributed, data storage for real time access.
Katta serves large, replicated, indices as shards to serve high loads and very large data sets. These indices can be of different type. Currently implementations are available for Lucene and Hadoop mapfiles.
* Makes serving large or high load indices easy
* Serves very large Lucene or Hadoop Mapfile indices as index shards on many servers
* Replicate shards on different servers for performance and fault-tolerance
* Supports pluggable network topologies
* Master fail-over
* Fast, lightweight, easy to integrate
* Plays well with Hadoop clusters
* Apache Version 2 License
J. Waitelonis, and H. Sack. Proceedings of the 2nd IEEE International Workshop on Data Semantics for Multimedia Systems and Applications (DSMSA), in conjunction with IEEE International Symposium on Multimedia (ISM), 14-16 December, 2009, San Diego, California, USA, page 540--545. IEEE Computer Society, (2009)
J. Waitelonis, M. Knuth, L. Wolf, J. Hercher, and H. Sack. Proceedings of the Workshop on Linked Data in the Future Internet at the Future Internet Assembly, December 16-17, 2010, Ghent, Belgium, CEUR Workshop Proceedings, 700, (2010)
G. Manku, A. Jain, and A. Sarma. WWW '07: Proceedings of the 16th international conference on World Wide Web, page 141--150. New York, NY, USA, ACM, (2007)
R. Baeza-Yates, L. Calderón-Benavides, and C. González-Caro. Proceedings of String Processing and Information Retrieval (SPIRE ), volume 4209 of Lecture Notes in Computer Science, page 98--109. Springer, (2006)
X. Wang, and C. Zhai. Proceedings of the 30 th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2007, (2007)