This page provides a large hyperlink graph for public download. The graph has been extracted from the Common Crawl 2012 web corpus and covers 3.5 billion web pages and 128 billion hyperlinks between these pages. To the best of our knowledge, this graph is the largest hyperlink graph that is available to the public outside companies such as Google, Yahoo, and Microsoft. Below we provide instructions on how to download the graph as well as basic statistics about its topology.
This document is designed as being a simple but comprehensive introductory publication for anybody trying to get into the Semantic Web: from beginners through to long time hackers.
B. Berendt, A. Hotho, und G. Stumme. Web Semantics: Science, Services and Agents on the World Wide Web, 8 (2-3):
95 - 96(2010)Bridging the Gap--Data Mining and Social Network Analysis for Integrating Semantic Web and Web 2.0; The Future of Knowledge Dissemination: The Elsevier Grand Challenge for the Life Sciences.
A. Hotho, R. Jaeschke, und K. Lerman. Semantic Web, 8 (5):
623--624(April 2017)2017 IOS Press and the authors. This is an author produced version of a paper subsequently published in Semantic Web. Uploaded in accordance with the publisher's self-archiving policy..
B. Berendt, N. Glance, und A. Hotho (Hrsg.) Workshop at 18th Europ. Conf. on Machine Learning (ECML'08) / 11th Europ. Conf. on Principles and Practice of Knowledge Discovery in Databases (PKDD'08), (2008)
H. Dai, und B. Mobasher. Proceedings of the Second Semantic Web Mining Workshop at PKDD 2001, km.aifb.uni-karlsruhe.de/semwebmine2002/papers/full/bamshad.pdf, (August 2002)
C. Kemp, und K. Ramamohanarao. Proceedings of the 6th European Conference on Principles
of Data Mining and Knowledge Discovery (PKDD 2002), Seite 263--274. Berlin, Springer, (2002)
Y. Sure, S. Bloehdorn, P. Haase, J. Hartmann, und D. Oberle. Proceedings of the 12th Portuguese Conference on Artificial Intelligence - Progress in Artificial Intelligence (EPIA 2005), Volume 3803 von LNCS, Seite 218 - 231. Covilha, Portugal, Springer, (Dezember 2005)
S. Staab, J. Angele, S. Decker, A. Hotho, A. Maedche, H. Schnurr, R. Studer, und Y. Sure. AAAI 2000/IAAI 2000 - Proceedings of the 17th National Conference on Artificial Intelligence and 12th Innovative Applications of Artificial Intelligence Conference, Austin/TX, USA, July 30-August 3, 2000, AAAI Press/MIT Press, (2000)
A. Schenker, H. Bunke, M. Last, und A. Kandel. Document Analysis Systems, Volume 3163 von Lecture Notes in Computer Science, Seite 401-412. Springer, (2004)
D. Lawrie, und W. Croft. Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2003, Seite 457--458. (2003)
R. Studer, G. Stumme, S. Handschuh, A. Hotho, und B. Motik. New Trends in Knowledge Processing - Data Mining, Semantic Web and Computational Science. Proc. 6th Sanken International Symposium, Seite 31-34. Osaka, Japan, (März 2003)March 10-11, 2003.
B. Krause, R. Jäschke, A. Hotho, und G. Stumme. HT '08: Proceedings of the nineteenth ACM conference on Hypertext and hypermedia, Seite 157--166. New York, NY, USA, ACM, (2008)
L. Specia, und E. Motta. Proc. of the European Semantic Web Conference (ESWC2007), Volume 4519 von LNCS, Seite 624-639. Berlin Heidelberg, Germany, Springer-Verlag, (Juli 2007)
T. Joachims, D. Freitag, und T. Mitchell. Proceedings of the International Joint Conference on
Artificial Intelligence (IJCAI), Seite 770--777. San Francisco, CA, Morgan Kaufmann, (1997)
B. Berendt, A. Hotho, und G. Stumme. Proc. of the 1st Intl. Workshop on Representation and Analysis of Web Space, Seite 1--16. Technical University of Ostrava, (2005)
D. Oberle, B. Berendt, A. Hotho, und J. Gonzalez. Advances in Web Intelligence, First International Atlantic Web Intelligence Conference, AWIC 2003, Madrid, Spain, May 5-6, 2003, Proceedings, Volume 2663 von Lecture Notes in Artificial Intelligence, Seite 142-154. Springer, (2003)
B. Berendt, A. Hotho, und G. Stumme. Proceedings of the First International Semantic Web Conference: The Semantic Web (ISWC 2002), Volume 2342 von Lecture Notes in Computer Science (LNCS), Seite 264-278. Sardinia, Italy, Springer, (2002)
T. Tran, N. Tran, A. Teka Hadgu, und R. Jäschke. Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP), Association for Computational Linguistics, (September 2015)
R. Cooley, B. Mobasher, und J. Srivastava. Proceedings of the Ninth IEEE International Conference on Tools with Artificial Intelligence (ICTAI'97), IEEE Computer Society, (November 1997)