WebSPHINX ( Website-Specific Processors for HTML INformation eXtraction) is a Java class library and interactive development environment for web crawlers.
The Virtual Observatory for the Study of Online Networks (VOSON) Project is based at the Australian Demographic and Social Research Institute, The Australian National Univeristy. We aim to advance the Social Science of the Internet through new empirical research into online networks and the development of the following associated e-Research tools:
This is the public wiki for the Heritrix archival crawler project. Heritrix is the Internet Archive’s open-source, extensible, web-scale, archival-quality web crawler project. Heritrix (sometimes spelled heretrix, or misspelled or mis-said as heratrix/heritix/ heretix/heratix) is an archaic word for heiress (woman who inherits).
2018. Welche Teile des Webs sollen für zukünftige Generationen archiviert werden? Das erkundet derzeit die Deutsche Nationalbibliothek und befragt Internetnutzer. Im Interview spricht Vizedirektorin Ute Schwens über den Stand der Dinge bei der Webarchivierung und die Auswirkungen des neuen Urheberrechts.
M. Ehrig, J. Hartmann, and C. Schmitz. Workshop ``Semantische Technologien für Informationsportale'' (GI-Jahrestagung 2004), Gesellschaft für Informatik, (September 2004)
C. Schmitz, S. Staab, R. Studer, G. Stumme, and J. Tane. Proc. of E-Learning 2002 World Conference on E-Learning in Corporate, Government, Healthcare and Higher Education on (E-Learning 2002), AACE, page 909-915. Norfolk, (2002)Awarded paper.
C. Schmitz, S. Staab, R. Studer, G. Stummen, and J. Tane. Proc. of E-Learning 2002 World Conference on E-Learning in Corporate, Government, Healthcare and Higher Education on (E-Learning 2002), AACE, page 909-915. Norfolk, (2002)Awarded paper.