WIRE: an open-source Web information retrieval environment
C. Castillo and R. Baeza-Yates. Workshop on Open Source Web Information Retrieval (OSWIR), pages 27--30. Compiegne, France, (September 2005)
Abstract
In this paper, we describe the WIRE (Web Information Retrieval Environment) project and focus on some details of its crawler component. The WIRE crawler is a scalable, highly configurable, high performance, open-source Web crawler which we have used to study the characteristics of large Web collections.
%0 Conference Paper
%1 citeulike:463020
%A Castillo, Carlos
%A Baeza-Yates, Ricardo
%B Workshop on Open Source Web Information Retrieval (OSWIR)
%C Compiegne, France
%D 2005
%E Beigbeder, Michel
%E Yee, Wai G.
%K crawling, search
%P 27--30
%T WIRE: an open-source Web information retrieval environment
%U http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.59.3200
%X In this paper, we describe the WIRE (Web Information Retrieval Environment) project and focus on some details of its crawler component. The WIRE crawler is a scalable, highly configurable, high performance, open-source Web crawler which we have used to study the characteristics of large Web collections.
@inproceedings{citeulike:463020,
abstract = {In this paper, we describe the WIRE (Web Information Retrieval Environment) project and focus on some details of its crawler component. The WIRE crawler is a scalable, highly configurable, high performance, open-source Web crawler which we have used to study the characteristics of large Web collections.},
added-at = {2009-08-06T15:16:38.000+0200},
address = {Compiegne, France},
author = {Castillo, Carlos and Baeza-Yates, Ricardo},
biburl = {https://www.bibsonomy.org/bibtex/23d029d333fc14bca824b0e9df2a59de7/chato},
booktitle = {Workshop on Open Source Web Information Retrieval (OSWIR)},
citeulike-article-id = {463020},
citeulike-linkout-0 = {http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.59.3200},
editor = {Beigbeder, Michel and Yee, Wai G.},
interhash = {5d5e0de9724803b1e124cd3a17aa7acd},
intrahash = {3d029d333fc14bca824b0e9df2a59de7},
keywords = {crawling, search},
month = {September},
pages = {27--30},
posted-at = {2007-09-26 15:24:15},
priority = {0},
timestamp = {2009-08-06T15:16:47.000+0200},
title = {WIRE: an open-source Web information retrieval environment},
url = {http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.59.3200},
year = 2005
}