Tweets often contain URLs or links to a variety of content on the web, including images, videos, news articles and blog posts. SpiderDuck is a service at Twitter that fetches all URLs shared in Twe......
Spinn3r is a web service that provides raw access to posts, articles, tweets, status updates, and similar content as it is published, in real or near-real time, so you can focus on building your application, mashup, or search engine. We find the sources, index their content, and take care of all the heavy lifting around delivering large amounts of relevant data.
Easily find the best shows on TV right now. Your favorite programs at a glance, with quick info. The TV guide with over 150 channels.
Purpose: A tool that automates the crawling of AJAX applications. It can be daisy-chained with other proxies (such as ZAP or Burp) so that the functionality of those tools can be applied to parts of a web app that traditional spidering tools miss. Here is a demo of the tool so far: http://vimeo.com/31059474
License: GNU GPL v3
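As a rough illustration of what daisy-chaining means here, a crawler's HTTP traffic can be routed through an upstream intercepting proxy so that the proxy sees every request the crawler makes. This is a minimal sketch using only the Python standard library, not the tool's own code; the helper name `build_proxied_opener` and the proxy address `http://127.0.0.1:8080` are assumptions chosen for illustration.

```python
import urllib.request


def build_proxied_opener(proxy_url: str) -> urllib.request.OpenerDirector:
    """Return an opener that sends HTTP and HTTPS requests via proxy_url."""
    # ProxyHandler maps URL schemes to the proxy that should carry them.
    handler = urllib.request.ProxyHandler({"http": proxy_url, "https": proxy_url})
    return urllib.request.build_opener(handler)


# Assumed address of a locally running intercepting proxy (hypothetical).
UPSTREAM_PROXY = "http://127.0.0.1:8080"
opener = build_proxied_opener(UPSTREAM_PROXY)

# A crawler would then fetch pages with opener.open(url), so each request
# also passes through the upstream proxy. No request is made in this sketch.
```

Any further tool in the chain would be configured the same way, each one pointing at the next proxy as its upstream.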
G. Feng, T. Liu, Y. Wang, Y. Bao, Z. Ma, X. Zhang, and W. Ma. SIGIR '06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, page 75--82. New York, NY, USA, ACM Press, (2006)
D. Gibson, R. Kumar, and A. Tomkins. VLDB '05: Proceedings of the 31st international conference on Very large data bases, page 721--732. VLDB Endowment, (2005)