Tweets often contain URLs or links to a variety of content on the web, including images, videos, news articles and blog posts. SpiderDuck is a service at Twitter that fetches all URLs shared in Twe......
Finden Sie einfach die besten Sendungen jetzt im TV-Programm. Ihr Lieblings-Programm auf einen Blick mit Schnell-Info. Das Fernsehprogramm mit über 150 Sendern.
Spinn3r is a web service that provides raw access to posts, articles, tweets, status updates, etc. being published - in real or near real time, allowing you to focus on building your application, mashup, or search engine. We find the sources, index their content and take care of all the heavy lifting around delivering large amounts of relevant data.
Webstemmer is a web crawler and HTML layout analyzer that automatically extracts main text of a news site without having banners, ads and/or navigation links mixed up
G. Gossen, E. Demidova, and T. Risse. Proceedings of the 15th ACM/IEEE-CS Joint Conference on Digital Libraries, page 75--84. New York, NY, USA, ACM, (2015)
D. Goyal, and M. Kalra. Proceedings of the International Conference on Signal Propagation and Computer Technology (ICSPCT), page 257--262. IEEE, (July 2014)
Y. Hafri, and C. Djeraba. Proceedings of the 6th ACM SIGMM international workshop on Multimedia information retrieval, page 299--306. New York, NY, USA, ACM Press, (2004)
A. Harth, J. Umbrich, and S. Decker. International Semantic Web Conference, volume 4273 of Lecture Notes in Computer Science, page 258-271. Springer, (2006)
A. Harth, J. Umbrich, and S. Decker. International Semantic Web Conference, volume 4273 of Lecture Notes in Computer Science, page 258-271. Springer, (2006)
T. Kaszuba, P. Turek, A. Wierzbicki, and R. Nielek. Local Proceedings of 13th East-European Conference, ADBIS 2009, page 385-398. JUMI Pubbbblishing House Ltd., (2009)