Building and operating large-scale information retrieval systems used by hundreds of millions of people around the world provides a number of interesting challenges. Designing such systems requires making complex design tradeoffs in a number of dimensions, including (a) the number of user queries that must be handled per second and the response latency to these requests, (b) the number and size of various corpora that are searched, (c) the latency and frequency with which documents are updated or added to the corpora, and (d) the quality and cost of the ranking algorithms that are used for retrieval. In this talk I'll discuss the evolution of Google's hardware infrastructure and information retrieval systems and some of the design challenges that arise from ever-increasing demands in all of these dimensions. I'll also describe how we use various pieces of distributed systems infrastructure when building these retrieval systems. Finally, I'll describe some future challenges and open research problems in this area.
1st Part: "Prof. Bruns, Prof. Burgess & Dr. Woodford: Mapping Online Publics: New Methods for Twitter Research"
2nd Part: "Robert Jäschke: Identifying and Analyzing Researchers on Twitter"
Earlier this week the UK Conservative party promised to offer a £1m cash prize to a person or team that creates an online platform that can be used to solve “common problems”. The prize – which the party says will
ZXing (pronounced "zebra crossing") is an open-source, multi-format 1D/2D barcode image processing library implemented in Java. Our focus is on using the built-in camera on mobile phones to photograph and decode barcodes on the device, without communicating with a server. We currently have production-quality support for:
ZVAB (Zentrales Verzeichnis Antiquarischer Bücher, the Central Directory of Antiquarian Books) is the world's largest online antiquarian bookshop for German-language titles. More than 3,700 professional antiquarian booksellers from 21 countries offer millions of antiquarian or out-of-print books in many languages on zvab.com, along with sheet music, prints, autographs, and postcards.
Mix cocktails with ease from your own home bar. Cocktailscout.de offers plenty of cocktail recipes and a friendly cocktail community with its own forum.
This is a reference card for zsh. It was created based on version 3.1.9 so is a little out-of-date. It is seven pages long, even with three columns per page, so you will need a very big piece of card to stick it to.
In HubLog, Alf Eaton presents a script he wrote for Google Docs (known in German as Google Texte & Tabellen). Once installed within the Google services, the script searches a document for DOIs on request, uses these DOIs to retrieve uniformly structured, readable citations, builds a bibliography from these citations in the document's appendix, and…
ZBar is an open source software suite for reading bar codes from various sources, such as video streams, image files and raw intensity sensors. It supports many popular symbologies (types of bar codes) including EAN-13/UPC-A, UPC-E, EAN-8, Code 128, Code 39 and Interleaved 2 of 5.
The World Atlas of Language Structures (WALS) is a large database of structural (phonological, grammatical, lexical) properties of languages gathered from descriptive materials (such as reference grammars) by a team of 55 authors.
Wordle is a toy for generating “word clouds” from text that you provide. The clouds give greater prominence to words that appear more frequently in the source text. You can tweak your clouds with different fonts, layouts, and color schemes. The images you create with Wordle are yours to use however you like. You can print them out, or save them to the Wordle gallery to share with your friends.
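The frequency-to-prominence idea described above can be sketched in a few lines of Java. This is only an illustration and not Wordle's actual implementation: the class `WordFrequencySketch`, the methods `countWords` and `fontSize`, and the linear scaling rule are all assumptions made for this example.

```java
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch: count words, then map each count to a font size.
final class WordFrequencySketch {

    // Counts case-insensitive occurrences of each word; any run of
    // non-letter characters is treated as a separator.
    public static Map<String, Integer> countWords(String text) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String word : text.toLowerCase(Locale.ROOT).split("[^\\p{L}]+")) {
            if (!word.isEmpty()) {
                counts.merge(word, 1, Integer::sum);
            }
        }
        return counts;
    }

    // Assumed scaling rule: a word seen once gets minSize, the most frequent
    // word gets maxSize, and everything in between is scaled linearly.
    public static int fontSize(int count, int maxCount, int minSize, int maxSize) {
        int range = Math.max(1, maxCount - 1);
        return minSize + (maxSize - minSize) * (count - 1) / range;
    }

    public static void main(String[] args) {
        Map<String, Integer> counts = countWords("the cat and the hat");
        int maxCount = counts.values().stream().max(Integer::compare).orElse(1);
        counts.forEach((word, count) ->
                System.out.println(word + " -> " + fontSize(count, maxCount, 10, 40) + "pt"));
    }
}
```

A real word cloud would also remove stop words and handle layout; both are left out here.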
The aim of this project is to produce age-appropriate non-fiction books for children from birth to age 12. These books are richly illustrated with photographs, diagrams, sketches, and original drawings. Wikijunior books are produced by a worldwide community of writers, teachers, students, and young people all working together. The books present factual information that is verifiable. You are invited to join in and write, edit, and rewrite each module and book to improve its content. Our books are distributed free of charge under the terms of the Creative Commons Attribution-ShareAlike License.
A Wiki website of Calls For Papers (CFP) of international conferences, workshops, meetings, seminars, events, journals and book chapters in computer science, communications, software engineering, artificial intelligence, machine learning, networking, signal processing, systems etc.
On the "social web" or "Web 2.0", where user participation is entirely voluntary, user motivation has been identified as a key factor in the mechanisms contributing to the success of tagging systems. Web researchers have been trying for a couple of years now to identify the reasons why tagging systems work, as is evident from, for example, the organization of a panel at CHI 2006 and a number of conferences and workshops on this topic.
This article describes common misconceptions about Uniform Resource Locator (URL) encoding, then attempts to clarify URL encoding for HTTP, before presenting frequent problems and their solutions. While this article is not specific to any programming language, we illustrate the problems in Java and finish by explaining how to fix URL encoding problems in Java, and in a web application at several levels.
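As a minimal sketch of one such pitfall (not taken from the article itself), the Java snippet below contrasts two standard-library approaches. `URLEncoder` applies HTML form encoding, which suits query parameter values but turns a space into "+". The multi-argument `java.net.URI` constructor percent-encodes characters that are illegal in each URL part. The class name `UrlEncodingSketch`, the helper names, and the host `example.com` are assumptions made for this example. It requires Java 10 or later.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

// Hypothetical sketch: different URL parts need different encoding rules.
final class UrlEncodingSketch {

    // Encodes a single query parameter value using the
    // application/x-www-form-urlencoded rules (a space becomes "+").
    public static String encodeQueryValue(String value) {
        return URLEncoder.encode(value, StandardCharsets.UTF_8);
    }

    // Builds an http URL from unencoded parts. The URI constructor quotes
    // characters that are illegal in the path or query, but it leaves legal
    // delimiters such as "&" and "=" in the query untouched.
    public static String buildUrl(String host, String path, String query) {
        try {
            return new URI("http", host, path, query, null).toASCIIString();
        } catch (URISyntaxException e) {
            throw new IllegalArgumentException("Cannot build URL", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(encodeQueryValue("a b&c=d"));
        System.out.println(buildUrl("example.com", "/a b", "q=x y"));
    }
}
```

Because the URI constructor keeps "&" and "=" in the query as they are, a parameter value containing those characters still has to be encoded on its own before the query string is assembled.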
Welcome to the Ranking Web of Universities, also known as the Webometrics Ranking. The new edition was published on the 7th of February, based on data collected during the first days of January. It provides the largest and most up-to-date directory and ranking of higher education institutions in the world (now over 21,000).
Workshop Topics
Possible topics of the workshop include (but are not limited to):
* Social network analysis
* Bibliometrics
* Community discovery
* Personalization for search and for social interaction
* Recommender systems
* Web mining algorithms
* Applications of social network analysis
* Mining (Collaborative) Tagging Systems (blogs, wikis, etc.)
* Mining social data for multimedia information retrieval
* Opinion mining
C. Kater and R. Jäschke. Proceedings of the 1st International Workshop on Online Safety, Trust and Fraud Prevention, page 2:1--2:6. New York, NY, USA, ACM, (June 2016)
A. Mislove, B. Viswanath, K. Gummadi, and P. Druschel. Proceedings of the Third ACM International Conference on Web Search and Data Mining, page 251--260. New York, NY, USA, ACM, (2010)
Z. Cheng, J. Caverlee, and K. Lee. Proceedings of the 19th ACM International Conference on Information and Knowledge Management, page 759--768. New York, NY, USA, ACM, (2010)
F. Suchanek, G. Kasneci, and G. Weikum. Proceedings of the 16th international conference on World Wide Web, page 697--706. New York, NY, USA, ACM, (2007)
E. Yeh, D. Ramage, C. Manning, E. Agirre, and A. Soroa. Proceedings of the 2009 Workshop on Graph-based Methods for Natural Language Processing, page 41--49. Stroudsburg, PA, USA, Association for Computational Linguistics, (2009)
M. Strube and S. Ponzetto. Proceedings of the National Conference on Artificial Intelligence, 21, page 1419. Menlo Park, CA; Cambridge, MA; London; AAAI Press; MIT Press; 1999, (2006)