MapReduce: simplified data processing on large clusters

Аннотация

MapReduce is a programming model and an associated implementation for processing and generating large datasets that is amenable to a broad variety of real-world tasks. Users specify the computation in terms of a <i>map</i> and a <i>reduce</i> function, and the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks. Programmers find the system easy to use: more than ten thousand distinct MapReduce programs have been implemented internally at Google over the past four years, and an average of one hundred thousand MapReduce jobs are executed on Google's clusters every day, processing a total of more than twenty petabytes of data per day.

ключ BibTeX: dean2008mapreduce
тип записи: article
адрес: New York, NY, USA
год: 2008
месяц: jan
журнал: Communications of the ACM
номер: 1
страницы: 107--113
издательство: ACM
том: 51
issn: 0001-0782
acmid: 1327492
numpages: 7
issue_date: January 2008
DOI: 10.1145/1327452.1327492
url: http://doi.acm.org/10.1145/1327452.1327492

тэги

Пользователи данного ресурса

Комментарии и рецензиипоказать / перейти в невидимый режим

Пожалуйста, войдите в систему, чтобы принять участие в дискуссии (добавить собственные рецензию, или комментарий)

BibSonomy

MapReduce: simplified data processing on large clusters

Аннотация

тэги

Пользователи данного ресурса

Комментарии и рецензиипоказать / перейти в невидимый режим

Цитировать эту публикацию

искать в

BibSonomy

MapReduce: simplified data processing on large clusters

Аннотация

тэги

Пользователи данного ресурса

Referenced and cited publications

Комментарии и рецензиипоказать / перейти в невидимый режим

Цитировать эту публикацию

искать в