tag :: mapreduce | BibSonomy

bookmarks (hide)166
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

4Data-Intensive Text Processing with MapReduce
Our world is being revolutionized by data-driven methods: access to large amounts of data has generated new insights and opened exciting new opportunities in commerce, science, and computing applications. Processing the enormous quantities of data necessary for these advances requires large clusters, making distributed computing paradigms more crucial than ever. MapReduce is a programming model for expressing distributed computations on massive datasets and an execution framework for large-scale data processing on clusters of commodity servers. The programming model provides an easy-to-understand abstraction for designing scalable algorithms, while the execution framework transparently handles many system-level details, ranging from scheduling to synchronization to fault tolerance. This book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning. We introduce the notion of MapReduce design patterns, which represent general reusable solutions to commonly occurring problems across a variety of problem domains. This book not only intends to help the reader "think in MapReduce", but also discusses limitations of the programming model as well.
15 years ago by @flowolf
show all tags
hadoop-group
awmhadoop
awm2010
data
mapreduce
processing
hadoop
text
intensive
hadoop-groupawmhadoopawm2010datamapreduceprocessinghadooptextintensive
copydelete
- community post
- history of this post
2Calculating the Jaccard Similarity Coeffcient with Map Reduce for Entity Pairs in Wikipedia
Calculating the Jaccard Similarity Coeffcient with Map Reduce for Entity Pairs in Wikipedia
15 years ago by @muehlburger
show all tags
awmhadoop
Hadoop
awm2010
jaccard
similarity
map
calculating
coeffcient
MapReduce
awmhadoopHadoopawm2010jaccardsimilaritymapcalculatingcoeffcientMapReduce
copydelete
- community post
- history of this post
1Mapreduce & Hadoop Algorithms in Academic Papers (3rd update)
http://atbrox.com/2010/05/08/mapreduce-hadoop-algorithms-in-academic-papers-may-2010-update/
15 years ago by @muehlburger
show all tags
awmhadoop
algorithms
awm2010
mapreduce
hadoop
papers
publications
awmhadoopalgorithmsawm2010mapreducehadooppaperspublications
copydelete
- community post
- history of this post
1HadoopMapReduce - Hadoop Wiki
Introduction This document describes how Map and Reduce operations are carried out in Hadoop. If you are not familiar with the Google [WWW] MapReduce programming model you should get acquainted with it first.
16 years ago by @carlfischer
show all tags
apache
cluster
mapreduce
hadoop
google
programming
apacheclustermapreducehadoopgoogleprogramming
copydelete
- community post
- history of this post
7Google Research Publication: MapReduce
MapReduce: Simplified Data Processing on Large Clusters
16 years ago by @carlfischer
show all tags
cluster
parallel
mapreduce
distributed
google
programming
clusterparallelmapreducedistributedgoogleprogramming
copydelete
- community post
- history of this post
1Hbase - Hadoop Wiki
HBase: Bigtable-like structured storage for Hadoop HDFS Just as Google's [WWW] Bigtable leverages the distributed data storage provided by the [WWW] Google File System, HBase provides Bigtable-like capabilities on top of Hadoop Core. Data is organized into tables, rows and columns. An Iterator-like interface is available for scanning through a row range (and of course there is the ability to retrieve a column value for a specific key). Any particular column may have multiple versions for the same row key.
16 years ago by @carlfischer
show all tags
java
cluster
database
mapreduce
distributed
bigtable
hadoop
hbase
javaclusterdatabasemapreducedistributedbigtablehadoophbase
copydelete
- community post
- history of this post
6Cascading
http://www.cascading.org/
15 years ago by @dolefulrabbit
show all tags
framework
java
cloud
grid
clustering
mapreduce
distributed
opensource
hadoop
frameworkjavacloudgridclusteringmapreducedistributedopensourcehadoop
copydelete
- community post
- history of this post
1Amazon Web Services Blog: Amazon Elastic MapReduce Now Available in Europe
http://aws.typepad.com/aws/2009/07/amazon-elastic-mapreduce-now-available-in-europe.html
15 years ago by @dolefulrabbit
show all tags
cloud
amazon
mapreduce
elastic
cloudamazonmapreduceelastic
copydelete
- community post
- history of this post
1Amazon Web Services Developer Community : Introduction to Amazon Elastic MapReduce
http://developer.amazonwebservices.com/connect/entry.jspa?externalID=2297
15 years ago by @dolefulrabbit
show all tags
mapreduce
howto
tutorial
introduction
mapreducehowtotutorialintroduction
copydelete
- community post
- history of this post
1MapReduce - Wikipedia, the free encyclopedia
http://en.wikipedia.org/wiki/Mapreduce
14 years ago by @stroeh
show all tags
mapreduce
hadoop
nosql
mapreducehadoopnosql
copydelete
- community post
- history of this post
4Jimmy Lin � Data-Intensive Text Processing with MapReduce
http://www.umiacs.umd.edu/~jimmylin/book.html
14 years ago by @schmitz
show all tags
book
mapreduce
free
download
textmining
bookmapreducefreedownloadtextmining
copydelete
- community post
- history of this post
1Python Cloud: Google's MapReduce in 98 Lines of Python
http://clouddbs.blogspot.com/2010/10/googles-mapreduce-in-98-lines-of-python.html
14 years ago by @brightbyte
show all tags
python
mapreduce
pythonmapreduce
copydelete
- community post
- history of this post
2MapReduce - MongoDB
http://www.mongodb.org/display/DOCS/MapReduce
13 years ago by @nosebrain
show all tags
mapreduce
mongodb
mapreducemongodb
copydelete
- community post
- history of this post
1SHARD Triple-Store | Download SHARD Triple-Store software for free at SourceForge.net
http://sourceforge.net/projects/shard-3store/
13 years ago by @zazi
show all tags
Triple_Store
Hadoop
Scaling
HDFS
Semantic_Web_Technology
Semantic_Web
MapReduce
SHARD
Triple_StoreHadoopScalingHDFSSemantic_Web_TechnologySemantic_WebMapReduceSHARD
copydelete
- community post
- history of this post
4MapReduce - Wikipedia, the free encyclopedia
http://en.wikipedia.org/wiki/MapReduce
11 years ago by @aenikata
show all tags
mapreduce
nosql
mapreducenosql
copydelete
- community post
- history of this post
3Running Hadoop On Ubuntu Linux (Single-Node Cluster) - Michael G. Noll
http://www.michael-noll.com/wiki/Running_Hadoop_On_Ubuntu_Linux_(Single-Node_Cluster)
15 years ago by @muehlburger
show all tags
java
linux
development
webscience
awm2010
mapreduce
howto
tutorial
hadoop
programming
javalinuxdevelopmentwebscienceawm2010mapreducehowtotutorialhadoopprogramming
copydelete
- community post
- history of this post
1Dryad - Microsoft Research
http://research.microsoft.com/en-us/projects/dryad/
13 years ago by @muehlburger
show all tags
cloud
cluster
computing
grid
data
mapreduce
distributed
hadoop
dryad
microsoft
research
scalability
cloudclustercomputinggriddatamapreducedistributedhadoopdryadmicrosoftresearchscalability
copydelete
- community post
- history of this post
1Karmasphere
http://www.karmasphere.com/
13 years ago by @muehlburger
show all tags
cloudcomputing
cloud
mapreduce
ide
opensource
hadoop
tools
cloudcomputingcloudmapreduceideopensourcehadooptools
copydelete
- community post
- history of this post
1HIVE: Data Warehousing & Analytics on Hadoop
http://www.slideshare.net/zshao/hive-data-warehousing-analytics-on-hadoop-presentation
13 years ago by @muehlburger
show all tags
analytics
database
facebook
clustering
datamining
mapreduce
hadoop
technology
presentations
hive
scalability
analyticsdatabasefacebookclusteringdataminingmapreducehadooptechnologypresentationshivescalability
copydelete
- community post
- history of this post
1CouchDB Vs MongoDB
http://www.slideshare.net/gabriele.lana/couchdb-vs-mongodb-2982288
12 years ago by @schmidt2
show all tags
slides
mapreduce
couchdb
mongodb
slidesmapreducecouchdbmongodb
copydelete
- community post
- history of this post

publications (hide)74
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

5Map-Reduce for Machine Learning on Multicore
C. Chu, S. Kim, Y. Lin, Y. Yu, G. Bradski, A. Ng, and K. Olukotun. (2006)
17 years ago by @jhammerb
show all tags
multicore
parallel
datamining
mapreduce
machinelearning
multicoreparalleldataminingmapreducemachinelearning
copydeleteadd this publication to your clipboard
1Google's MapReduce programming model &\#8212; Revisited
R. Lämmel. Sci. Comput. Program., 68 (3): 208--237 (2007)
17 years ago by @jhammerb
show all tags
retrieval
large_scale
parallel
grid
mapreduce
distributed
retrievallarge_scaleparallelgridmapreducedistributed
copydeleteadd this publication to your clipboard
25MapReduce: simplified data processing on large clusters
J. Dean, and S. Ghemawat. Communications of the ACM, 51 (1): 107--113 (2008)
7 years ago by @becker
show all tags
citedby:scholar:timestamp:2017-5-16
inthesis
mapreduce
sparktrails
diss
citedby:scholar:count:20792
paper:fastcor
citedby:scholar:timestamp:2017-5-16inthesismapreducesparktrailsdisscitedby:scholar:count:20792paper:fastcor
copydeleteadd this publication to your clipboard
1MapReduce - Konzept
T. König. (2010)
11 years ago by @muehsi
show all tags
Verfahren
Map
MapReduce
Reduce
VerfahrenMapMapReduceReduce
copydeleteadd this publication to your clipboard
3Google's MapReduce Programming Model - Revisited
R. Lämmel. Science of Computer Programming, 70 (1): 1--30 (2008)
13 years ago by @gron
show all tags
Survey
Overview
MapReduce
SurveyOverviewMapReduce
copydeleteadd this publication to your clipboard
1Privacy Preservation in Analyzing EHealth Records in Big Data Environment
E. Srimathi, and K. Apoorva. International Journal on Recent and Innovation Trends in Computing and Communication, 3 (4): 2421--2427 (April 2015)
9 years ago by @ijritcc
show all tags
Down
Specialization
Top
Anonymization
Data
BigData
k
Anonymity
MapReduce
DownSpecializationTopAnonymizationDataBigDatakAnonymityMapReduce
copydeleteadd this publication to your clipboard
1LARGE-SCALE DATA PROCESSING USING MAPREDUCE IN CLOUD COMPUTING ENVIRONMENT
S. Daneshyar, and M. Razmjoo. International Journal on Web Service Computing (IJWSC), 3 (4): 01-13 (December 2012)
9 days ago by @ijwsc
show all tags
cloud
parallel
computing
Hadoop
and
distributed
processing
MapReduce
cloudparallelcomputingHadoopanddistributedprocessingMapReduce
copydeleteadd this publication to your clipboard
1Churn Prediction using MapReduce and HBase
G. Limaye, J. Chaudhary, and P. Punjabi. International Journal on Recent and Innovation Trends in Computing and Communication, 3 (3): 1699--1703 (March 2015)
9 years ago by @ijritcc
show all tags
Hadoop
Prediction
C4.5
Churn
MapReduce
HBase
HadoopPredictionC4.5ChurnMapReduceHBase
copydeleteadd this publication to your clipboard
1State Space Exploration of RT Systems in the Cloud
C. Bellettini, M. Camilli, L. Capra, and M. Monga. CoRR, (2012)
12 years ago by @carlobellettini
show all tags
cloud
petrinet
mapreduce
analysis
cloudpetrinetmapreduceanalysis
copydeleteadd this publication to your clipboard
25MapReduce: simplified data processing on large clusters
J. Dean, and S. Ghemawat. Communications of the ACM, 51 (1): 107--113 (January 2008)
12 years ago by @jaeschke
show all tags
cloud
database
parallel
computing
data
mapreduce
processing
clouddatabaseparallelcomputingdatamapreduceprocessing
copydeleteadd this publication to your clipboard
1Fast Detection of Connected Components in Large Scale Graphs Using MapReduce
M. Ali Varamesh1. IOSR Journal of Engineering (IOSRJEN), (February 2014)
11 years ago by @agibhardt
show all tags
Graph
connected
components
Connected
Pegasus
Mapreduce
CC-MR
GraphconnectedcomponentsConnectedPegasusMapreduceCC-MR
copydeleteadd this publication to your clipboard
21MapReduce: Simplified Data Processing on Large Clusters
J. Dean, and S. Ghemawat. OSDI, (2004)
14 years ago by @flowolf
show all tags
hadoop-group
awmhadoop
awm2010
data
mapreduce
processing
hadoop
simplified
google
hadoop-groupawmhadoopawm2010datamapreduceprocessinghadoopsimplifiedgoogle
copydeleteadd this publication to your clipboard
21MapReduce: Simplified Data Processing on Large Clusters
J. Dean, and S. Ghemawat. (2004)
12 years ago by @telekoma
show all tags
framework
ws1213
mapreduce
master
seminar:dfs
uni
frameworkws1213mapreducemasterseminar:dfsuni
copydeleteadd this publication to your clipboard
4Dremel: Interactive Analysis of Web-scale Datasets
S. Melnik, A. Gubarev, J. Long, G. Romer, S. Shivakumar, M. Tolton, and T. Vassilakis. Communications of the ACM, (June 2011)
12 years ago by @nosebrain
show all tags
dremel
mapreduce
google
dremelmapreducegoogle
copydeleteadd this publication to your clipboard
25MapReduce: simplified data processing on large clusters
J. Dean, and S. Ghemawat. Communications of the ACM, 51 (1): 107--113 (2008)
15 years ago by @sb3000
show all tags
bigdata
mapreduce
bigdatamapreduce
copydeleteadd this publication to your clipboard
25MapReduce: Simplified Data Processing on Large Clusters
J. Dean, and S. Ghemawat. Communications of the ACM, 51 (1): 107-113 (2008)
17 years ago by @castagna
show all tags
mapreduce
mapreduce
copydeleteadd this publication to your clipboard
1CLUSTBIGFIM-FREQUENT ITEMSET MINING OF BIG DATA USING PRE-PROCESSING BASED ON MAPREDUCE FRAMEWORK
S. Gole, and B. Tidke. International Journal on Foundations of Computer Science & Technology (IJFCST), 5 (3): 11 (May 2015)
a year ago by @devino
show all tags
Association
Clustering
Itemset
Data
Rule
Frequent
MapReduce
Big
Mining
AssociationClusteringItemsetDataRuleFrequentMapReduceBigMining
copydeleteadd this publication to your clipboard
3Exhaustive search algorithms to mine subgroups on Big Data using Apache Spark
F. Padillo, J. Luna, and S. Ventura. Progress in Artificial Intelligence, (2017)
8 years ago by @becker
show all tags
subgroup
subgroups
parallel
spark
emm
mapreduce
distributed
subgroupsubgroupsparallelsparkemmmapreducedistributed
copydeleteadd this publication to your clipboard
1An RDF Metadata-Based Weighted Semantic Pagerank Algorithm
H. Hee-Gook Jun, Dong-Hyuk Im Kim. International Journal of Web & Semantic Technology (IJWesT), 7 (2): 11-24 (April 2016)
8 years ago by @laimbee
show all tags
PageRank,
RDF,
Semantic
MapReduce
Data,
Big
Web,
PageRank,RDF,SemanticMapReduceData,BigWeb,
copydeleteadd this publication to your clipboard
25MapReduce: simplified data processing on large clusters
J. Dean, and S. Ghemawat. Communications of the ACM, 51 (1): 107--113 (2008)
11 years ago by @thoni
show all tags
mapreduce
seminar
thema
ss2014
mapreduceseminarthemass2014
copydeleteadd this publication to your clipboard

⟨⟨
⟨
1
2
3
⟩
⟩⟩