flowolf > awm2010 | BibSonomy

bookmarks (hide)11
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

2Pig | Yahoo! Research
http://research.yahoo.com/node/90
14 years ago by @flowolf
show all tags
awm2010
awmhadoop
hadopp
pig
yahoo
awm2010awmhadoophadopppigyahoo
(0)
copydelete
- community post
- history of this post
2Parallel Machine Learning for Hadoop/Mapreduce – A Python Example
Parallel Machine Learning for Hadoop/Mapreduce – A Python Example
15 years ago by @flowolf
show all tags
Python
awm2010
awmhadoop
hadoop
hadoop-group
learning
machine
mapreduce
parallel
Pythonawm2010awmhadoophadoophadoop-grouplearningmachinemapreduceparallel
(0)
copydelete
- community post
- history of this post
2Home - dumbo - GitHub
Python module that allows you to easily write and run Hadoop programs.
15 years ago by @flowolf
show all tags
awm2010
awmhadoop
dumbo
github
hadoop
hadoop-group
python
awm2010awmhadoopdumbogithubhadoophadoop-grouppython
(0)
copydelete
- community post
- history of this post
3Running Hadoop On Ubuntu Linux (Single-Node Cluster) - Michael G. Noll
The home page of Michael G. Noll - working for a Safer Internet
15 years ago by @flowolf
show all tags
awm2010
awmhadoop
hadoop
hadoop-group
howto
linux
michael
noll
running
single
ubuntu
awm2010awmhadoophadoophadoop-grouphowtolinuxmichaelnollrunningsingleubuntu
(0)
copydelete
- community post
- history of this post
2Hadoop - Michael G. Noll
The home page of Michael G. Noll - working for a Safer Internet
15 years ago by @flowolf
show all tags
awm2010
awmhadoop
hadoop
hadoop-group
howto
michael
noll
awm2010awmhadoophadoophadoop-grouphowtomichaelnoll
(0)
copydelete
- community post
- history of this post
3Jimmy Lin
Jimmy Lin
15 years ago by @flowolf
show all tags
MapReduce
awm2010
awmhadoop
data-processing
google
hadoop
hadoop-group
jimmy
lin
MapReduceawm2010awmhadoopdata-processinggooglehadoophadoop-groupjimmylin
(0)
copydelete
- community post
- history of this post
2Calculating the Jaccard Similarity Coeffcient with Map Reduce for Entity Pairs in Wikipedia
Calculating the Jaccard Similarity Coeffcient with Map Reduce for Entity Pairs in Wikipedia
15 years ago by @flowolf
show all tags
Hadoop
MapReduce
awm2010
awmhadoop
calculating
coeffcient
hadoop-group
jaccard
map
similarity
HadoopMapReduceawm2010awmhadoopcalculatingcoeffcienthadoop-groupjaccardmapsimilarity
(0)
copydelete
- community post
- history of this post
5Data-Intensive Text Processing with MapReduce
Our world is being revolutionized by data-driven methods: access to large amounts of data has generated new insights and opened exciting new opportunities in commerce, science, and computing applications. Processing the enormous quantities of data necessary for these advances requires large clusters, making distributed computing paradigms more crucial than ever. MapReduce is a programming model for expressing distributed computations on massive datasets and an execution framework for large-scale data processing on clusters of commodity servers. The programming model provides an easy-to-understand abstraction for designing scalable algorithms, while the execution framework transparently handles many system-level details, ranging from scheduling to synchronization to fault tolerance. This book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning. We introduce the notion of MapReduce design patterns, which represent general reusable solutions to commonly occurring problems across a variety of problem domains. This book not only intends to help the reader "think in MapReduce", but also discusses limitations of the programming model as well.
15 years ago by @flowolf
show all tags
awm2010
awmhadoop
data
hadoop
hadoop-group
intensive
mapreduce
processing
text
awm2010awmhadoopdatahadoophadoop-groupintensivemapreduceprocessingtext
(0)
copydelete
- community post
- history of this post
3Data-Intensive Information Processing Applications
This course is about scalable approaches to processing large amounts of information (terabytes and even petabytes). We focus mostly on MapReduce, which is presently the most accessible and practical means of computing at this scale, but will discuss other approaches as well.
15 years ago by @flowolf
show all tags
MapReduce
applications
awm2010
awmhadoop
data
hadoop
hadoop-group
information
intensive
processing
MapReduceapplicationsawm2010awmhadoopdatahadoophadoop-groupinformationintensiveprocessing
(0)
copydelete
- community post
- history of this post
19Apache Hadoop
http://hadoop.apache.org/
15 years ago by @flowolf
show all tags
AWM2010
apache
awmhadoop
hadoop
hadoop-group
AWM2010apacheawmhadoophadoophadoop-group
(0)
copydelete
- community post
- history of this post
2map reduce
http://www.usenix.org/events/osdi04/tech/full_papers/dean/dean_html/
15 years ago by @flowolf
show all tags
awm2010
awmhadoop
google
hadoop
hadoop-group
map
mapreduce
reduce
awm2010awmhadoopgooglehadoophadoop-groupmapmapreducereduce
(0)
copydelete
- community post
- history of this post

⟨⟨
⟨
1
⟩
⟩⟩

publications (hide)10
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

3Efﬁcient Parallel Set-Similarity Joins Using MapReduce
V. Rares, C. J., and L. Chen. (2010)
14 years ago by @flowolf
show all tags
awm2010
awmhadoop
efficient
hadoop
joins
parallel
set
similarity
awm2010awmhadoopefficienthadoopjoinsparallelsetsimilarity
(0)
copydeleteadd this publication to your clipboard
2Building a high-level dataflow system on top of Map-Reduce: the Pig experience
A. Gates, O. Natkovich, S. Chopra, P. Kamath, S. Narayanamurthy, C. Olston, B. Reed, S. Srinivasan, and U. Srivastava. Proc. VLDB Endow., 2 (2): 1414--1425 (2009)
14 years ago by @flowolf
show all tags
awm2010
awmhadoop
building
dataflow
hadoop-group
high
level
system
awm2010awmhadoopbuildingdataflowhadoop-grouphighlevelsystem
(0)
copydeleteadd this publication to your clipboard
4Pig latin: a not-so-foreign language for data processing
C. Olston, B. Reed, U. Srivastava, R. Kumar, and A. Tomkins. SIGMOD '08: Proceedings of the 2008 ACM SIGMOD international conference on Management of data, page 1099--1110. New York, NY, USA, ACM, (2008)
14 years ago by @flowolf
show all tags
awm2010
awmhadoop
data
foreign
hadoop
hadoop-group
language
latin
pig
awm2010awmhadoopdataforeignhadoophadoop-grouplanguagelatinpig
(0)
copydeleteadd this publication to your clipboard
3Map-reduce-merge: simplified relational data processing on large clusters
H. chih Yang, A. Dasdan, R. Hsiao, and D. Parker. SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data, page 1029--1040. New York, NY, USA, ACM, (2007)
14 years ago by @flowolf
show all tags
algorithms
awm2010
awmhadoop
distributed
grid
hadoop
hadoop-group
map
mapreduce
merge
parallel
reduce
relational
algorithmsawm2010awmhadoopdistributedgridhadoophadoop-groupmapmapreducemergeparallelreducerelational
(0)
copydeleteadd this publication to your clipboard
2Terabyte Sort on Apache Hadoop
O. Yahoo!. (Mai 2008 2008)
14 years ago by @flowolf
show all tags
awm2010
awmhadoop
hadoop-group
sort
terabyte
awm2010awmhadoophadoop-groupsortterabyte
(0)
copydeleteadd this publication to your clipboard
6Graph Twiddling in a MapReduce World
J. Cohen. Computing in Science and Engineering, (2009)
14 years ago by @flowolf
show all tags
awm2010
awmhadoop
cohen
graph
hadoop-group
mapreduce
twiddling
world
awm2010awmhadoopcohengraphhadoop-groupmapreducetwiddlingworld
(0)
copydeleteadd this publication to your clipboard
7Hadoop: The Definitive Guide
T. White. O'Reilly, first edition edition, (June 2009)
14 years ago by @flowolf
show all tags
awm2010
awmhadoop
definitive
guide
hadoop
hadoop-group
white
awm2010awmhadoopdefinitiveguidehadoophadoop-groupwhite
(0)
copydeleteadd this publication to your clipboard
25MapReduce: Simplified Data Processing on Large Clusters
J. Dean, and S. Ghemawat. OSDI, (2004)
14 years ago by @flowolf
show all tags
awm2010
awmhadoop
data
google
hadoop
hadoop-group
mapreduce
processing
simplified
awm2010awmhadoopdatagooglehadoophadoop-groupmapreduceprocessingsimplified
(0)
copydeleteadd this publication to your clipboard
3Storage and Retrieval of Large RDF Graph Using Hadoop and MapReduce
M. Husain, P. Doshi, L. Khan, and B. Thuraisingham. Cloud Computing, 5931, chapter 72, Springer Berlin Heidelberg, Berlin, Heidelberg, (2009)
14 years ago by @flowolf
show all tags
MapReduce
awm2010
awmhadoop
graph
hadoop
hadoop-group
rdf
retrieval
storage
MapReduceawm2010awmhadoopgraphhadoophadoop-grouprdfretrievalstorage
(0)
copydeleteadd this publication to your clipboard
4Max-cover in map-reduce
F. Chierichetti, R. Kumar, and A. Tomkins. WWW '10: Proceedings of the 19th international conference on World wide web, page 231--240. New York, NY, USA, ACM, (2010)
15 years ago by @flowolf
show all tags
MapReduce
awm2010
awmhadoop
cover
hadoop
hadoop-group
map
max
reduce
MapReduceawm2010awmhadoopcoverhadoophadoop-groupmapmaxreduce
(0)
copydeleteadd this publication to your clipboard

⟨⟨
⟨
1
⟩
⟩⟩

BibSonomy

bookmarks (hide)11
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

2Pig | Yahoo! Research

2Parallel Machine Learning for Hadoop/Mapreduce – A Python Example

2Home - dumbo - GitHub

3Running Hadoop On Ubuntu Linux (Single-Node Cluster) - Michael G. Noll

2Hadoop - Michael G. Noll

3Jimmy Lin

2Calculating the Jaccard Similarity Coeffcient with Map Reduce for Entity Pairs in Wikipedia

5Data-Intensive Text Processing with MapReduce

3Data-Intensive Information Processing Applications

19Apache Hadoop

2map reduce

publications (hide)10
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

3Efﬁcient Parallel Set-Similarity Joins Using MapReduce

2Building a high-level dataflow system on top of Map-Reduce: the Pig experience

4Pig latin: a not-so-foreign language for data processing

3Map-reduce-merge: simplified relational data processing on large clusters

2Terabyte Sort on Apache Hadoop

6Graph Twiddling in a MapReduce World

7Hadoop: The Definitive Guide

25MapReduce: Simplified Data Processing on Large Clusters

3Storage and Retrieval of Large RDF Graph Using Hadoop and MapReduce

4Max-cover in map-reduce

browse

related tags

concepts

tags

bookmarks (hide)11 displayallbookmarks onlybookmarks per page5102050100 sort byadded attitle RSSBibTeXXML

publications (hide)10 displayallpublications onlypublications per page5102050100 sort byadded attitleauthorpublication dateentry typehelp for advanced sorting... RSSBibTeXRDFmore...

browse

related tags

tags

bookmarks (hide)11
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

publications (hide)10
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...