tag :: MapReduce | BibSonomy

bookmarks (hide)166
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

1Readings in Database Systems, 5th Edition (2015) | Hacker News
https://news.ycombinator.com/item?id=23514686
4 years ago by @bshanks
show all tags
aws
db
mapreduce
sql
awsdbmapreducesql
(0)
copydelete
- community post
- history of this post
1How-to: Tune MapReduce Parallelism in Apache Pig Jobs - Cloudera Engineering Blog
https://blog.cloudera.com/blog/2015/07/how-to-tune-mapreduce-parallelism-in-apache-pig-jobs/
5 years ago by @schmitz
show all tags
hadoop
mapreduce
pig
hadoopmapreducepig
(0)
copydelete
- community post
- history of this post
2MapReduce - MongoDB
http://www.mongodb.org/display/DOCS/MapReduce
8 years ago by @genbob
show all tags
mapreduce
mongodb
mapreducemongodb
(0)
copydelete
- community post
- history of this post
2About Hadoop
http://lucene.apache.org/hadoop/about.html
8 years ago by @bshanks
show all tags
Bookmarks
cluster
lucene
map
reduce
mapReduce
dfs
BookmarksclusterlucenemapreducemapReducedfs
(0)
copydelete
- community post
- history of this post
5Amazon Elastic MapReduce
http://aws.amazon.com/elasticmapreduce/
8 years ago by @bshanks
show all tags
hadoop
mapReduce
aws
amazon
ec2
s3
cloud
hadoopmapReduceawsamazonec2s3cloud
(0)
copydelete
- community post
- history of this post
1Raj's Blog (nlake44) - Cloud Nine
http://nlake44.posterous.com/
8 years ago by @bshanks
show all tags
gae
fantasm
mapReduce
pipeline
gaefantasmmapReducepipeline
(0)
copydelete
- community post
- history of this post
1Implementing Workflows on Google App Engine with Fantasm - Google App Engine - Google Code
http://code.google.com/appengine/articles/fantasm.html
8 years ago by @bshanks
show all tags
fantasm
gae
fsm
finitestatemachine
mapreduce
queue
workflow
fantasmgaefsmfinitestatemachinemapreducequeueworkflow
(0)
copydelete
- community post
- history of this post
1Migrating GAE Datastore Schema | Idea Machine
http://www.brankovukelic.com/post/6485562217/migrating-gae-datastore-schema
8 years ago by @bshanks
show all tags
gae
migrate
mapreduce
gaemigratemapreduce
(0)
copydelete
- community post
- history of this post
1Brisk – Apache Hadoop™ powered by Cassandra | DataStax
http://www.datastax.com/products/brisk
8 years ago by @bshanks
show all tags
hadoop
mapreduce
cassandra
hadoopmapreducecassandra
(0)
copydelete
- community post
- history of this post
1Distributed Keyword Search over RDF via MapReduce
http://2014.eswc-conferences.org/sites/default/files/papers/paper_133.pdf
8 years ago by @gpublio
show all tags
dataset
keyword
mapreduce
search
datasetkeywordmapreducesearch
(0)
copydelete
- community post
- history of this post
311. Determine YARN and MapReduce Memory Configuration Settings - Hortonworks Data Platform
http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.0.6.0/bk_installing_manually_book/content/rpm-chap1-11.html
10 years ago by @jaeschke
show all tags
cluster
configuration
hadoop
mapreduce
memory
yarn
clusterconfigurationhadoopmapreducememoryyarn
(0)
copydelete
- community post
- history of this post
311. Determine YARN and MapReduce Memory Configuration Settings - Hortonworks Data Platform
http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.0.6.0/bk_installing_manually_book/content/rpm-chap1-11.html
10 years ago by @becker
show all tags
configure
discs
hadoop
mapreduce
memory
vcores
yarn
configurediscshadoopmapreducememoryvcoresyarn
(0)
copydelete
- community post
- history of this post
1Data clustering using map reduce
Use of MapReduce paradigm to design Clustering algorithms
10 years ago by @kw
show all tags
2twitter
bigdata
clustering
datamining
mapreduce
2twitterbigdataclusteringdataminingmapreduce
(0)
copydelete
- community post
- history of this post
1The Elephant was a Trojan Horse: On the Death of Map-Reduce at Google : Paper Trail
Map-Reduce is on its way out. But we shouldn’t measure its importance in the number of bytes it crunches, but the fundamental shift in data processing architectures it helped popularise.
10 years ago by @jaeschke
show all tags
analysis
bigdata
data
dremel
drill
dryad
exploration
flow
google
mapreduce
processing
analysisbigdatadatadremeldrilldryadexplorationflowgooglemapreduceprocessing
(0)
copydelete
- community post
- history of this post
37 Deadly Hadoop Misconfigurations
http://archive.apachecon.com/na2013/presentations/27-Wednesday/Big_Data/14:45-7_Deadly_Hadoop_Misconfigurations-Kathleen_Ting/HadoopTroubleshootingApacheCon.pdf
10 years ago by @jaeschke
show all tags
bigdata
cluster
configuration
hadoop
l3s
mapreduce
bigdataclusterconfigurationhadoopl3smapreduce
(0)
copydelete
- community post
- history of this post
1Document Similarity Self-Join with MapReduce
http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=5694030&url=http%3A%2F%2Fieeexplore.ieee.org%2Fxpls%2Fabs_all.jsp%3Farnumber%3D5694030
10 years ago by @albinzehe
show all tags
DocumentSimilarity
MapReduce
ba-zehe
DocumentSimilarityMapReduceba-zehe
(0)
copydelete
- community post
- history of this post
1Cloud9: A Hadoop toolkit for working with big data
http://lintool.github.io/Cloud9/
11 years ago by @jaeschke
show all tags
cluster
hadoop
mapreduce
pagerank
tool
trec
warc
wikipedia
clusterhadoopmapreducepageranktooltrecwarcwikipedia
(0)
copydelete
- community post
- history of this post
2Mapreduce & Hadoop Algorithms in Academic Papers (4th update – May 2011)
http://atbrox.com/2011/05/16/mapreduce-hadoop-algorithms-in-academic-papers-4th-update-may-2011/
11 years ago by @thoni
show all tags
hadoop
mapreduce
publications
seminar
ss14
hadoopmapreducepublicationsseminarss14
(0)
copydelete
- community post
- history of this post
1Hadoop matrix transposistion
http://www.opensourceconnections.com/2013/03/14/finally-a-hadoop-hello-world-that-isnt-a-lame-word-count/
11 years ago by @daschloer
show all tags
Hadoop
MapReduce
example
matrix
transposistion
HadoopMapReduceexamplematrixtransposistion
(0)
copydelete
- community post
- history of this post
2MapReduce Hadoop algorithms in academic papers
http://atbrox.com/2011/05/16/mapreduce-hadoop-algorithms-in-academic-papers-4th-update-may-2011/
11 years ago by @daschloer
show all tags
Examples
Hadoop
MapReduce
ExamplesHadoopMapReduce
(0)
copydelete
- community post
- history of this post
1Rhipe / MapReduce / Hadoop Tutorial
http://ml.stat.purdue.edu/rhafen/rhipe/#setup
11 years ago by @praveen
show all tags
R
hadoop
mapreduce
rhipe
Rhadoopmapreducerhipe
(0)
copydelete
- community post
- history of this post
1Introduction to the Rossmann Hadoop Cluster
http://www.stat.purdue.edu/~sguha/rossmann.intro.html#sec-2
11 years ago by @praveen
show all tags
mapreduce
r
mapreducer
(0)
copydelete
- community post
- history of this post
1Writing An Hadoop MapReduce Program In Python - Michael G. Noll
How to write an Hadoop MapReduce program in Python with the Hadoop Streaming API
11 years ago by @praveen
show all tags
hadoop
mapreduce
python
hadoopmapreducepython
(0)
copydelete
- community post
- history of this post
2Parallel MapReduce in Python in Ten Minutes | Cvet's Blog
Almost everyone has heard of Google's MapReduce framework, but very few have ever hacked around with the idea of map and reduce. These two idioms are borrowed from functional programming, and form the basis of Google's framework. Although Python is not a functional programming language, it has built-in support for both of these concepts. A…
11 years ago by @praveen
show all tags
mapreduce
parallel
python
mapreduceparallelpython
(0)
copydelete
- community post
- history of this post
1MapReduce Python Tutorial
mapreduce python, mapreduce tutorial, mapreduce tutorial python, mrjob tutorial, python mrjob, tutorial mapreduce python, tutorial mrjob
11 years ago by @praveen
show all tags
mapreduce
python
mapreducepython
(0)
copydelete
- community post
- history of this post
3Apache Spark™ - Lightning-Fast Cluster Computing
https://spark.incubator.apache.org/
11 years ago by @nosebrain
show all tags
apache
cluster
computing
hadoop
mapreduce
spark
apacheclustercomputinghadoopmapreducespark
(0)
copydelete
- community post
- history of this post
4MapReduce - Wikipedia, the free encyclopedia
http://en.wikipedia.org/wiki/MapReduce
11 years ago by @yasirkhan
show all tags
africa
encyclopedia
mapreduce
wikipedia
africaencyclopediamapreducewikipedia
(0)
copydelete
- community post
- history of this post
4MapReduce - Wikipedia, the free encyclopedia
http://en.wikipedia.org/wiki/MapReduce
11 years ago by @aenikata
show all tags
mapreduce
nosql
mapreducenosql
(0)
copydelete
- community post
- history of this post
1Ask HN: To everybody who uses MapReduce: what problems do you solve? | Hacker News
https://news.ycombinator.com/item?id=6706545
11 years ago by @sac
show all tags
bigdate
mapreduce
usecase
bigdatemapreduceusecase
(0)
copydelete
- community post
- history of this post
1org.apache.hadoop.hbase.mapreduce (HBase 0.97.0-SNAPSHOT API)
http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/mapreduce/package-summary.html#sink
11 years ago by @dbenz
show all tags
documentation
hbas
mapred
mapreduce
documentationhbasmapredmapreduce
(0)
copydelete
- community post
- history of this post
1hbase - Hadoop Mapreduce tasktrackers keep ignoring HADOOP_CLASSPATH. Zookeeper trying to connect to localhost rather than cluster address - Stack Overflow
http://stackoverflow.com/questions/18232473/hadoop-mapreduce-tasktrackers-keep-ignoring-hadoop-classpath-zookeeper-trying-t
11 years ago by @dbenz
show all tags
hbase
mapreduce
trouble
hbasemapreducetrouble
(0)
copydelete
- community post
- history of this post
1Translate SQL to MongoDB MapReduce • myNoSQL
http://nosql.mypopescu.com/post/392418792/translate-sql-to-mongodb-mapreduce
11 years ago by @sac
show all tags
cheatsheet
mapreduce
mongodb
query
sql
cheatsheetmapreducemongodbquerysql
(0)
copydelete
- community post
- history of this post
2Parallel MapReduce in Python in Ten Minutes | Cvet's Blog
a bookmark
11 years ago by @schmidt2
show all tags
mapreduce
parallel_computing
python
tutorial
mapreduceparallel_computingpythontutorial
(0)
copydelete
- community post
- history of this post
1Matrix methods for Hadoop
A quick tutorial on how to tackle problems from a matrix-vector perspective in Hadoop
12 years ago by @jaeschke
show all tags
hadoop
mapreduce
matrix
parallel
hadoopmapreducematrixparallel
(0)
copydelete
- community post
- history of this post
15The Julia Language
http://julialang.org/
12 years ago by @jaeschke
show all tags
cluster
hadoop
hpc
julia
language
mapreduce
programming
todo
clusterhadoophpcjulialanguagemapreduceprogrammingtodo
(0)
copydelete
- community post
- history of this post
1Mellanox Technologies: Hadoop
http://www.mellanox.com/content/pages.php?pg=hadoop
12 years ago by @jaeschke
show all tags
cluster
hadoop
hardware
mapreduce
clusterhadoophardwaremapreduce
(0)
copydelete
- community post
- history of this post
1Hadoop Tutorial - YDN
Module 7: Managing a Hadoop Cluster
12 years ago by @jaeschke
show all tags
cluster
hadoop
hardware
mapreduce
clusterhadoophardwaremapreduce
(0)
copydelete
- community post
- history of this post
2Cloudera’s Support Team Shares Some Basic Hardware Recommendations | Apache Hadoop for the Enterprise | Cloudera
Cloudera offers enterprises a powerful new data platform built on the popular Apache Hadoop open-source software package.
12 years ago by @jaeschke
show all tags
cloudera
cluster
hadoop
hardware
mapreduce
clouderaclusterhadoophardwaremapreduce
(0)
copydelete
- community post
- history of this post
1How to Include Third-Party Libraries in Your Map-Reduce Job | Apache Hadoop for the Enterprise | Cloudera
http://blog.cloudera.com/blog/2011/01/how-to-include-third-party-libraries-in-your-map-reduce-job/
12 years ago by @dbenz
show all tags
hadoop
howto
library
libs
mapreduce
hadoophowtolibrarylibsmapreduce
(0)
copydelete
- community post
- history of this post
1Peregrine - Fast Map Reduce for Iterative Computation - YouTube
http://www.youtube.com/watch?feature=player_detailpage&v=-_iCo1fFSeQ&noredirect=1#t=14s
12 years ago by @dbenz
show all tags
bigdata
mapreduce
peregrine
talk
bigdatamapreduceperegrinetalk
(0)
copydelete
- community post
- history of this post
1High Scalability - High Scalability - Peregrine - A Map Reduce Framework for Iterative and Pipelined Jobs
http://highscalability.com/blog/2012/1/12/peregrine-a-map-reduce-framework-for-iterative-and-pipelined.html
12 years ago by @dbenz
show all tags
bigdata
facebook
mapreduce
peregrine
bigdatafacebookmapreduceperegrine
(0)
copydelete
- community post
- history of this post
1Peregrine: Home
Peregrine is a map reduce framework designed for running iterative jobs across partitions of data. Peregrine is designed to be FAST for executing map reduce jobs by supporting a number of optimizations and features not present in other map reduce frameworks.
12 years ago by @dbenz
show all tags
bigdata
facebook
mapreduce
peregrine
bigdatafacebookmapreduceperegrine
(0)
copydelete
- community post
- history of this post
1Under the Hood: Scheduling MapReduce jobs more efficiently with Corona
http://www.facebook.com/notes/facebook-engineering/under-the-hood-scheduling-mapreduce-jobs-more-efficiently-with-corona/10151142560538920
12 years ago by @dbenz
show all tags
bigdata
corona
explanation
facebook
mapreduce
bigdatacoronaexplanationfacebookmapreduce
(0)
copydelete
- community post
- history of this post
1Testing Hadoop Map Reduce jobs - Nube Technologies
Testing Hadoop Map Reduce jobs
12 years ago by @dbenz
show all tags
bigdata
hadoop
junit
mapreduce
mockito
qa
qs
testing
bigdatahadoopjunitmapreducemockitoqaqstesting
(0)
copydelete
- community post
- history of this post
2Apache MRUnit TM
Apache MRUnit ™ is a Java library that helps developers unit test Apache Hadoop map reduce jobs.
12 years ago by @dbenz
show all tags
bigdata
hadoop
mapreduce
mrunit
qa
qs
testing
bigdatahadoopmapreducemrunitqaqstesting
(0)
copydelete
- community post
- history of this post
1MRQL
MRQL (the Map-Reduce Query Language) is an SQL-like query language for map-reduce computations. It is implemented on top of Apache's Hadoop. MRQL is powerful enough to express most common data analysis tasks over many different kinds of raw data, including hierarchical data and nested collections, such as XML data. It is more powerful than other current languages, such as Hive and Pig Latin, since it can operate on more complex data and supports more powerful query constructs, thus eliminating the need for using explicit map-reduce code.
12 years ago by @sac
show all tags
hadoop
mapreduce
query
hadoopmapreducequery
(0)
copydelete
- community post
- history of this post
2MapReduce-book-final.pdf (application/pdf-Objekt)
http://www.umiacs.umd.edu/~jimmylin/MapReduce-book-final.pdf
12 years ago by @sac
show all tags
ebook
graph
hadoop
mapreduce
ebookgraphhadoopmapreduce
(0)
copydelete
- community post
- history of this post
1xadoop | Free software downloads at SourceForge.net
Xadoop is a project that combines XQuery and Hadoop. It aims to automatically parallelize a given XQuery program and run it on Hadoop.
12 years ago by @sac
show all tags
hadoop
mapreduce
xadoop
xml
xquery
hadoopmapreducexadoopxmlxquery
(0)
copydelete
- community post
- history of this post
2Apache MRUnit TM
http://mrunit.apache.org/
12 years ago by @nosebrain
show all tags
hadoop
junit
mapreduce
mrunit
hadoopjunitmapreducemrunit
(0)
copydelete
- community post
- history of this post
1CouchDB Vs MongoDB
http://www.slideshare.net/gabriele.lana/couchdb-vs-mongodb-2982288
12 years ago by @schmidt2
show all tags
couchdb
mapreduce
mongodb
slides
couchdbmapreducemongodbslides
(0)
copydelete
- community post
- history of this post
1Waving Not Drowning: Writing a custom PIG Loader
http://arunxjacob.blogspot.com/2010/12/writing-custom-pig-loader.html
13 years ago by @muehlburger
show all tags
customloader
hadoop
loader
loadfunc
mapreduce
mt
pig
customloaderhadooploaderloadfuncmapreducemtpig
(0)
copydelete
- community post
- history of this post
1mrjs - Browser-based distributed computing -- proof of concept / work in progress - Google Project Hosting
Browser-based distributed computing -- proof of concept / work in progress
13 years ago by @sac
show all tags
browser
client
computing
js
mapreduce
browserclientcomputingjsmapreduce
(0)
copydelete
- community post
- history of this post
1Papers in MapReduce Applications | Mendeley Group
A list of Group papers for MapReduce Applications. Articles include: 'Nephele: Genotyping via Complete Composition Vectors and MapReduce' by Marc E Colosimo, Matthew W Peterson, Scott Mardis et al., 'Clustering Very Large Multi-dimensional Datasets with MapReduce' by Robson L F Cordeiro, Julio López, Christos Faloutsos and 'Yahoo! Research Small World Experiment' by Yahoo!, Facebook
13 years ago by @sac
show all tags
applications
mapreduce
papers
applicationsmapreducepapers
(0)
copydelete
- community post
- history of this post
1boto: A Python interface to Amazon Web Services — boto v2.2.2-dev
http://boto.cloudhackers.com/en/latest/index.html
13 years ago by @muehlburger
show all tags
amazon
aws
boto
hadoop
mapreduce
mt
python
amazonawsbotohadoopmapreducemtpython
(0)
copydelete
- community post
- history of this post
3MapReduce Patterns, Algorithms, and Use Cases « Highly Scalable Blog
http://highlyscalable.wordpress.com/2012/02/01/mapreduce-patterns/
13 years ago by @muehlburger
show all tags
algorithms
design-patterns
hadoop
mapreduce
algorithmsdesign-patternshadoopmapreduce
(0)
copydelete
- community post
- history of this post
3MapReduce Patterns, Algorithms, and Use Cases « Highly Scalable Blog
http://highlyscalable.wordpress.com/2012/02/01/mapreduce-patterns/
13 years ago by @psinger
show all tags
blog
hadoop
mapreduce
toread
bloghadoopmapreducetoread
(0)
copydelete
- community post
- history of this post
1Lecture Slides — Systems @ ETH
http://www.systems.ethz.ch/education/past-courses/hs08/map-reduce/map-reduce/lecture-slides
13 years ago by @muehlburger
show all tags
eth
mapreduce
zuerich
ethmapreducezuerich
(0)
copydelete
- community post
- history of this post
1Networked Information Systems — Systems @ ETH
http://www.systems.ethz.ch/education/courses/fs09/NIS/index
13 years ago by @muehlburger
show all tags
eth-zürich
hadoop
mapreduce
tutorial
eth-zürichhadoopmapreducetutorial
(0)
copydelete
- community post
- history of this post
1Hadoop, Pig, and Twitter (NoSQL East 2009)
http://www.slideshare.net/kevinweil/hadoop-pig-and-twitter-nosql-east-2009
13 years ago by @muehlburger
show all tags
data
database
datamining
hadoop
mapreduce
programming
twitter
datadatabasedatamininghadoopmapreduceprogrammingtwitter
(0)
copydelete
- community post
- history of this post
1Natural Language Processing with Hadoop and Python « Cloudera » Apache Hadoop for the Enterprise
http://www.cloudera.com/blog/2010/03/natural-language-processing-with-hadoop-and-python/
13 years ago by @muehlburger
show all tags
ai
cloudera
hadoop
language
mapreduce
nlp
nltk
python
semantic
toread
aiclouderahadooplanguagemapreducenlpnltkpythonsemantictoread
(0)
copydelete
- community post
- history of this post
1Last.fm – the Blog · Python + Hadoop = Flying Circus Elephant
http://blog.last.fm/2008/05/29/python-hadoop-flying-circus-elephant
13 years ago by @muehlburger
show all tags
clustering
distributed
dumbo
hadoop
mapreduce
opensource
programming
python
scalability
scaling
clusteringdistributeddumbohadoopmapreduceopensourceprogrammingpythonscalabilityscaling
(0)
copydelete
- community post
- history of this post
1more.pdf (application/pdf-Objekt)
http://kheafield.com/professional/google/more.pdf
13 years ago by @muehlburger
show all tags
algorithm
clustering
datamining
filetype:pdf
hadoop
k-means
mapreduce
media:document
algorithmclusteringdataminingfiletype:pdfhadoopk-meansmapreducemedia:document
(0)
copydelete
- community post
- history of this post
1Karmasphere
http://www.karmasphere.com/
13 years ago by @muehlburger
show all tags
cloud
cloudcomputing
hadoop
ide
mapreduce
opensource
tools
cloudcloudcomputinghadoopidemapreduceopensourcetools
(0)
copydelete
- community post
- history of this post
3Dryad - Microsoft Research
http://research.microsoft.com/en-us/projects/dryad/
13 years ago by @muehlburger
show all tags
cloud
cluster
computing
data
distributed
dryad
grid
hadoop
mapreduce
microsoft
research
scalability
cloudclustercomputingdatadistributeddryadgridhadoopmapreducemicrosoftresearchscalability
(0)
copydelete
- community post
- history of this post
2MapReduce-book-final.pdf (application/pdf-Objekt)
http://www.umiacs.umd.edu/~jimmylin/MapReduce-book-final.pdf
13 years ago by @muehlburger
show all tags
book
filetype:pdf
hadoop
mapreduce
media:document
programming
bookfiletype:pdfhadoopmapreducemedia:documentprogramming
(0)
copydelete
- community post
- history of this post
1Distributed data processing with Hadoop, Part 1: Getting started
http://www.ibm.com/developerworks/linux/library/l-hadoop-1/
13 years ago by @muehlburger
show all tags
awm2010
cloud
cloudera
hadoop
mapreduce
awm2010cloudclouderahadoopmapreduce
(0)
copydelete
- community post
- history of this post
1Hadoop, BigData and Cassandra with Jonathan Ellis « All Things Hadoop
http://allthingshadoop.com/2010/05/17/hadoop-bigdata-cassandra-a-talk-with-jonathan-ellis/
13 years ago by @muehlburger
show all tags
awm2010
cassandra
cloudera
hadoop
mapreduce
nosql
awm2010cassandraclouderahadoopmapreducenosql
(0)
copydelete
- community post
- history of this post
1HIVE: Data Warehousing & Analytics on Hadoop
http://www.slideshare.net/zshao/hive-data-warehousing-analytics-on-hadoop-presentation
13 years ago by @muehlburger
show all tags
analytics
clustering
database
datamining
facebook
hadoop
hive
mapreduce
presentations
scalability
technology
analyticsclusteringdatabasedataminingfacebookhadoophivemapreducepresentationsscalabilitytechnology
(0)
copydelete
- community post
- history of this post
1Sanjay Sharma’s Weblog
http://indoos.wordpress.com/
13 years ago by @muehlburger
show all tags
business-intelligence
hadoop
mapreduce
weblog
business-intelligencehadoopmapreduceweblog
(0)
copydelete
- community post
- history of this post
1The LinkedIn Blog » LinkedIn, Apache Pig, and Open Source «
http://blog.linkedin.com/2010/07/01/linkedin-apache-pig/
13 years ago by @muehlburger
show all tags
hadoop
mapreduce
pig
hadoopmapreducepig
(0)
copydelete
- community post
- history of this post
1Data Recipes: Graph Processing With Apache Pig
http://thedatachef.blogspot.com/2011/01/graph-processing-with-apache-pig.html
13 years ago by @muehlburger
show all tags
data
graph
hadoop
mapreduce
pig
r
datagraphhadoopmapreducepigr
(0)
copydelete
- community post
- history of this post
1Gilt
http://www.10gen.com/conferences/mongosf2010#gilt
13 years ago by @muehlburger
show all tags
ecommerce
hummingbird
mapreduce
mongodb
ecommercehummingbirdmapreducemongodb
(0)
copydelete
- community post
- history of this post
3MapReduce Patterns, Algorithms, and Use Cases « Highly Scalable
http://highlyscalable.wordpress.com/2012/02/01/mapreduce-patterns/
13 years ago by @sac
show all tags
algorithms
mapreduce
patterns
algorithmsmapreducepatterns
(0)
copydelete
- community post
- history of this post
1SHARD Triple-Store | Download SHARD Triple-Store software for free at SourceForge.net
http://sourceforge.net/projects/shard-3store/
13 years ago by @zazi
show all tags
HDFS
Hadoop
MapReduce
SHARD
Scaling
Semantic_Web
Semantic_Web_Technology
Triple_Store
HDFSHadoopMapReduceSHARDScalingSemantic_WebSemantic_Web_TechnologyTriple_Store
(0)
copydelete
- community post
- history of this post
2SHARD
http://www.dist-systems.bbn.com/people/krohloff/shard.shtml
13 years ago by @zazi
show all tags
HDFS
Hadoop
MapReduce
SHARD
Semantic_Web
Semantic_Web_Technology
Triple_Store
HDFSHadoopMapReduceSHARDSemantic_WebSemantic_Web_TechnologyTriple_Store
(0)
copydelete
- community post
- history of this post
7Cascading
http://www.cascading.org/
13 years ago by @myhlow
show all tags
cascading
hadoop
mapreduce
cascadinghadoopmapreduce
(0)
copydelete
- community post
- history of this post
1Tom White: Learning MapReduce
http://www.lexemetech.com/2008/03/learning-mapreduce.html
13 years ago by @draganigajic
show all tags
cloudComputing
distributed
hadoop
java
mapReduce
tutorial
cloudComputingdistributedhadoopjavamapReducetutorial
(0)
copydelete
- community post
- history of this post
1Tv's cobweb: Incremental mapreduce
http://eagain.net/articles/incremental-mapreduce/
13 years ago by @draganigajic
show all tags
100+
cs
mapReduce
100+csmapReduce
(0)
copydelete
- community post
- history of this post
1MapReduce II - The Database Column
http://www.databasecolumn.com/2008/01/mapreduce-continued.html
13 years ago by @draganigajic
show all tags
db
mapReduce
dbmapReduce
(0)
copydelete
- community post
- history of this post
4MapReduce: A major step backwards - The Database Column
http://www.databasecolumn.com/2008/01/mapreduce-a-major-step-back.html
13 years ago by @draganigajic
show all tags
100+
article
critique
db
mapReduce
100+articlecritiquedbmapReduce
(0)
copydelete
- community post
- history of this post
1Hadoop Summit and Data-Intensive Computing Symposium Videos and Slides | Yahoo! Research
http://research.yahoo.com/node/2104
13 years ago by @draganigajic
show all tags
100+
cloudComputing
hadoop
java
mapReduce
presentation
scalability
slides
video
yahoo
100+cloudComputinghadoopjavamapReducepresentationscalabilityslidesvideoyahoo
(0)
copydelete
- community post
- history of this post
1Sawzall - a popular language at Google | Lambda the Ultimate
http://lambda-the-ultimate.org/node/916
13 years ago by @draganigajic
show all tags
google
ltu
mapReduce
parallel
pl
sawzall
googleltumapReduceparallelplsawzall
(0)
copydelete
- community post
- history of this post
4Welcome to Pig!
Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turns enables them to handle very large data sets. At the present time, Pig's infrastructure layer consists of a compiler that produces sequences of Map-Reduce programs, for which large-scale parallel implementations already exist Pig's language layer currently consists of a textual language called Pig Latin, which has the following key properties: * Ease of programming. It is trivial to achieve parallel execution of simple, "embarrassingly parallel" data analysis tasks. * Optimization opportunities. The way in which tasks are encoded permits the system to optimize their execution automatically * Extensibility.
13 years ago by @draganigajic
show all tags
apache
datamining
hadoop
java
mapreduce
apachedatamininghadoopjavamapreduce
(0)
copydelete
- community post
- history of this post
5Disco
Disco is an oss implementation of the Map-Reduce framework for distributed computing. Disco supports parallel computations over large data sets on unreliable cluster of computers. The Disco core is written in Erlang. Users of Disco typically write jobs in Python, which makes it possible to express even complex algorithms or data processing tasks often only in tens of lines of code. This means that you can quickly write scripts to process massive amounts of data. Disco was started at Nokia Research Center as a lightweight framework for rapid scripting of distributed data processing tasks. This far Disco has been succesfully used, for instance, in parsing and reformatting data, data clustering, probabilistic modelling, data mining, full-text indexing, and log analysis with hundreds of gigabytes of real-world data. Linux is the only supported platform but you can run Disco in the Amazon's Elastic Computing Cloud.
13 years ago by @draganigajic
show all tags
100+
EC2
concurrency
erlang
framework
mapreduce
python
scalability
100+EC2concurrencyerlangframeworkmapreducepythonscalability
(0)
copydelete
- community post
- history of this post
1LinksToSlides | Berlin Buzzwords 2011 - Search, Store, Scale
http://berlinbuzzwords.de/node/748
13 years ago by @schmitz
show all tags
2011
bigdata
buzzwords
conference
hadoop
java
mapreduce
presentation
slides
2011bigdatabuzzwordsconferencehadoopjavamapreducepresentationslides
(0)
copydelete
- community post
- history of this post
2MapReduce - MongoDB
http://www.mongodb.org/display/DOCS/MapReduce
13 years ago by @nosebrain
show all tags
mapreduce
mongodb
mapreducemongodb
(0)
copydelete
- community post
- history of this post
1mongodb/mongo-hadoop - GitHub
https://github.com/mongodb/mongo-hadoop
13 years ago by @nosebrain
show all tags
adapter
hadoop
mapreduce
mongodb
repository
adapterhadoopmapreducemongodbrepository
(0)
copydelete
- community post
- history of this post
2Welcome to Hive!
http://hive.apache.org/
13 years ago by @nosebrain
show all tags
hadoop
hive
language
mapreduce
query
sqllike
hadoophivelanguagemapreducequerysqllike
(0)
copydelete
- community post
- history of this post
1Hadoop Tutorial - YDN
http://developer.yahoo.com/hadoop/tutorial/module5.html#types
13 years ago by @stroeh
show all tags
custom
data
hadoop
mapreduce
tutorial
type
yahoo
customdatahadoopmapreducetutorialtypeyahoo
(0)
copydelete
- community post
- history of this post
2Search Hadoop
http://search-hadoop.com/
13 years ago by @stroeh
show all tags
hadoop
information
information-searchengine
mapreduce
hadoopinformationinformation-searchenginemapreduce
(0)
copydelete
- community post
- history of this post
3octo.py: quick and easy MapReduce for Python
http://ebiquity.umbc.edu/blogger/2009/01/02/octopy-quick-and-easy-mapreduce-for-python/
14 years ago by @brightbyte
show all tags
mapreduce
python
mapreducepython
(0)
copydelete
- community post
- history of this post
1Python Cloud: Google's MapReduce in 98 Lines of Python
http://clouddbs.blogspot.com/2010/10/googles-mapreduce-in-98-lines-of-python.html
14 years ago by @brightbyte
show all tags
mapreduce
python
mapreducepython
(0)
copydelete
- community post
- history of this post
5Jimmy Lin � Data-Intensive Text Processing with MapReduce
http://www.umiacs.umd.edu/~jimmylin/book.html
14 years ago by @schmitz
show all tags
book
download
free
mapreduce
textmining
bookdownloadfreemapreducetextmining
(0)
copydelete
- community post
- history of this post
7Cascading
Cascading is a Data Processing API, Process Planner, and Process Scheduler used for defining and executing complex, scale-free, and fault tolerant data processing workflows on an Apache Hadoop cluster. All without having to 'think' in MapReduce. Cascading is a thin Java library and API that sits on top of Hadoop's MapReduce layer and is executed from the command line like any other Hadoop application. As a library and API that can be driven from any JVM based language (Jython, JRuby, Groovy, Clojure, etc.), developers can create applications and frameworks that are "operationalized". That is, a single deployable Jar can be used to encapsulate a series of complex and dynamic processes all driven from the command line or a shell. Instead of using external schedulers to glue many individual applications together with XML against each individual command line interface. The Cascading API approach dramatically simplifies development, regression and integration testing, and deployment of business critical applications on both Amazon Web Services (like Elastic MapReduce) or on dedicated hardware. Cascading is not a new text based query syntax (like Pig) or another complex system that must be installed on a cluster and maintained (like Hive). But Cascading is both complimentary and a valid alternative to either application.
14 years ago by @gresch
show all tags
develop
framework
hadoop
jvm
mapreduce
software
developframeworkhadoopjvmmapreducesoftware
(0)
copydelete
- community post
- history of this post
3MapReduce – Wikipedia
http://de.wikipedia.org/wiki/MapReduce
14 years ago by @stroeh
show all tags
hadoop
mapreduce
nosql
hadoopmapreducenosql
(0)
copydelete
- community post
- history of this post
1MapReduce - Wikipedia, the free encyclopedia
http://en.wikipedia.org/wiki/Mapreduce
14 years ago by @stroeh
show all tags
hadoop
mapreduce
nosql
hadoopmapreducenosql
(0)
copydelete
- community post
- history of this post
3Linux-Magazin Online
Schnell, robust, einfach zu nutzen, skalierbar, weit einsetzbar und inklusive Monitoring: Das verspricht MapReduce, ein Framework von Google zur nebenläufigen Berechnung sehr großer Datenmengen auf Rechnerclustern. Ein mutiges Versprechen. Dieser Artikel wird zeigen, ob MapReduce es einlöst.
14 years ago by @tgunkel
show all tags
diplom
hadoop
mapreduce
tutorial
diplomhadoopmapreducetutorial
(0)
copydelete
- community post
- history of this post
1Hadoop Tutorial - YDN
Module 4: MapReduce
14 years ago by @tgunkel
show all tags
hadoop
mapreduce
tutorial
hadoopmapreducetutorial
(0)
copydelete
- community post
- history of this post
1Hadoop Default Ports Quick Reference « Cloudera » Apache Hadoop for the Enterprise
http://www.cloudera.com/blog/2009/08/hadoop-default-ports-quick-reference/
14 years ago by @schmitz
show all tags
hadoop
java
mapreduce
port
reference
hadoopjavamapreduceportreference
(0)
copydelete
- community post
- history of this post
3Funktionale Programmierung: Das MapReduce-Framework
Description of MapReduce framework with a petabyte of data.
15 years ago by @m_aster
show all tags
awm2010
awmhadoop
mapreduce
awm2010awmhadoopmapreduce
(0)
copydelete
- community post
- history of this post

⟨⟨
⟨
1
2
⟩
⟩⟩

publications (hide)74
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

1LARGE-SCALE DATA PROCESSING USING MAPREDUCE IN CLOUD COMPUTING ENVIRONMENT
S. Daneshyar, and M. Razmjoo. International Journal on Web Service Computing (IJWSC), 3 (4): 01-13 (December 2012)
12 days ago by @ijwsc
show all tags
Hadoop
MapReduce
and
cloud
computing
distributed
parallel
processing
HadoopMapReduceandcloudcomputingdistributedparallelprocessing
(0)
copydeleteadd this publication to your clipboard
1CLUSTBIGFIM-FREQUENT ITEMSET MINING OF BIG DATA USING PRE-PROCESSING BASED ON MAPREDUCE FRAMEWORK
S. Gole, and B. Tidke. International Journal on Foundations of Computer Science & Technology (IJFCST), 5 (3): 11 (May 2015)
a year ago by @devino
show all tags
Association
Big
Clustering
Data
Frequent
Itemset
MapReduce
Mining
Rule
AssociationBigClusteringDataFrequentItemsetMapReduceMiningRule
(0)
copydeleteadd this publication to your clipboard
25MapReduce: Simplified Data Processing on Large Clusters
J. Dean, and S. Ghemawat. page Pp. 137-150 of the Proceedings. (2004)
3 years ago by @alisahhh
show all tags
mapreduce
mapreduce
(0)
copydeleteadd this publication to your clipboard
2Sequential Exceptional Pattern Discovery Using Pattern-Growth: An Extensible Framework for Interpretable Machine Learning on Sequential Data
D. Mollenhauer, and M. Atzmueller. (2020)
4 years ago by @becker
show all tags
pattern
mining
gp
growth
prefix
span
parallel
distributed
exceptional
model
sequence
sequential
discovery
emm
map
reduce
mapreduce
spark
patternmininggpgrowthprefixspanparalleldistributedexceptionalmodelsequencesequentialdiscoveryemmmapreducemapreducespark
(0)
copydeleteadd this publication to your clipboard
1Optimising parallel R correlation matrix calculations on gene expression data using MapReduce
S. Wang, I. Pandis, D. Johnson, I. Emam, F. Guitton, A. Oehmichen, and Y. Guo. BMC bioinformatics, 15 (1): 1--9 (2014)
4 years ago by @becker
show all tags
paper:fastcor
parallel
mapreduce
citedby:scholar:count:27
citedby:scholar:timestamp:2020-12-22
paper:fastcorparallelmapreducecitedby:scholar:count:27citedby:scholar:timestamp:2020-12-22
(0)
copydeleteadd this publication to your clipboard
2A Review Paper on Big Data and Hadoop for Data Science
M. Tandel. International Journal of Trend in Scientific Research and Development, 4 (1): 1216-1221 (December 2019)
5 years ago by @ijtsrd
show all tags
BigData
DataMiining
HDFS
Hadoop
HadoopComponents
MapReduce
BigDataDataMiiningHDFSHadoopHadoopComponentsMapReduce
(0)
copydeleteadd this publication to your clipboard
1Next Generation Sequencing in Big Data
C. N. International Journal of Trend in Scientific Research and Development, 2 (4): 379-389 (June 2018)
6 years ago by @ijtsrd
show all tags
Bi
Bioinformatics
DNA
Generation
MapReduce
Next
Sanger
Sequencing
Strands
analytics
data
model
BiBioinformaticsDNAGenerationMapReduceNextSangerSequencingStrandsanalyticsdatamodel
(0)
copydeleteadd this publication to your clipboard
7SparkTrails: A MapReduce Implementation of HypTrails for Comparing Hypotheses About Human Trails
M. Becker, H. Mewes, A. Hotho, D. Dimitrov, F. Lemmerich, and M. Strohmaier. International Conference Companion on World Wide Web, page 17--18. Republic and Canton of Geneva, Switzerland, International World Wide Web Conferences Steering Committee, (2016)
7 years ago by @becker
show all tags
apache
bayes
bayesian
behavior
chain
computing
diss
diss:allmypubs
distributed
factor
human
hypotheses
inthesis
mapreduce
markov
myown
paths
sequences
sequential
spark
statistics
trails
web
apachebayesbayesianbehaviorchaincomputingdissdiss:allmypubsdistributedfactorhumanhypothesesinthesismapreducemarkovmyownpathssequencessequentialsparkstatisticstrailsweb
(0)
copydeleteadd this publication to your clipboard
27MapReduce: simplified data processing on large clusters
J. Dean, and S. Ghemawat. Communications of the ACM, 51 (1): 107--113 (2008)
7 years ago by @genbob
show all tags
mapreduce
mongodb
mapreducemongodb
(0)
copydeleteadd this publication to your clipboard
27MapReduce: simplified data processing on large clusters
J. Dean, and S. Ghemawat. Communications of the ACM, 51 (1): 107--113 (2008)
7 years ago by @becker
show all tags
citedby:scholar:count:20792
citedby:scholar:timestamp:2017-5-16
diss
inthesis
mapreduce
paper:fastcor
sparktrails
citedby:scholar:count:20792citedby:scholar:timestamp:2017-5-16dissinthesismapreducepaper:fastcorsparktrails
(0)
copydeleteadd this publication to your clipboard
3Exhaustive search algorithms to mine subgroups on Big Data using Apache Spark
F. Padillo, J. Luna, and S. Ventura. Progress in Artificial Intelligence, (2017)
8 years ago by @becker
show all tags
distributed
emm
mapreduce
parallel
spark
subgroup
subgroups
distributedemmmapreduceparallelsparksubgroupsubgroups
(0)
copydeleteadd this publication to your clipboard
1An RDF Metadata-Based Weighted Semantic Pagerank Algorithm
H. Hee-Gook Jun, Dong-Hyuk Im Kim. International Journal of Web & Semantic Technology (IJWesT), 7 (2): 11-24 (April 2016)
8 years ago by @laimbee
show all tags
Big
Data,
MapReduce
PageRank,
RDF,
Semantic
Web,
BigData,MapReducePageRank,RDF,SemanticWeb,
(0)
copydeleteadd this publication to your clipboard
7SparkTrails: A MapReduce Implementation of HypTrails for Comparing Hypotheses About Human Trails.
M. Becker, H. Mewes, A. Hotho, D. Dimitrov, F. Lemmerich, and M. Strohmaier. WWW (Companion Volume), page 17-18. ACM, (2016)
8 years ago by @hotho
show all tags
2016
big
data
hyptrails
implementation
mapreduce
myown
sparktrails
2016bigdatahyptrailsimplementationmapreducemyownsparktrails
(0)
copydeleteadd this publication to your clipboard
1Privacy Preservation in Analyzing EHealth Records in Big Data Environment
E. Srimathi, and K. Apoorva. International Journal on Recent and Innovation Trends in Computing and Communication, 3 (4): 2421--2427 (April 2015)
9 years ago by @ijritcc
show all tags
Anonymity
Anonymization
BigData
Data
Down
MapReduce
Specialization
Top
k
AnonymityAnonymizationBigDataDataDownMapReduceSpecializationTopk
(0)
copydeleteadd this publication to your clipboard
1Churn Prediction using MapReduce and HBase
G. Limaye, J. Chaudhary, and P. Punjabi. International Journal on Recent and Innovation Trends in Computing and Communication, 3 (3): 1699--1703 (March 2015)
9 years ago by @ijritcc
show all tags
C4.5
Churn
HBase
Hadoop
MapReduce
Prediction
C4.5ChurnHBaseHadoopMapReducePrediction
(0)
copydeleteadd this publication to your clipboard
27MapReduce: Simplified Data Processing on Large Clusters
J. Dean, and S. Ghemawat. Commun. ACM, 51 (1): 107--113 (January 2008)
10 years ago by @asmelash
show all tags
bigdata
mapreduce
methods
phd
phdproposal
bigdatamapreducemethodsphdphdproposal
(0)
copydeleteadd this publication to your clipboard
3Symbolic State Space Exploration of RT Systems in the Cloud
C. Bellettini, M. Camilli, L. Capra, and M. Monga. Symbolic and Numeric Algorithms for Scientific Computing (SYNASC), 2012 14th International Symposium on, page 295-302. IEEE Computer Society, (September 2012)
10 years ago by @carlobellettini
show all tags
analysis
cloud
mapreduce
petrinet
analysiscloudmapreducepetrinet
(0)
copydeleteadd this publication to your clipboard
3MaRDiGraS: Simplified Building of Reachability Graphs on Large Clusters
C. Bellettini, M. Camilli, L. Capra, and M. Monga. Reachability Problems, volume 8169 of Lecture Notes in Computer Science, Springer Berlin Heidelberg, (2013)
10 years ago by @carlobellettini
show all tags
analysis
cloud
mapreduce
petrinet
analysiscloudmapreducepetrinet
(0)
copydeleteadd this publication to your clipboard
1Fast Detection of Connected Components in Large Scale Graphs Using MapReduce
M. Ali Varamesh1. IOSR Journal of Engineering (IOSRJEN), (February 2014)
11 years ago by @agibhardt
show all tags
CC-MR
Connected
Graph
Mapreduce
Pegasus
components
connected
CC-MRConnectedGraphMapreducePegasuscomponentsconnected
(0)
copydeleteadd this publication to your clipboard
27MapReduce: simplified data processing on large clusters
J. Dean, and S. Ghemawat. Communications of the ACM, 51 (1): 107--113 (2008)
11 years ago by @thoni
show all tags
mapreduce
seminar
ss2014
thema
mapreduceseminarss2014thema
(0)
copydeleteadd this publication to your clipboard

⟨⟨
⟨
1
2
3
⟩
⟩⟩