Pig is a platform for analyzing large data sets, consisting of a high-level language for expressing data analysis programs coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turn enables them to handle very large data sets. At present, Pig's infrastructure layer consists of a compiler that produces sequences of Map-Reduce programs, for which large-scale parallel implementations already exist. Pig's language layer currently consists of a textual language called Pig Latin, which has the following key properties:

* Ease of programming. It is trivial to achieve parallel execution of simple, "embarrassingly parallel" data analysis tasks (see the sketch after this list).
* Optimization opportunities. The way in which tasks are encoded permits the system to optimize their execution automatically.
* Extensibility. Users can create their own functions to do special-purpose processing.
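As a minimal sketch of the "ease of programming" claim, here is the classic word count written in Pig Latin; the paths 'input.txt' and 'wordcount_out' are hypothetical placeholders, and Pig turns the script into a sequence of Map-Reduce jobs without the programmer writing any of them:

    -- Load each line of the (hypothetical) input file as a single string.
    lines  = LOAD 'input.txt' USING TextLoader() AS (line:chararray);
    -- Split each line into words and flatten the resulting bag into rows.
    words  = FOREACH lines GENERATE FLATTEN(TOKENIZE(line)) AS word;
    -- Group identical words together and count each group.
    grpd   = GROUP words BY word;
    counts = FOREACH grpd GENERATE group AS word, COUNT(words) AS cnt;
    STORE counts INTO 'wordcount_out';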
The lucky kids of JavaSchools are never going to get weird segfaults trying to implement pointer-based hash tables. They're never going to go stark, raving mad trying to pack things into bits. They'll never have to get their head around how, in a purely functional program, the value of a variable never changes.
Map-Reduce is on its way out. But we shouldn’t measure its importance by the number of bytes it crunches; we should measure it by the fundamental shift in data processing architectures it helped popularise.
Sqoop is a tool designed to import data from relational databases into Hadoop. Sqoop uses JDBC to connect to a database, examines each table's schema, and automatically generates the classes needed to import the data into the Hadoop Distributed File System (HDFS). It then creates and launches a MapReduce job that reads tables from the database via DBInputFormat, the JDBC-based InputFormat, writing each table to a set of files in HDFS. Sqoop supports both SequenceFile and text-based targets and includes performance enhancements for loading data from MySQL.
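A typical import invocation might look like the following; the connection URL, credentials, table name, and target directory are made-up placeholders:

    # Import the 'employees' table from a hypothetical MySQL database into HDFS
    # as plain text files, running the copy as 4 parallel map tasks.
    sqoop import \
      --connect jdbc:mysql://db.example.com/corp \
      --username dbuser -P \
      --table employees \
      --target-dir /data/employees \
      --as-textfile \
      --num-mappers 4

For MySQL specifically, adding the --direct flag switches to the faster mysqldump-based import path instead of going through JDBC.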
P. Pantel, E. Crestan, A. Borkovsky, A. Popescu, and V. Vyas. EMNLP '09: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pages 938--947. Morristown, NJ, USA, Association for Computational Linguistics, (2009)
P. Ravindra, V. Deshpande, and K. Anyanwu. MDAC '10: Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud, pages 1--6. New York, NY, USA, ACM, (2010)
C. Bellettini, M. Camilli, L. Capra, and M. Monga. 14th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC), pages 295--302. IEEE Computer Society, (September 2012)
M. Becker, H. Mewes, A. Hotho, D. Dimitrov, F. Lemmerich, and M. Strohmaier. International Conference Companion on World Wide Web, pages 17--18. Republic and Canton of Geneva, Switzerland, International World Wide Web Conferences Steering Committee, (2016)
M. Bayir, I. Toroslu, A. Cosar, and G. Fidan. WWW '09: Proceedings of the 18th International Conference on World Wide Web, pages 161--170. New York, NY, USA, ACM, (2009)
J. Urbani, S. Kotoulas, E. Oren, and F. van Harmelen. International Semantic Web Conference, volume 5823 of Lecture Notes in Computer Science, pages 634--649. Springer, (2009)
A. Ghoting, P. Kambadur, E. Pednault, and R. Kannan. Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA, August 21-24, 2011, pages 334--342. (2011)
F. Chierichetti, R. Kumar, and A. Tomkins. WWW '10: Proceedings of the 19th International Conference on World Wide Web, pages 231--240. New York, NY, USA, ACM, (2010)
C. Bellettini, M. Camilli, L. Capra, and M. Monga. Reachability Problems, volume 8169 of Lecture Notes in Computer Science, Springer Berlin Heidelberg, (2013)