tag :: mapreduce Hadoop data

Lesezeichen (verstecken)8
Anzeige
alles
nur Lesezeichen
Lesezeichen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
RSS
BibTeX
XML

1Hadoop, Pig, and Twitter (NoSQL East 2009)
http://www.slideshare.net/kevinweil/hadoop-pig-and-twitter-nosql-east-2009
vor 13 Jahren von @muehlburger
alle anzeigen
data
database
datamining
hadoop
mapreduce
programming
twitter
datadatabasedatamininghadoopmapreduceprogrammingtwitter
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
3Dryad - Microsoft Research
http://research.microsoft.com/en-us/projects/dryad/
vor 13 Jahren von @muehlburger
alle anzeigen
cloud
cluster
computing
data
distributed
dryad
grid
hadoop
mapreduce
microsoft
research
scalability
cloudclustercomputingdatadistributeddryadgridhadoopmapreducemicrosoftresearchscalability
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1Data Recipes: Graph Processing With Apache Pig
http://thedatachef.blogspot.com/2011/01/graph-processing-with-apache-pig.html
vor 13 Jahren von @muehlburger
alle anzeigen
data
graph
hadoop
mapreduce
pig
r
datagraphhadoopmapreducepigr
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1Hadoop Tutorial - YDN
http://developer.yahoo.com/hadoop/tutorial/module5.html#types
vor 13 Jahren von @stroeh
alle anzeigen
custom
data
hadoop
mapreduce
tutorial
type
yahoo
customdatahadoopmapreducetutorialtypeyahoo
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
5Data-Intensive Text Processing with MapReduce
Our world is being revolutionized by data-driven methods: access to large amounts of data has generated new insights and opened exciting new opportunities in commerce, science, and computing applications. Processing the enormous quantities of data necessary for these advances requires large clusters, making distributed computing paradigms more crucial than ever. MapReduce is a programming model for expressing distributed computations on massive datasets and an execution framework for large-scale data processing on clusters of commodity servers. The programming model provides an easy-to-understand abstraction for designing scalable algorithms, while the execution framework transparently handles many system-level details, ranging from scheduling to synchronization to fault tolerance. This book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning. We introduce the notion of MapReduce design patterns, which represent general reusable solutions to commonly occurring problems across a variety of problem domains. This book not only intends to help the reader "think in MapReduce", but also discusses limitations of the programming model as well.
vor 15 Jahren von @flowolf
alle anzeigen
awm2010
awmhadoop
data
hadoop
hadoop-group
intensive
mapreduce
processing
text
awm2010awmhadoopdatahadoophadoop-groupintensivemapreduceprocessingtext
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
3Data-Intensive Information Processing Applications
This course is about scalable approaches to processing large amounts of information (terabytes and even petabytes). We focus mostly on MapReduce, which is presently the most accessible and practical means of computing at this scale, but will discuss other approaches as well.
vor 15 Jahren von @flowolf
alle anzeigen
MapReduce
applications
awm2010
awmhadoop
data
hadoop
hadoop-group
information
intensive
processing
MapReduceapplicationsawm2010awmhadoopdatahadoophadoop-groupinformationintensiveprocessing
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
3Data-Intensive Information Processing Applications (Spring 2010) | Home
http://www.umiacs.umd.edu/~jimmylin/cloud-2010-Spring/
vor 15 Jahren von @muehlburger
alle anzeigen
awm2010
cloud
computing
course
data
distributed
hadoop
lectures
mapreduce
nlp
awm2010cloudcomputingcoursedatadistributedhadooplecturesmapreducenlp
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
2katta - distributed lucene
Katta is a scalable, failure tolerant, distributed, data storage for real time access. Katta serves large, replicated, indices as shards to serve high loads and very large data sets. These indices can be of different type. Currently implementations are available for Lucene and Hadoop mapfiles. * Makes serving large or high load indices easy * Serves very large Lucene or Hadoop Mapfile indices as index shards on many servers * Replicate shards on different servers for performance and fault-tolerance * Supports pluggable network topologies * Master fail-over * Fast, lightweight, easy to integrate * Plays well with Hadoop clusters * Apache Version 2 License
vor 15 Jahren von @gresch
alle anzeigen
cloud
data
framework
hadoop
indices
java
lucene
mapreduce
search
searchengine
searching
shards
software
tools
clouddataframeworkhadoopindicesjavalucenemapreducesearchsearchenginesearchingshardssoftwaretools
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags

⟨⟨
⟨
1
⟩
⟩⟩

Publikationen (verstecken)3
Anzeige
alles
nur Publikationen
Publikationen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
Autor
Erscheinungsdatum
Eintragstyp
Hilfe für erweiterte Sortierung...
RSS
BibTeX
RDF
mehr...

25MapReduce: Simplified Data Processing on Large Clusters
J. Dean, und S. Ghemawat. OSDI, (2004)
vor 14 Jahren von @flowolf
alle anzeigen
awm2010
awmhadoop
data
google
hadoop
hadoop-group
mapreduce
processing
simplified
awm2010awmhadoopdatagooglehadoophadoop-groupmapreduceprocessingsimplified
(0)
KopierenLöschenDiese Publikation zur Ablage hinzufügen
2Extracting user profiles from large scale data
M. Shmueli-Scheuer, H. Roitman, D. Carmel, Y. Mass, und D. Konopnicki. MDAC '10: Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud, Seite 1--6. New York, NY, USA, ACM, (2010)
vor 15 Jahren von @muehlburger
alle anzeigen
MapReduce
awm2010
awmhadoop
data
hadoop
mining
profiles
user
MapReduceawm2010awmhadoopdatahadoopminingprofilesuser
(0)
KopierenLöschenDiese Publikation zur Ablage hinzufügen
2A novel approach to multiple sequence alignment using hadoop data grids
G. Sadasivam, und G. Baktavatchalam. MDAC '10: Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud, Seite 1--7. New York, NY, USA, ACM, (2010)
vor 15 Jahren von @muehlburger
alle anzeigen
MapReduce
alignment
awm2010
awmhadoop
data
grids
hadoop
multiple
sequence
MapReducealignmentawm2010awmhadoopdatagridshadoopmultiplesequence
(0)
KopierenLöschenDiese Publikation zur Ablage hinzufügen

⟨⟨
⟨
1
⟩
⟩⟩

BibSonomy

Lesezeichen (verstecken)8
Anzeige
alles
nur Lesezeichen
Lesezeichen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
RSS
BibTeX
XML

1Hadoop, Pig, and Twitter (NoSQL East 2009)

3Dryad - Microsoft Research

1Data Recipes: Graph Processing With Apache Pig

1Hadoop Tutorial - YDN

5Data-Intensive Text Processing with MapReduce

3Data-Intensive Information Processing Applications

3Data-Intensive Information Processing Applications (Spring 2010) | Home

2katta - distributed lucene

Publikationen (verstecken)3
Anzeige
alles
nur Publikationen
Publikationen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
Autor
Erscheinungsdatum
Eintragstyp
Hilfe für erweiterte Sortierung...
RSS
BibTeX
RDF
mehr...

25MapReduce: Simplified Data Processing on Large Clusters

2Extracting user profiles from large scale data

2A novel approach to multiple sequence alignment using hadoop data grids

Stöbern

Verwandte Tags

Lesezeichen (verstecken)8 Anzeigeallesnur LesezeichenLesezeichen pro Seite5102050100 sortieren nachhinzugefügt amTitel RSSBibTeXXML

Publikationen (verstecken)3 Anzeigeallesnur PublikationenPublikationen pro Seite5102050100 sortieren nachhinzugefügt amTitelAutorErscheinungsdatumEintragstypHilfe für erweiterte Sortierung... RSSBibTeXRDFmehr...

Stöbern

Verwandte Tags

Lesezeichen (verstecken)8
Anzeige
alles
nur Lesezeichen
Lesezeichen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
RSS
BibTeX
XML

Publikationen (verstecken)3
Anzeige
alles
nur Publikationen
Publikationen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
Autor
Erscheinungsdatum
Eintragstyp
Hilfe für erweiterte Sortierung...
RSS
BibTeX
RDF
mehr...