9781449327279 - Hadoop Operations - If you've been tasked with maintaining large and complex Hadoop clusters, or are about to be, this book is a must. You'll learn the particulars of Hadoop operations, from planning, installing, and configuring the system to providing ongoing maintenance.
The Cloudera Solutions team shares its insights into getting the most out of your Hadoop deployment. A webinar recording is available at www.cloudera.com/events
From the blog hadoopnew: Last month the HCatalog project (formerly known as Howl) was accepted into the Apache Incubator. We have already branched for a 0.1 release, which we hope to push in the next few weeks. Given all this activity, I thought it …
Chipmaker Intel has released its own Hadoop distribution, tuned specifically for its own processors. It is said to analyze data significantly faster, especially …
Spark is a fast, in-memory cluster computing framework with a language-integrated interface in Scala. It shines at iterative MapReduce (e.g. machine learning) and interactive data mining, where keeping data in memory provides substantial speedups.
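A minimal sketch of the caching behind those speedups, written against Spark's Java API (the language-integrated Scala interface mentioned above expresses the same thing more tersely); the master URL, application name, and input path are hypothetical:

    import org.apache.spark.api.java.JavaRDD;
    import org.apache.spark.api.java.JavaSparkContext;
    import org.apache.spark.api.java.function.Function;

    public class LogMiner {
      public static void main(String[] args) {
        // Hypothetical master URL, app name, and HDFS path.
        JavaSparkContext sc = new JavaSparkContext("local[2]", "LogMiner");
        JavaRDD<String> errors = sc.textFile("hdfs://namenode/logs/app.log")
            .filter(new Function<String, Boolean>() {
              public Boolean call(String line) { return line.contains("ERROR"); }
            })
            .cache(); // keep the filtered set in memory for reuse

        // The first action materializes the RDD (reads from disk, then caches).
        long total = errors.count();
        // Subsequent actions over the same RDD are served from memory.
        long timeouts = errors.filter(new Function<String, Boolean>() {
          public Boolean call(String line) { return line.contains("timeout"); }
        }).count();
        System.out.println(total + " errors, " + timeouts + " timeouts");
        sc.stop();
      }
    }

The second count is where the iterative speedup shows up: it never touches HDFS again.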
Hadoop On Demand (HOD) is a system for provisioning virtual Hadoop clusters over a large physical cluster. It uses the Torque resource manager to do node allocation. On the allocated nodes, it can start Hadoop Map/Reduce and HDFS daemons. It automatically generates the appropriate configuration files (hadoop-site.xml) for the Hadoop daemons and client. HOD also has the capability to distribute Hadoop to the nodes in the virtual cluster that it allocates. In short, HOD makes it easy for administrators and users to quickly set up and use Hadoop. It is also a very useful tool for Hadoop developers and testers who need to share a physical cluster for testing their own Hadoop versions.
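As a hedged illustration of what "automatically generates the appropriate configuration files" means for a client: the sketch below loads a HOD-generated hadoop-site.xml into a Hadoop Configuration to talk to the provisioned cluster. The cluster directory path is hypothetical; the property names are the classic Hadoop keys.

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class HodClusterClient {
      public static void main(String[] args) throws Exception {
        // HOD drops a generated hadoop-site.xml into the cluster directory it
        // allocates; loading it points this client at the provisioned daemons.
        // The cluster directory below is hypothetical.
        Configuration conf = new Configuration();
        conf.addResource(new Path("/home/alice/hod-clusters/test/hadoop-site.xml"));
        System.out.println("JobTracker: " + conf.get("mapred.job.tracker"));
        System.out.println("NameNode:   " + conf.get("fs.default.name"));
        FileSystem fs = FileSystem.get(conf); // HOD-provisioned HDFS
        System.out.println("Home dir:   " + fs.getHomeDirectory());
      }
    }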
HBase is the Hadoop database. It's an open-source, distributed, column-oriented store modeled after the Google paper, Bigtable: A Distributed Storage System for Structured Data by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Hadoop. HBase's goal is the hosting of very large tables -- billions of rows × millions of columns -- atop clusters of commodity hardware. Try it if your plans for a data store run big.
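A hedged sketch of what that Bigtable-like access looks like through the HBase Java client (the table, column family, and qualifier names here are made up): every row is addressed by key, and each cell lives at (row key, column family, column qualifier).

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.hbase.HBaseConfiguration;
    import org.apache.hadoop.hbase.client.Get;
    import org.apache.hadoop.hbase.client.HTable;
    import org.apache.hadoop.hbase.client.Put;
    import org.apache.hadoop.hbase.client.Result;
    import org.apache.hadoop.hbase.util.Bytes;

    public class WebTableExample {
      public static void main(String[] args) throws Exception {
        Configuration conf = HBaseConfiguration.create();
        HTable table = new HTable(conf, "webtable"); // hypothetical table name

        // Write one cell: row key -> (family, qualifier) -> value.
        Put put = new Put(Bytes.toBytes("com.example/index.html"));
        put.add(Bytes.toBytes("contents"), Bytes.toBytes("html"),
                Bytes.toBytes("<html>...</html>"));
        table.put(put);

        // Read it back by row key.
        Get get = new Get(Bytes.toBytes("com.example/index.html"));
        Result result = table.get(get);
        byte[] html = result.getValue(Bytes.toBytes("contents"),
                                      Bytes.toBytes("html"));
        System.out.println(Bytes.toString(html));
        table.close();
      }
    }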
HBase: Bigtable-like structured storage for Hadoop HDFS. Just as Google's Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Hadoop Core. Data is organized into tables, rows, and columns. An Iterator-like interface is available for scanning through a row range (and, of course, a column value can be retrieved for a specific key). Any particular column may have multiple versions for the same row key.
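The Iterator-like interface for scanning a row range corresponds to the client's Scan/ResultScanner pair. A minimal sketch, assuming the HTable handle named table and the contents family from the previous example; the row-key range is made up:

    import org.apache.hadoop.hbase.client.Result;
    import org.apache.hadoop.hbase.client.ResultScanner;
    import org.apache.hadoop.hbase.client.Scan;
    import org.apache.hadoop.hbase.util.Bytes;

    // Scan the half-open row range [com.example/, com.exampleX) for one column,
    // asking for up to three versions of each cell.
    Scan scan = new Scan(Bytes.toBytes("com.example/"), Bytes.toBytes("com.exampleX"));
    scan.addColumn(Bytes.toBytes("contents"), Bytes.toBytes("html"));
    scan.setMaxVersions(3);

    ResultScanner scanner = table.getScanner(scan);
    try {
      for (Result row : scanner) { // rows come back in key order
        System.out.println(Bytes.toString(row.getRow()));
      }
    } finally {
      scanner.close();
    }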
Apache's Hadoop project aims to solve these problems by providing a framework for running large data processing applications on clusters of commodity hardware. Combined with Amazon EC2 for running the application and Amazon S3 for storing the data, we can run large jobs very economically. This paper describes how to use Amazon Web Services and Hadoop to run an ad hoc analysis on a large collection of web access logs that otherwise would have cost a prohibitive amount in either time or money.
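In practice, pointing a Hadoop job at S3-resident logs is mostly a matter of the input path: Hadoop's native S3 filesystem registers the s3n:// scheme, so a job driver can read a bucket directly. A hedged sketch (the bucket and prefixes are hypothetical; credentials are assumed to be supplied via configuration):

    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.mapreduce.Job;
    import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
    import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

    public class S3LogJob {
      public static void main(String[] args) throws Exception {
        Job job = new Job();
        job.setJobName("access-log-analysis");
        // Hypothetical bucket; s3n:// is Hadoop's native S3 filesystem scheme.
        FileInputFormat.addInputPath(job, new Path("s3n://my-log-bucket/access-logs/"));
        FileOutputFormat.setOutputPath(job, new Path("s3n://my-log-bucket/output/"));
        // AWS credentials go in fs.s3n.awsAccessKeyId / fs.s3n.awsSecretAccessKey.
        // With no mapper or reducer set, the identity defaults simply copy records.
        System.exit(job.waitForCompletion(true) ? 0 : 1);
      }
    }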
Introduction: This document describes how Map and Reduce operations are carried out in Hadoop. If you are not familiar with the Google MapReduce programming model, you should get acquainted with it first.
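For readers new to the model, here is a minimal word-count sketch of the two operations in Hadoop's Java MapReduce API (the class names are made up; the map and reduce signatures are the framework's own):

    import java.io.IOException;
    import org.apache.hadoop.io.IntWritable;
    import org.apache.hadoop.io.LongWritable;
    import org.apache.hadoop.io.Text;
    import org.apache.hadoop.mapreduce.Mapper;
    import org.apache.hadoop.mapreduce.Reducer;

    // Map: emit (word, 1) for every word in the input line.
    public class TokenMapper extends Mapper<LongWritable, Text, Text, IntWritable> {
      private static final IntWritable ONE = new IntWritable(1);
      private final Text word = new Text();
      protected void map(LongWritable key, Text value, Context context)
          throws IOException, InterruptedException {
        for (String token : value.toString().split("\\s+")) {
          if (token.isEmpty()) continue;
          word.set(token);
          context.write(word, ONE);
        }
      }
    }

    // Reduce: the framework groups values by key; sum the counts for each word.
    class SumReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
      protected void reduce(Text key, Iterable<IntWritable> values, Context context)
          throws IOException, InterruptedException {
        int sum = 0;
        for (IntWritable v : values) sum += v.get();
        context.write(key, new IntWritable(sum));
      }
    }

Between the two phases the framework sorts and shuffles the mapper output so that all (word, 1) pairs for the same word arrive at a single reduce call.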