Inproceedings,

GrayWulf: Scalable Software Architecture for Data Intensive Computing

Hawaii International Conference on System Sciences (HICSS), pages 1-10. IEEE, 2009. CORE rank: A.
DOI: 10.1109/HICSS.2009.235

Abstract

Big data presents new challenges to both cluster infrastructure software and parallel application design. We present a set of software services and design principles for data intensive computing with petabyte data sets, named GrayWulf. These services are intended for deployment on a cluster of commodity servers similar to the well-known Beowulf clusters. We use the Pan-STARRS system currently under development as an example of the architecture and principles in action.
