Following up on KMeans Clustering Now Running on Elastic MapReduce, Stephen Green has generously documented the steps that was necessary to get an example of k-Means clustering up and running on Amazon’s Elastic MapReduce (EMR) on the Apache Lucene Mahout wiki.
D. Cutting, D. Karger, J. Pedersen, und J. Tukey. SIGIR '92: Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval, Seite 318--329. New York, NY, USA, ACM Press, (1992)
J. MacQueen. Proc. of the fifth Berkeley Symposium on Mathematical Statistics and Probability, 1, Seite 281-297. University of California Press, (1967)
J. MacQueen. Proc. of the fifth Berkeley Symposium on Mathematical Statistics and Probability, 1, Seite 281-297. University of California Press, (1967)