Following up on KMeans Clustering Now Running on Elastic MapReduce, Stephen Green has generously documented the steps that was necessary to get an example of k-Means clustering up and running on Amazon’s Elastic MapReduce (EMR) on the Apache Lucene Mahout wiki.
S. Basu, A. Banerjee, и R. Mooney. Proceedings of the 2004 SIAM International Conference on Data Mining, стр. 333--344. Lake Buena Vista, FL, Society for Industrial and Applied Mathematics, (апреля 2004)
A. Phansalkar, A. Joshi, L. Eeckhout, и L. John. IEEE International Symposium on Performance Analysis of Systems and Software, 2005. ISPASS 2005., стр. 10--20. (марта 2005)