Following up on "KMeans Clustering Now Running on Elastic MapReduce", Stephen Green has generously documented, on the Apache Lucene Mahout wiki, the steps that were necessary to get an example of k-Means clustering up and running on Amazon’s Elastic MapReduce (EMR).