Parallel or distributed mining; Cluster-based data mining algorithms and systems; Grid-based data mining algorithms and systems; Peer-to-Peer based data mining algorithms and systems; Data mining algorithms and systems based on parallel hardware platforms
Following up on KMeans Clustering Now Running on Elastic MapReduce, Stephen Green has generously documented, on the Apache Lucene Mahout wiki, the steps that were necessary to get an example of k-Means clustering up and running on Amazon’s Elastic MapReduce (EMR).
Schouten, Bueno, W. Duivesteijn, and M. Pechenizkiy. Data Mining and Knowledge Discovery, 36 (1): 379--413 (January 2022).
Funding Information: This research is supported by EDIC project funded by NWO. We thank the EDIC consortium and the ZGT hospital for allowing us to analyse the data from the DIALECT-2 study. We especially thank Niala Den Braber (PhD candidate at Universiteit Twente and researcher internal medicine at ZGT hospital) and prof. dr. Goos Laverman (internist-nephrologist at ZGT hospital) for giving us clinical valuation of our findings. In addition, we thank our colleagues dr. Robert Peharz for giving us useful insights on Markov chains and DBNs and dr. Maryam Tavakol for guiding us towards the MovieLens dataset.