Apache Spark is an open source big data processing framework built around speed, ease of use, and sophisticated analytics. In this article, Srini Penchikala talks about how Apache Spark framework helps with big data processing and analytics with its standard API. He also discusses how Spark compares with traditional MapReduce implementation like Apache Hadoop.
The main use cases for Spark are iterative Machine Learning algorithms and Interactive analytics. From the ML side -------------------- Most ML algorithms ru...
D. Knoell, M. Atzmueller, C. Rieder, and K. Scherer. Proc. GWEM 2017, co-located with 9th Conference Professional Knowledge Management (WM 2017), Karlsruhe, Germany, KIT, (2017)