Posted on Oct 21, 2014
Hadoop and MapReduce were never designed for real-time, highly interactive business data analytics. While these two tools went a long way toward simplifying the analysis of big data, they are, in some ways, victims of their own success.
Once they could mine huge datasets for insights, analysts wanted more complex reporting, more interactive ad-hoc reporting, and real-time online processing. How do you build this on top of a toolset designed for batch jobs?
Enter Apache Spark, an...