Big data analytics has become increasingly popular in many application domains including medical imaging applications, location based services and other geospatial problems. MapReduce is and has been one of the most popular model for high performance distributed computing and data analytics. Multidimensional data cubes on the other hand have been supported by databases for large scale business intelligence. In this project we will combine the two worlds and implement a multi-dimensional data cube in Apache Spark for high performance analytics.


Sameer Sonawane


  • bmi-code
  • Furqan Baig