Big data infrastructure depends heavily on the Java Virtual Machine ecosystem. Although packaging Java software is simple and straightforward for a single package, it becomes complicated across the whole ecosystem. A Java package usually bundles many third-party files to satisfy its dependencies, and another package may bundle the same files. This duplication wastes disk space and makes Java packages hard to maintain. This project aims to parse the metadata of Maven packages and provide an up-to-date overlay containing the full dependencies of Java software such as Spark.



Zongyu Zhang


  • Miroslav Šulc
  • Benda Xu
  • Andrey Savchenko