Umbrella organization for Particle (High-energy) Physics-related projects

CERN-HSF (High-Energy Physics Software Foundation) is the umbrella organization for high-energy physics-related projects in GSoC. The HEP Software Foundation facilitates the coordination of common international efforts in high-energy physics software and computing.

CERN (European Organization for Nuclear Research) has participated in GSoC since 2011 as the CERN-SFT group, which provides common software for CERN's experiments. In 2017, the program expanded to include many software projects from the whole field of high-energy physics. The vast majority of our GSoC projects do not require any physics knowledge.

The experiments at CERN, such as those at the Large Hadron Collider (LHC), the world’s largest and most powerful particle accelerator, try to answer fundamental questions about the Universe. For example, what is the nature of mass? What are the elementary building blocks of the Universe? What was the early Universe like? What is the nature of dark matter and dark energy? Why is there an asymmetry between matter and antimatter? In 2012, LHC experiments announced the discovery of a new particle, the Higgs boson, that helps explain how particles obtain mass. CERN is also the birthplace of the World Wide Web. Today, particle physicists are analyzing the data from the experiments to study the properties of the newly discovered particle and to search for new physics, such as dark matter or extra dimensions. This requires a lot of sophisticated software.

The open-source projects to which students can contribute during GSoC span many areas of high-energy physics software: data analysis, detector and accelerator simulation, event reconstruction, data management and many others. We look forward to your contributions!



  • c/c++
  • python
  • data analysis
  • machine learning
  • parallelization



CERN-HSF 2018 Projects

  • Saurav Sachidanand
    CernVM-FS powered WebAssembly I/O
    The goal of this project is to build a JavaScript client for the CernVM filesystem implemented as a pluggable backend library for the Emscripten...
  • Tarang Mahapatra
    Configuration of Lightweight Sites Components using Ansible
    Lightweight Sites is a project at CERN that enables the quick setup of new sites on the grid. Often, a lot of time is spent in setting up new sites...
  • Emmanouil Stergiadis
    Convolutional Deep Neural Networks on GPUs for Particle Physics Applications
    The project's ultimate goal is to provide a GPU implementation for the existing Convolutional Neural Network package within root/tmva. During my...
  • Antares
    CPU Race for Particle Hunting
    Investigate the viability of xtensor, a multi-dimensional array package which aims to bring the API feeling of NumPy to C++, in the ACTS...
  • Ravi Kiran Selvam
    Development of the Deep Learning Optimization Algorithms in TMVA.
    The existing TMVA submodule has always used gradient descent to update the parameters and minimize the cost of the neural networks. More advanced...
  • Jaydeep Nandi
    DIANA-HEP: Analysis Functions:: Implementation and optimization of common particle physics analysis algorithms in a functional paradigm
    High-energy particle physics experiments typically generate a large amount of data. Analyzing it requires significant computing resources. This is...
  • shravan97
    Distributed Big Data Analysis with TDataFrame
    The main objective of this project is to make it easier for researchers/developers to submit distributed jobs for analyzing datasets using TDataFrame...
  • Aleksandr Efremov
    Extend clad - The Automatic Differentiation
    CLAD is a Clang plugin that implements automatic differentiation. Automatic differentiation avoids usual disadvantages of symbolic and numerical...
  • Hanlin Tang
    googledoc Timeline: Before 5/22/2018: Get familiar with ROOT. Get in touch with community. Have a preliminary to-do list of possible optimizing...
  • David Tellenbach
    Faster Matrix Algebra for ATLAS
    Eigen is a C++ template library for linear algebra that aims for high performance in combination with high reliability and good compiler support. A lot...
  • Dima Savin
    GDML I/O for VecGeom Geometry Package
    VecGeom is the new geometry library developed within the high-energy physics community, aiming to replace the legacy geometry navigation...
  • Anushree Rankawat
    Generative Adversarial Networks for Particle Physics Applications
    Deep Learning Networks have proven to perform well for a wide range of problems, especially those requiring large labelled datasets to learn patterns....
  • Abhinav Kumar
    HAhRD: DeepReconstruction
    One of the challenges faced in particle physics experiments after the collision of particles in the LHC is the reconstruction of the events. This includes...
  • tmplt
    Improve multi-threading support for CERN’s Allpix-Squared project using dependency graphs
    Allpix-Squared is a free and open-source simulation framework for silicon tracker and vertex detectors written in modern C++. Its goal is to...
  • Aman Pratap Singh
    Large-scale computing backend for Jupyter notebooks - HTCondor batch job submission and monitoring using the Ganga toolkit
    Jupyter Notebook is an interactive computing environment that creates notebooks which contain computer code as well as rich text elements like...
  • Ayoub Chouak
    Modular YAMPL
    The project fundamentally consists in developing a flexible plugin system that allows YAMPL to selectively load essential components at...
  • Yash Srivastava
    Monitoring and traceability of jobs using ElasticSearch - DIRAC
    The DIRAC Interware can handle hundreds of thousands of “jobs” daily. LHCb (the “beauty” experiment at the LHC) is DIRAC’s main user and developer....
  • Mayank Modi
    Open-Source Simulations for Gas Detector on Python
    Magboltz solves the Boltzmann transport equations with numerical integration in order to simulate the interactions of electrons in gas mixtures under...
  • Somesh Singh
    Optimize and Integrate Standalone Tracking Library (SixTrackLib)
    The project targets optimizing the data structures and source code of a standalone tracking library, SixTrackLib, written in C. The aim is to...
  • Ivantha
    Petabyte-Scale Cloud Storage File Manager
    CERNBox is a cloud storage synchronisation service for CERN users: it allows syncing and sharing files on all major mobile and desktop platforms...
  • Mikhail Ivchenko
    Pure-Go XRootD client implementation
    This project aims at implementing a pure-Go BSD-3 client library for XRootD using Go builtin features for concurrency and scalability.
  • Harshit Prasad
    Recurrent Neural Networks and LSTMs on GPUs for Particle Physics Applications
    Toolkit for Multivariate Analysis (TMVA) is a machine learning toolkit for the ROOT scientific software framework used in many particle physics data...
  • Ashwin Samudre
    ROOT package manager
    ROOT is the data processing framework created at CERN - at the heart of the research on high-energy physics. Every day, thousands of physicists use...
  • Asket Agarwal
    Rucio : Billion-row scalable and flexible metadata
    Rucio produces large amounts of metadata for its files and datasets which is stored in a central Rucio server. However, there is a fixed set of...
  • Mayur Bhosale
    Spark3D: Extend Apache Spark to support 3D Spatial Datasets
    A large amount of 3D data is generated in High Energy Physics & Astrophysics experiments. To process this data efficiently, one would need...
  • SiddharthaRao Kamalakara
    Variational Autoencoders on GPUs for particle physics applications
    Deep Learning relies heavily on a large number of linear operations. Data parallelism is a consequence of this property of Deep Learning. GPUs are...