The umbrella organization for Particle (High-Energy) Physics-related projects

CERN-HSF (High-Energy Physics Software Foundation) is the umbrella organization for high-energy physics-related projects in GSoC. The HEP Software Foundation facilitates the coordination of common international efforts in high-energy physics software and computing.

CERN (European Organization for Nuclear Research) has participated in GSoC since 2011 as the CERN-SFT group, which provides common software for CERN's experiments. In 2017, the program expanded to include many software projects from the whole field of high-energy physics. The vast majority of our GSoC projects do not require any physics knowledge.

The experiments at CERN, such as those at the Large Hadron Collider (LHC), the world’s largest and most powerful particle accelerator, try to answer fundamental questions about the Universe. For example: What is the nature of mass? What are the elementary building blocks of the Universe? What was the early Universe like? What is the nature of dark matter and dark energy? Why is there an asymmetry between matter and antimatter? In 2012, LHC experiments announced the discovery of a new particle, the Higgs boson, which helps explain how particles obtain mass. Today, particle physicists are analyzing the data from the experiments to study the properties of the newly discovered particle and to search for new physics, such as dark matter or extra dimensions. This requires a lot of sophisticated software. CERN is also the birthplace of the World Wide Web.

The open-source projects to which students can contribute during GSoC span many areas of high-energy physics software: data analysis, detector and accelerator simulation, event reconstruction, data management, and many others. We look forward to your contributions!



Technologies

  • c/c++
  • python
  • data analysis
  • concurrency
  • container orchestration


Topics

  • Science and Medicine
  • particle physics
  • big data science
  • performance optimization
  • machine learning
  • algorithmics

CERN-HSF 2021 Projects

  • Tushar Jain-2
    Accuracy and parallel computation in parton density calculation
    This project is about adding optimised and accurate high-order interpolators to the LHAPDF C++ library. This is required for adding support for...
  • grimmmyshini
    Add numerical differentiation support in Clad
    In mathematics and computer algebra, automatic differentiation (AD) is a set of techniques to numerically evaluate the derivative of a function...
  • parth-07
    Add support for functor objects in Clad
    This proposal aims to add support for directly differentiating functors and lambda expressions and increase Clad coverage and overcome its...
  • Harsh Prakash Gupta
    Add support for in-browser interactive averaging of physics results
    The Heavy Flavour AVeraging (HFLAV) group is responsible for collecting and combining measurements made at different High Energy Physics (HEP)...
  • Rohit Topi
    CernVM-FS preload capability
    CernVM-FS (CVMFS) is a service for fast and reliable software distribution on a global scale. Data is organized in repositories. Files and metadata...
  • George Dialektakis
    Deep autoencoders for ATLAS data compression
    Storage is one of the main limiting factors to the recording of information from proton-proton collision events at the Large Hadron Collider (LHC),...
  • Simon Thor
    Enhancing the Plotting Functionality in MCnet/Rivet
    Rivet is a tool used by particle physicists to test and develop Monte Carlo event generators, i.e., simulation of particle accelerator collisions....
  • Aman Verma
    Implementation of Physical Shape Function
    Zfit is a highly scalable and customizable model manipulation and fitting library. Using TensorFlow as its backend, it has been optimised for simple...
  • Anurag Akella
    Implementing an application for visualizing the LHCb DAQ network
    The supercomputer that is currently being constructed at LHCb will reportedly be the largest real-time data acquisition system in the world in 2021 and...
  • Ahmat Hamdan
    Inference Code Generation for Recurrent Neural Networks
    Toolkit for Multivariate Analysis (TMVA) is a multi-purpose machine learning toolkit integrated into the ROOT scientific software framework, used in...
  • Anubhab Das
    Logging, Unit Test Infrastructure and Precooked Sources for PRMON
    PRocess MONitor (PRMON) is a utility programme used by the Worldwide LHC Computing Grid (WLCG) to monitor the resource consumption and performance of...
  • Matthew Filipovich
    Modern Plotting Machinery for the Large Hadron Collider's Monte Carlo Event Analysis Tool
    The Rivet toolkit is a software package for performing data analysis on simulated particle collision events like those in the Large Hadron Collider...
  • Rakshita Varadarajan
    New protocols for exascale data management with Rucio
    Rucio is an open-source software framework that provides functionality to scientific collaborations to organize, manage, monitor, and access their...
  • Aneesh Chawla
    The goal of the project is to develop and test a suite of ready-to-run benchmarks to measure the performance of various partitioning options on...
  • Adrian-Antonio Petre
    Portability for the Patatrack Pixel Track Reconstruction with Alpaka
    The Compact Muon Solenoid (CMS) experiment is one of the largest experiments at the Large Hadron Collider (LHC) that has been built to search for new...
  • Harshal Anil Shende
    RooFit Development - Intuitive Python bindings for RooFit
    RooFit is a C++ library for statistical data analysis and for modeling the expected distribution of observables measured in particle physics...
  • Sanjiban Sengupta
    ROOT Storage of Deep Learning models in TMVA
    The Toolkit for Multivariate Data Analysis with ROOT (TMVA) provides a machine learning environment for the processing and evaluation of multivariate...
  • Archit_Agrawal
    RooUnfold - Efficient deconvolution using state of the art algorithms
    This project aims to address obsolescence in the underlying RooUnfold implementation that would allow the RooUnfold package to act as a lightweight...
  • Rahul Chauhan-1
    Rucio and CS3API to enable data management for the ScienceMesh cloud
    In recent years, Sync and Share Services have become more relevant than ever in our day-to-day lives. The scientists at CERN are not an exception to...
  • Jimil Desai
    Runtime plugin ecosystem support for OCIS
    This project aims to add runtime pluggability to the Reva framework to enhance developer experience at AARNet, ownCloud and CERN. This would involve:...
  • Martin Kostelník
    Single precision support for GPU acceleration in VecGeom
    The project aims to implement single-precision floating-point support for GPU acceleration in the VecGeom library. This will be achieved by first...
  • Aaradhya Saxena
    TMVA Deep Learning Developments - Inference Code Generation for Batch Normalization
    Toolkit for Multivariate Analysis (TMVA) is a multi-purpose machine learning toolkit integrated into the ROOT scientific software framework, used in...
  • Gurudatta Patil
    Upgrading the Ganga Graphical User Interface
    Ganga is a tool that makes it easier to run data analysis jobs and manage the associated data files. Ganga also allows for submission,...
  • Monalisha Ojha
    Upgrading the Ganga graphical user interface
    Ganga is an open source job management tool for processing and bookkeeping of intensive computational tasks on a wide set of distributed resources....
  • Baidyanath Kundu
    Utilize second order derivatives from Clad in ROOT
    In mathematics and computer algebra, automatic differentiation (AD) is a set of techniques to numerically evaluate the derivative of a function...