The umbrella organization for Particle (High-Energy) Physics-related projects

CERN-HSF (High-Energy Physics Software Foundation) is the umbrella organization for high-energy physics-related projects in GSoC. The HEP Software Foundation ( facilitates the coordination of common international efforts in high-energy physics software and computing.

CERN (European Organization for Nuclear Research, has participated in GSoC since 2011 as the CERN-SFT group, which provides common software for CERN's experiments. In 2017, the program expanded to include many software projects from the whole field of high-energy physics. The vast majority of our GSoC projects do not require any physics knowledge.

The experiments at CERN, such as the Large Hadron Collider, the world’s largest and most powerful particle accelerator ( try to answer fundamental questions about the Universe. For example, what is the nature of mass? What are the elementary building blocks of the Universe? What was the early Universe like? What is the nature of dark matter and dark energy? Why is there an asymmetry between matter and antimatter? In 2012, LHC experiments announced the discovery of a new particle, the Higgs Boson, that helps explain how particles obtain mass. Also, CERN is the birthplace of the World Wide Web. Today, particle physicists are working on analyzing the data from the experiments to study the properties of the newly discovered particle and to search for new physics, such as dark matter or extra dimensions. This requires a lot of sophisticated software.

The open-source high-energy physics projects to which students can contribute during GSoC span many high-energy physics software projects: data analysis, detector and accelerator simulation, event reconstruction, data management and many others. We look forward to your contributions!

lightbulb_outline View ideas list


  • c/c++
  • python
  • data analysis
  • machine learning
  • concurrency


  • Science and Medicine
  • big data
  • machine learning
  • performance optimization
  • algorithmics
  • particle physics
comment IRC Channel
email Mailing list
mail_outline Contact email

CERN-HSF 2020 Projects

  • Chi Lung Cheng
    (QMLHEP Proposal) Quple - Quantum Machine Learning Framework for High Energy Physics
    This project serves to implement a common framework for applying quantum machine learning algorithms to high energy physics analysis. A major focus...
  • Anish Biswas
    Awkward Array GPU Kernels
    The goal of this project is to add a GPU backend to Awkward Array 1.x. With such a backend, the library would have a unique capability of processing...
  • K Pranath Reddy
    Building a Python-based framework for Unsupervised Deep Learning applications in strong lensing cosmology
    Gravitational lensing has been a cornerstone in many cosmology experiments, and studies since it was discussed in Einstein’s calculations back in...
  • Mohit Tyagi
    CernVM-FS: Podman cvmfs integration
    It has been shown that only a small portion of all the files in a container image is necessary to run the image itself. This is even more accentuated...
  • Nidhi Hegde
    Cosmic-Ray Imaging Studies via Mission-Imagery from Space (CRISMIS)
    CRISMIS is an open-source, AI-based tool for the identification and classification of cosmic-ray artifacts in imaging data. Its purpose is to cater...
  • Prateek Agnihotri
    Deep Learning Algorithms for Muon Momentum Estimation in the CMS Trigger System
    CMS is a general-purpose detector at LHC. During a run, it generates about 40 TB data per second. Since It is not feasible to readout and store such...
  • Honey Gupta
    Deep-compression for HEP data
    Storage is one of the main limiting factors to the recording of information from proton-proton collision events at the Large Hadron Collider at CERN....
  • Surya S Dwivedi
    Development of 3D CNN in TMVA
    This project is about the development of 3D CNN functionality in TMVA. I shall be developing both 3D convolution as well as 3D pooling layers. 3D...
  • Vaibhav Garg
    Enable Modules on Windows
    ROOT has several features that interact with libraries and require implicit header inclusion. This can be triggered by reading or writing data on...
  • Shravan Chaudhari-1
    End-to-End Deep Learning Reconstruction for CMS Experiment
    Developing End-to-End Deep Learning Models and optimizing them for the Reconstruction of single particles, jets and event topologies of interest in...
  • aditigupta17
    Extend SkyhookDM programmable object storage with statistics, sort/aggregate or data compaction functions.
    SkyhookDM supports dynamic data management in the cloud by enabling data management tasks to be executed directly within the storage. It uses...
  • Ali Hariri
    Falcon: Fast Simulation using Deep Generative Models
    Prior to the HL-LHC operation, detector simulations undergo further developments in order to adapt to the increasing amounts of events. Current...
  • Max Orok
    Fast Merging of RNTuple Data Sets
    The RNTuple interface is an experimental I/O subsystem for the ROOT analysis framework. Preliminary benchmarks have demonstrated significant...
  • Kalana Wijethunga
    Framework for Automating the Retrieval and Analysis of System Information of Grid sites in WLCG
    JAliEn introduces automatic containerization for jobs submitted to the WLCG. To fully utilize this feature, and to plan ahead for how new software...
  • Huajian Qiu
    GeoModelExplorer: improving interactive 3D geometry visualization tool based on Qt3D
    Interactive visualization can provide intuitive insights for experiment setups and results in many domains, including High Energy Physics. Therefore,...
  • Stanislav Lukyanenko
    Graph Neural Networks
    Graph neural networks is a powerful generalization of a whole class of neural (and not only neural) architectures on the data, organized in a graph....
  • Shuo Liu
    Hist: histogramming for analysis powered by boost-histogram
    The Scikit-HEP project is a collection of several dozen packages intended to facilitate the use of Python in High Energy Physics. One of the major...
  • Andrei Mihailescu
    Implementation of Gilbert-Johnson-Keerthi Algorithm for Convex Shapes in VecGeom
    The goal of this project is to implement the Gilbert-Johnson-Keerthi algorithm on convex objects in VecGeom along with a ray casting algorithm that...
  • Varun Bankar
    Implementation of Graphical User Interface for Ganga
    The idea of this project is to create an additional interface for users to interact with Ganga. The new responsive Graphical User Interface (GUI)...
  • Muhammad Aditya Hilmy
    Integration of Rucio in JupyterLab for SWAN
    CERN and the high energy physics community stores their data in various storage, with varying locations and medium (from object storage to magnetic...
  • Rahul Indra
    Intelligent Alert Management System For HEP experiments
    This project aims to develop an intelligent and reliable monitoring system for large distributed services to monitor their status and reduce...
  • Shivansh Saini
    Kubernetes operator for XRootD
    This project aims to develop a Kubernetes Operator for XRootD, along with its related documentation, in order to ease and fully automate deployment...
  • drholmie
    Local replica of JAliEn central services for research and development
    In preparation of LHC's next run, CERN has been developing JAliEn, a successor to their current grid computing middleware, AliEn. To ease development...
  • Yash Datta
    Manipulation of massive astronomical data using graphs
    AstroLab Software has developed Fink, an apache-spark based broker infrastructure, which is able to analyze this large stream of alerts data from the...
  • Aditya Kumar-2
    MCnet/Rivet - Speed and accuracy in the LHC's MC analysis tool
    Rivet is a software package for performing data analysis on simulated particle collision events like those in the Large Hadron Collider. This project...
  • Nick Rozinsky
    Enhance a statistical toolkit for physics analysis interpretation.
  • Vivek Nigam
    Native Desktop Application for Rucio
    Rucio is a data management framework which provides a multilevel organization, management, and access to large scale data generated during scientific...
  • Fawad
    Phoenix - Experiment Independent Event Display
    Phoenix is a web-based, experiment independent event display framework in JavaScript for visualizing HEP experiment data from multiple experiments....
  • Keisuke Kamahori
    Pre-conditioners applied to ROOT compression algorithms
    This project aims to validate the possibility of using pre-conditioners in compressing ROOT file formats (TTree and RNTuple); to understand how we...
  • Lucas Camolezi
    Reduce boost dependence in CMSSW
    This project has the goal to find and decrease boost dependencies in CMSSW. Modern C++ introduced a lot of new features that were only available...
  • Vasilis Mageirakos
    Support for Rucio Users with Natural Language Processing
    The aim of the project is to use Natural Language Processing (NLP) to develop an intelligent bot able to provide satisfying answers to users and...
  • Anirudh Dagar
    TMVA Interfaces - Pytorch
    Toolkit for Multivariate Analysis (TMVA) is a multi-purpose machine learning toolkit integrated into the ROOT scientific software framework, used in...
  • Ratin Kumar
    Upgrading the Ganga user interface to use a relational database for persistent storage
    This project will replace the old xml-based metadata system with a database, which will be assisted by a caching mechanism that will allow for a...
  • Ayaan Zaidi
    Workflow configuration import and validation for AliECS
    The data-driven workflow dumps generated by the DPL (O2 Data Processing Layer) cannot be directly imported into AliECS. These workflow templates are...