Umbrella for Particle Physics-related projects

Technologies
python, c/c++, data analysis, artificial intelligence, container orchestration
Topics
machine learning, big data, algorithmics, particle physics, Performance Optimisation
Umbrella for Particle Physics-related projects
CERN-HSF (High-Energy Physics Software Foundation) is the umbrella organization for high-energy physics-related projects in GSoC. The HEP Software Foundation (http://hepsoftwarefoundation.org/) facilitates the coordination of common international efforts in high-energy physics software and computing. CERN (European Organization for Nuclear Research, https://home.cern) has participated in GSoC since 2011 as the CERN-SFT group, which provides common software for CERN's experiments. In 2017, the program expanded to include many software projects from the whole field of high-energy physics. The vast majority of our GSoC projects do not require any physics knowledge. The experiments at CERN, such as the Large Hadron Collider, the world’s largest and most powerful particle accelerator (http://home.cern/topics/large-hadron-collider) try to answer fundamental questions about the Universe. For example, what is the nature of mass? What are the elementary building blocks of the Universe? What was the early Universe like? What is the nature of dark matter and dark energy? Why is there an asymmetry between matter and antimatter? In 2012, LHC experiments announced the discovery of a new particle, the Higgs Boson, that helps explain how particles obtain mass. Also, CERN is the birthplace of the World Wide Web. Today, particle physicists are working on analyzing the data from the experiments to study the properties of the newly discovered particle and to search for new physics, such as dark matter or extra dimensions. This requires a lot of sophisticated software. The open-source high-energy physics projects to which students can contribute during GSoC span many high-energy physics software projects: data analysis, detector and accelerator simulation, event reconstruction, data management and many others. We look forward to your contributions!
2022 Program

Successful Projects

Contributor
Kush Kothari
Mentor
Jim Pivarski, David Lange
Organization
CERN-HSF
[IRIS-HEP] Uproot + Dask
This project aims to create an API for users that can provide the data in ROOT files directly in a “delayed” form that is supported by Dask. It will...
Contributor
Zephyr
Mentor
Javier Lopez-Gomez, Jakob Blomer
Organization
CERN-HSF
ROOT - Automatic conversion of data stored in TTree form to RNTuple
TTree is the ROOT’s legacy columnar storage that has been used to store more than 1 exabyte of high-energy physics data during the last 25 years....
Contributor
Kavishka Attanayake
Mentor
Beomki Yeo, Charles Leggett
Organization
CERN-HSF
Acts GPU R&D - Optimization of GPU tracking pipeline
Traccc is a demonstrator for GPU tracking algorithms. This project focuses on optimizing GPU utilization of traccc using CUDA Multi Process Service...
Contributor
SimoneBalducci
Mentor
Wahid Redjeb, Antonio Di Pilato, Felice Pantaleo, Marco Rovere
Organization
CERN-HSF
Implementation of a Python library that generalizes the CLUE clustering algorithm
The CLUE algorithm is a clustering algorithm written by Rovere et al. in C++, which is used in High-Energy Physics to reconstruct particle showers by...
Contributor
Neel Shah
Mentor
Lorenzo Moneta, SItong An, Ahmat
Organization
CERN-HSF
ROOT - TMVA SOFIE Developments - Inference Code Generation for Deep Learning models
Toolkit for Multivariate Analysis (TMVA) is a multi-purpose machine learning toolkit integrated into the ROOT scientific software framework, used in...
Contributor
Manish Kausik
Mentor
Vassil Vassilev, David Lange, William Moses
Organization
CERN-HSF
Add Initial Integration of Clad with Enzyme
Clad is an open source plugin to the Clang compiler that detects from the parsed Abstract syntax tree, calls to differentiate a defined function,...
Contributor
Sanchi Mittal
Mentor
Lorenzo Moneta, Sanjiban Sengupta, SItong An, Omar Zapata
Organization
CERN-HSF
ROOT - Machine Learning Developments - Batch Generator for training machine learning models
Toolkit for Multivariate Analysis (TMVA) is a multi-purpose machine learning toolkit integrated into the ROOT scientific software framework, used in...
Contributor
Tirthankar Mazumder
Mentor
Paul Gessinger, Joana Niermann, Hadrien
Organization
CERN-HSF
Vectorized Linear Algebra Implementation for ACTS
This project is for implementing a vectorized linear algebra backend. The current linear algebra implementation uses scalar instructions, but we can...
Contributor
Divyansh Tiwari
Mentor
Soon Yung Jun, John Apostolakis, Renee Fatemi
Organization
CERN-HSF
Symplectic Integrators
The Geat4 toolkit lacks numerical integrators that preserve energy over the long course of simulations. Specifically, the g-2 physics experiment at...
Contributor
Giorgos Apostolopoulos
Mentor
Alexander Richards, Ulrik, Mark Smith
Organization
CERN-HSF
A concurrency model for the monitoring in Ganga
Ganga routinely handles a tremendous amount of computational tasks simultaneously and keeping track of their status is prone to lagging behind with...
Contributor
Priyam Mehta
Mentor
Dalila Salamani, Anna
Organization
CERN-HSF
Geant4-FastSim - Memory footprint optimization for ML fast shower simulation
Geant4 is a highly accurate and detailed simulation toolkit used for simulating the passage of particles through matter. Due to its strict precision...
Contributor
Guneet Singh Kohli
Mentor
Dalila Salamani, Anna
Organization
CERN-HSF
Project Proposal Geant4-FastSim - ML pipeline optimization using Kubeflow
This is the tentative proposal submission for the Geant4-FastSim - Building an ML pipeline for fast shower simulation. This proposal discusses a...
Contributor
Soumil Baldota
Mentor
Benedikt Hegner, Thomas Madlener
Organization
CERN-HSF
Interfacing PODIO to Julia
Currently, two major languages are used in high-energy physics (HEP): C++ for numerically intensive code, where execution speed is critical, and...
Contributor
Harshil Jani
Mentor
Guilherme Amadio, Bernhard Manfred Gruber
Organization
CERN-HSF
Geant4 - Performance Data Visualization using d3.js
The project aims at monitoring performance and improving the data visuals used to represent the performance report for Geant4 which is a toolkit for...
Contributor
Jun Zhang
Mentor
Vassil Vassilev, Alexander Penev, David Lange
Organization
CERN-HSF
Optimize ROOT use of modules for large codebases
ROOT is a data analysis framework designed to handle large amounts of data with high performance. This proposal aims at optimizing the performance of...
Contributor
arul
Mentor
Thomas Kuhr, Nikolai Hartmann
Organization
CERN-HSF
Advanced Belle II Software Validation
The international Belle II collaboration handles huge amounts of data from the Belle II detector, relying on custom software for simulation,...
Contributor
Nimish B
Mentor
Martin B, Mario Lassnig, Mayank Sharma, cserfon
Organization
CERN-HSF
Rucio WebUI Revamp
Rucio has proven its potential to be used for providing functionality to scientific collaborations to organize, manage, monitor, and access their...
Contributor
Harshal Shende
Mentor
Lorenzo Moneta, Sanjiban Sengupta, Omar Zapata
Organization
CERN-HSF
TMVA Developments - Improve Python interface for TMVA
Toolkit for Multivariate Analysis (TMVA) is a multi-purpose machine learning toolkit integrated into the ROOT scientific software framework. The...
Contributor
Ujwal Kundur
Mentor
Sylvester Joosten, Wouter Deconinck
Organization
CERN-HSF
Electromagnetic Cluster Finding on GPUs
The goal of this project is to re-implement the current clustering algorithms using SYCL for higher-throughput processing and handling future...
Contributor
Mohammad Humayun Khan
Mentor
Fawad Ali, Edward Moyse
Organization
CERN-HSF
Revamped Testing Infrastructure for Phoenix
This project aims to introduce a new testing strategy so that we encourage Behaviour-driven Development here, at Phoenix — the official web event...
Contributor
Kyle Fan
Mentor
BenjaminFuks, jackaraz
Organization
CERN-HSF
MadAnalysis 5 - Integration of theoretical uncertainty calculation with multi-weight integration
The purpose of this project is to enable multi-weight calculations in the MadAnalysis 5 platform. The current implementation only uses the nominal...