Umbrella organization for Particle (High-Energy) Physics-related projects

Technologies
python, c/c++, machine learning, data analysis, parallelization
Topics
machine learning, big data, physics, particle physics, high-energy physics

CERN-HSF (High-Energy Physics Software Foundation) is the umbrella organization for high-energy physics-related projects in GSoC. The HEP Software Foundation (http://hepsoftwarefoundation.org/) facilitates the coordination of common international efforts in high-energy physics software and computing.

CERN (European Organization for Nuclear Research, https://home.cern) has participated in GSoC since 2011 as the CERN-SFT group, which provides common software for CERN's experiments. In 2017, the program expanded to include many software projects from the whole field of high-energy physics. The vast majority of our GSoC projects do not require any physics knowledge.

The experiments at CERN, such as those at the Large Hadron Collider (LHC), the world’s largest and most powerful particle accelerator (http://home.cern/topics/large-hadron-collider), try to answer fundamental questions about the Universe. For example: What is the nature of mass? What are the elementary building blocks of the Universe? What was the early Universe like? What is the nature of dark matter and dark energy? Why is there an asymmetry between matter and antimatter? In 2012, LHC experiments announced the discovery of a new particle, the Higgs boson, which helps explain how particles obtain mass. CERN is also the birthplace of the World Wide Web. Today, particle physicists are analyzing the data from the experiments to study the properties of the newly discovered particle and to search for new physics, such as dark matter or extra dimensions. This work requires a lot of sophisticated software.

The open-source projects to which students can contribute during GSoC span many areas of high-energy physics software: data analysis, detector and accelerator simulation, event reconstruction, data management and many others. We look forward to your contributions!

2019 Program

Successful Projects

Contributor
Charles Escott
Mentor
David Lange, Jim Pivarski
Organization
CERN-HSF
CERN Awkward Array Project
At CERN, the data from LHC collisions requires complex data types and functions to be processed. As a solution, the awkward-array library makes...
Contributor
Ishan Rai
Mentor
Alexander Richards, Mark Smith, Ulrik Egede
Organization
CERN-HSF
Optimisation of the Ganga toolkit in terms of memory consumption and persistent storage.
GANGA (Gaudi/Athena and Grid Alliance) is an interface used by scientists to access the huge amount of computing power and storage available to...
Contributor
Hasan Öztürk
Mentor
Serhan Mete
Organization
CERN-HSF
Proposal for Atlas Experiment - Hasan Öztürk
The Athena framework is being upgraded to run in a multithreaded environment, and the aim of this project is to create a new ATLAS performance monitoring...
Contributor
Alfonso
Mentor
Oksana Shadura, Brian Bockelman
Organization
CERN-HSF
Novel Applications of Zstandard (ZSTD) compression algorithm to ROOT
This project aims to investigate the application of ZSTD within the ROOT framework, benchmark it against the other algorithms, and test it against...
Contributor
Akash Ravi
Mentor
Jakub Moscicki, Diogo Castro, Enric Tejedor, Enrico Bocchi
Organization
CERN-HSF
Package manager for Jupyter Notebook / SWAN
This proposal aims to develop a Jupyter notebook extension that will allow users to specify Python modules (and their respective versions)...
Contributor
Arturo Garza Rodriguez
Mentor
Guilherme Amadio, Andrei Gheata
Organization
CERN-HSF
VectorFlow
A C++ adapter API for integrating vectorized components in a scalar workflow. Many FLOP-intensive algorithms may profit from the vector pipelines of...
Contributor
Jonathan Guiang
Mentor
Igor Sfiligoi, Diego Davila Foyo
Organization
CERN-HSF
Tools for Understanding CMS Data Access
Over the course of Run 2, from 2016 to 2018, the CMS detector produced an unparalleled amount of data, resulting in an intricate optimization problem...
Contributor
Mohamed Moanis Ali
Mentor
Simon Spannagel, Koen Wolters, Daniel Hynds
Organization
CERN-HSF
Implement Event based Seeding and Multi-Threading
Pursuing the goal of running the events of an Allpix-Squared simulation (independent by nature) in parallel has led to the identification of performance...
Contributor
Aman Singh Thakur
Mentor
Marlene Turner, Spencer Gessner
Organization
CERN-HSF
Building a Python-based Analysis tool for AWAKE experiment
Building a library that reads a large number of HDF files and builds a database. Add support for searching and loading multiple datasets,...
Contributor
Brooks Karlik
Mentor
Hadrien Grasland, Paul Gessinger, Andreas Salzburger
Organization
CERN-HSF
Kalman Filter in Rust
The Kalman Filter is a method of iteratively predicting the future state of a system based on previous information. Not only is a Kalman Filter more...
Contributor
Sharad Chitlangia
Mentor
Moritz Kiehn, Andreas Salzburger, David Rousseau
Organization
CERN-HSF
Implementation and Optimisation in ACTS of algorithms exposed in TrackML challenge
Porting and analysis of the top solution algorithms from the TrackML challenge to the ACTS framework. The algorithms include the combinatorial Mikado...
Contributor
Pradeep Kumar S
Mentor
Niko Neufeld, Daniel Cámpora
Organization
CERN-HSF
SIMT to SPMD Translation
High Level Trigger 1 (HLT1) is the first and critical stage in the software reconstruction of collisions at the LHCb experiment in the Large Hadron...
Contributor
Emilio Cortina Labra
Mentor
Andreas Salzburger, Edward Moyse, Riccardo Maria Bianchi
Organization
CERN-HSF
Experiment independent display framework and data format
Developing a new data format to represent event data, unifying the needs of the different experiments that will make use of it. Improvement on the...
Contributor
Arpitha Raghunandan
Mentor
Vassil Vassilev, Oksana Shadura, Yuka Takahashi
Organization
CERN-HSF
Implement a GlobalModuleIndex in ROOT and Cling
ROOT has several features which interact with libraries and require implicit header inclusion. These headers are often immutable, and reparsing is...
Contributor
Ashish Kshirsagar
Mentor
Omar Andres Zapata Mesa, Manos Stergiadis, Gerardo Gutiérrez, Sergei Gleyzer
Organization
CERN-HSF
Generative Adversarial Networks for Particle Physics Applications
The project aims to implement GANs in TMVA, the machine learning toolkit of the ROOT framework, which would be immensely useful because of the advent,...
Contributor
Shrey Aryan
Mentor
Benedikt Hegner, Graeme Stewart, Frank Gaede
Organization
CERN-HSF
Proposal for the Implementation of an HDF5 IO Layer for PODIO
PODIO is a C++ library that allows the creation of event data models and efficient I/O code for HEP experiments. It does so by avoiding deep-object...
Contributor
Sneha Sinha
Mentor
Mayank Sharma, Maarten Litmaath
Organization
CERN-HSF
Python Components for the SIMPLE Grid Framework
The SIMPLE Grid project is an extension of the SIMPLE Framework that combines popular configuration management technologies such as Puppet/Ansible...
Contributor
Himanshu Sahu
Mentor
Michi Hostettler, kaifox, Andrea Calia
Organization
CERN-HSF
Molr - Operational
In view of LHC Run 3, we want to extend the functionality of Molr so that it will be ready for use in production to control various operational...
Contributor
Ruturaj Gujar
Mentor
Cedric Serfon, Martin Barisits, Tobias Wegner, Mario Lassnig
Organization
CERN-HSF
Rucio - Exascale Data Management
Rucio is a data management system that provides the functionality to organize, manage and access a large amount of scientific data (on the order of...
Contributor
Surya S Dwivedi
Mentor
Lorenzo Moneta, Sitong An, Omar Andres Zapata Mesa
Organization
CERN-HSF
Development of LSTM and GRU layers in TMVA
This project is about the development of Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) layers in TMVA, both of which belong to a general...
Contributor
Pujan Mehta
Mentor
Federico Stagni, Christophe Haen, Zoltan Mathe
Organization
CERN-HSF
Monitoring DIRAC Components
DIRAC is highly scalable software used for accessing resources from various distributed systems. DIRAC’s main contributor is LHCb and...
Contributor
danieldo
Mentor
Costin Grigoras
Organization
CERN-HSF
Real-time conditions data distribution for the Online data processing of the ALICE experiment
ALICE (A Large Ion Collider Experiment) is a heavy-ion detector on the Large Hadron Collider (LHC) ring. It is designed to study the physics of...
Contributor
Mohit Tyagi
Mentor
Hugo Gonzalez Labrador, Jakub Moscicki, Michael D'Silva, Thomas Müller
Organization
CERN-HSF
CERNBox: Bring Your Own Application
CERNBox provides cloud data storage to all CERN users to store, share and synchronize their data across all devices. It is integrated with a variety of...
Contributor
Sahil Jajodia
Mentor
Diogo Castro, Prasanth Kothuri, Piotr Mrówczyński, Enric Tejedor
Organization
CERN-HSF
Creation and usage of disposable Spark on Kubernetes cluster from SWAN notebook
This project aims to develop a Jupyter notebook plugin which deploys the services required by Spark to a Kubernetes cluster on the OpenStack cloud at CERN....
Contributor
Divya Rani
Mentor
Jakub Moscicki, Diogo Castro, João Vicente, Enric Tejedor, Enrico Bocchi
Organization
CERN-HSF
Testing framework for Jupyter notebooks
SWAN (Service for Web-based ANalysis) is a cloud data analysis service developed and powered by CERN that provides Jupyter notebooks on demand. It is...
Contributor
Inzamam Iqbal
Mentor
Alexander Richards, Mark Smith, Ulrik Egede
Organization
CERN-HSF
Create a user interface for Ganga that allows for the execution of tasks inside user specified virtual machines.
Ganga is used to execute a user-defined computational task on a distributed back-end. Through this project we let users define the environment in...
Contributor
Amarnath Karthi
Mentor
Federico Stagni, Christophe Haen, Zoltan Mathe
Organization
CERN-HSF
Distributed Computing Resources: aggregation, usage, monitoring
DIRAC is an open-source interware platform whose roles range from the submission of jobs and the management of the data produced to the orchestration of the...
Contributor
Abhishek Chauhan
Mentor
Julius Hrivnac, Julien Peloton, Chris Arnault
Organization
CERN-HSF
Alert Redistribution System for Fink : an Apache Spark based Broker for Astronomy
A huge volume of data is generated every night by large astronomical telescopes around the world. A robust and scalable software infrastructure is...
Contributor
Jack Qiu
Mentor
Vassil Vassilev, Oksana Shadura, Aleksandr Efremov
Organization
CERN-HSF
Generating Hessians and Jacobians via CLAD
Clad is a C++ Clang compiler plugin that employs automatic differentiation to differentiate user-defined functions, performing source code transformations...