CERN-HEPSoftwareFoundation, umbrella organization for particle physics in GSoC

CERN-HSF (High Energy Physics Software Foundation) is the umbrella organization for high-energy physics-related projects in GSoC. The HEP Software Foundation ( facilitates the coordination of common international efforts in high-energy physics software and computing.

CERN (European Organization for Nuclear Research, has participated in GSoC since 2011 as CERN-SFT group that provides common software for CERN's experiments. This year the program has expanded to include many software projects from the whole field of high-energy physics. The vast majority of our GSoC projects do not require any physics knowledge.

The experiments at CERN, such as the Large Hadron Collider, the world’s largest and most powerful particle accelerator try to answer fundamental questions about the Universe. For example, what is the nature of mass? What are the elementary building blocks of the Universe? What was the early Universe like? What is the nature of dark matter and dark energy? Why is there an asymmetry between matter and antimatter? In 2012, LHC experiments have announced the discovery of a new particle, the Higgs Boson, that helps explain how particles obtain mass. Also, CERN is the birthplace of the World Wide Web. Today, particle physicists are working on analyzing the data from the experiments to study the properties of the newly discovered particle and to search for new physics, such as dark matter or extra dimensions. This requires a lot of sophisticated software.

The open-source high-energy physics projects to which students can contribute during GSoC span many high-energy physics software projects: detector and accelerator simulation (Falcon, Geant4, GeantV, Pythia, SixTrack), data analysis (ROOT, TMVA, scikit-HEP, Go-HEP), event reconstruction (CMS Tracking), data management (transfer2go) and many others. We look forward to your contributions!

lightbulb_outline View ideas list


  • python
  • c/c++
  • machine learning
  • data analysis
  • parallel algorithms


email Mailing list
mail_outline Contact email

CERN-HSF 2017 Projects

  • George Papadrosou
    Assess ROOT's asynchronous data prefetching
    Assess the performance of ROOT's asynchronous data prefetching compared to standard prefetching mechanism for reading remote ROOT files.
  • Kunal_Singh_
    Automatic code generation for fast inference in the CMS tracking software
    Extracting shape information from simulated CMS events Using Tensorflow to train a DNN based on the shape of the hits, detector information and...
  • Krishnan R
    Big Data Tools for Physics Analysis
    Jupyter Notebook is an interactive computing environment that is used to create notebooks which contain code, output, plots, widgets and theory....
  • Saurav Shekhar
    Convolutional Deep Neural Networks on GPUs for Particle Physics Applications
    Toolkit for Multivariate Analysis (TMVA) is a machine learning toolkit for the ROOT scientific software framework. It contains a variety of...
  • ajatgd
    Convolutional Deep Neural Networks on GPUs for Particle Physics Applications
    In the last few years, deep learning has led to very good performance on a variety of problems, such as visual recognition, speech recognition and...
  • Vladimir Ilievski
    Convolutional Deep Neural Networks on GPUs for Particle Physics Applications
    The Convolutional Neural Networks (CNNs) are one special type of a deep learning neural networks with an enormous discriminative power for image...
  • Vyom Sharma
    Deep Anomaly
    This project is aimed towards building a framework which monitors incoming ATLAS Computing Operations data for anomalies and then autonomously, acts...
  • Deepanshu Thakur
    Efficient Python routines for analysis on massively multi-threaded platforms-Python bindings for the Hydra C++ library
    ‘Effective Python routines for analysis on massively multi-threaded platforms-Python bindings for the Hydra C++ library'. The title of this project...
  • vikas gupta
    Electron-ion collision in Pythia8
    PYTHIA is used for generation of events in high-energy collisions. Originally developed in fortran until Pythia6.4, it was rewritten in C++ starting...
  • Alejandro García Montoro
    Improvements in vectorization and parallelization of ROOT Math libraries
    ROOT is a data analysis software used by scientists all over the world; therefore, its efficiency is critical for the scientific community in general...
  • Sami Ullah
    Integration of TMVA and OpenML platform
    OpenML is an open source project that aims to create a novel ecosystem for machine learning experimentation. It is possible to integrate different...
  • Bastian Wieck
    Jet Clustering Optimizations in Fads
    Fads is a fast detector simulation toolkit in Go used for High Energy Physics analyses. The current version is not very scalable and it takes up too...
  • Vibhav Pant
    Launching Wagon, a WebAssembly interpreter in Go
    WebAssembly (wasm) is a low level bytecode binary file format made for execution on the Web, designed as a compilation target for various programming...
  • Brian Lui
    Machine Learning Project: Multi-target Regression for Particle Physics
    Toolkit for Multivariate Data Analysis (TMVA) is a multi-purpose machine learning toolkit integrated into the ROOT scientific software network, used...
  • Dmitry Sorokin
    New error control methods for integration of trajectories
    Track of a charged particle in a non-uniform electromagnetic field can be described by the first-order initial value problem (IVP). Robustness and...
  • Rishi Shah
    Next generation of CMS data replication system
    The CMS (Compact Muon Solenoid) is a high-energy physics experiment at the LHC. The CMS PhEDEx(Physics Experiment Data Export) project is responsible...
  • Vitalii Drohan
    Optimisation of GeantV HPC workload balancing
    This is proposal for implementing suggested HPC aware infrastructure with workload balancer.
  • Somesh Singh
    Smart Data Structures in CUDA
    The CUDA language, used for programming NVIDIA GPUs, is not dynamic-data-structure friendly since it was designed for working with data structures of...
  • Dong-hee Na
    Updating gopy to support Python3 and PyPy
    gopy is an excellent tool which generates (and compiles) a CPython extension module from a go package. And I hope more developers could make full use...
  • Dima Savin
    Using Pseudo-random number repeatably in a fine-grain multithreaded simulation
    Particle transport Monte Carlo simulations are a key tool for High Energy Physics experiments, including the LHC experiments at CERN. All Monte Carlo...