“Htsget retrieval API spec” [1] is a specification by the Global Alliance for Genomics and Health (GA4GH) for genomics bulk data transfers. This is a Streaming API that allows users to query remote services for regions of interest of genomic data, instead of having to download gigabytes of data and filter afterwards. The results are retrieved in formats that can be consumed by widely used bioinformatics tools. The specification currently supports BAM and CRAM formats.
Variant Call Format (VCF) is the standard for describing genomic variation in bioinformatics. The goal of the project is to extend the specification to support the VCF format and provide an implementation.

Student

Amila Silva

Mentors

  • Cristina Yenyxe Gonzalez
  • Pablo Arce Garcia
close

2017