Implement a Pipeline to Extract and Transform GDC Data
- Mentors
- Benjamin Gross, Angelica Ochoa, Zachary Heins
- Organization
- cBioPortal for Cancer Genomics
The goal of this project is to extract Cancer Genomic data available from NCI's Genomic Data Commons and transform them according to the file formats required by cBioPortal. There is currently no ET pipeline existing to import data from NCI data repository and this project will add this feature to cBioPortal. The user will simply need to run a Batch that will transform genomic data from GDC into accepted file formats.