This project aims to build an automatic speech recognition (ASR) pipeline for a European language (German). The pipeline must be built as a Singularity container on the Case HPC and put into production to process daily incoming files. The project proposes to use state-of-the-art open-source speech-to-text toolkits (Kaldi, DeepSpeech).

The work can be divided into the following sub-tasks:

  1. Data cleaning and feature extraction
  2. Language modeling
  3. Building the phoneme dictionary
  4. Acoustic modeling
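
As an illustration of the first sub-task, the sketch below shows one way German transcripts might be cleaned before they feed language modeling and dictionary building. It is a minimal example under stated assumptions, not the project's actual code: the function names `normalize_transcript` and `build_vocabulary` are hypothetical, the allowed character set (a-z, umlauts, ß) is an assumption, and digits are simply dropped here, whereas a production pipeline would more likely spell them out.

```python
import re
import unicodedata
from collections import Counter

# Assumption: cleaned transcripts contain only a-z, German umlauts, eszett, and spaces.
_DISALLOWED = re.compile(r"[^a-zäöüß ]")


def normalize_transcript(text: str) -> str:
    """Lowercase the text, replace disallowed characters with spaces,
    and collapse runs of whitespace."""
    # NFC makes sure an umlaut is a single code point rather than letter + combining mark.
    text = unicodedata.normalize("NFC", text).lower()
    text = _DISALLOWED.sub(" ", text)
    return " ".join(text.split())


def build_vocabulary(lines):
    """Count word occurrences over the normalized transcripts.

    The resulting word list is the kind of input that a language model
    and a phoneme dictionary are typically built from.
    """
    counts = Counter()
    for line in lines:
        counts.update(normalize_transcript(line).split())
    return counts


if __name__ == "__main__":
    sample = ["Guten Tag, Herr Müller!", "guten Abend"]
    for word, count in build_vocabulary(sample).most_common():
        print(word, count)
```

Keeping normalization in one function means the same rules can be applied to training transcripts and to the text used for the language model, so the vocabulary stays consistent across the later sub-tasks.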

Organization

Student

Aashish Agarwal

Mentors

  • Elmar Noeth
  • Sumit Vohra
  • Jan Gorisch

2019