CMU Sphinx

Fast, Accurate, Flexible Open Source Speech Recognition

Technologies
c, python, javascript, cross-platform, hidden markov models
Topics
education, real time, user interface, speech recognition, pronunciation
Fast, Accurate, Flexible Open Source Speech Recognition

Since being released as open source code in 1999, CMU Sphinx provides a platform for building speech recognition applications. It's used in desktop control software, telephony platforms, intelligent houses, computer-assisted language learning tools, information retrieval and mobile applications. Traditionally, CMUSphinx provides support for low-resource and underdeveloped languages.

2017 Program

Successful Projects

Contributor
Sahith Dambekodi
Mentor
ronanki, lanceculnane, James Salsman, Patrick Morris-Suzuki, prnk28
Organization
CMU Sphinx
General Computer-Assisted Learning including Speaking Skills
This proposal is for the implementation of Computer-Aided Language Learning from the browser using Pocketsphinx.js. Using Twee/Twine as the question...
Contributor
Yifan Xiong
Mentor
John Mark Vandenberg, Imran Sheikh, Arseniy Gorin
Organization
CMU Sphinx
Collect Pronunciation Dictionaries from Wiktionary
This Collect Pronunciation Dictionaries from Wiktionary project aims to expand pronunciation dictionaries in CMUSphinx for new words and multiple...
Contributor
Vishal Agrawal
Mentor
Bhiksha Raj, Alexander Rudnicky
Organization
CMU Sphinx
Large Vocabulary Continuous Speech Recognition using RNNs
Automatic speech recognition systems are traditionally based on Hidden Markov models(HMMs) as it had given the best results in the past. However due...
Contributor
S SINGH
Mentor
Tom Hartung, James Salsman
Organization
CMU Sphinx
Leaderboard Population, Sorting, and Column Maintenance
The project scope includes designing a web based leader board module for CMU Sphinx which would include details of top ranking people, models,...
Contributor
Pavel Denisov
Mentor
James Salsman, Christopher Bader
Organization
CMU Sphinx
Diphone alignment and acoustic scores
The goal of this project is to extend PocketSphinx so that it can calculate and provide alignments and acoustic scores for diphones similarly to how...
Contributor
Muhammad Shah
Mentor
Bhiksha Raj, Nikolay Shmyrev
Organization
CMU Sphinx
DNNs for Sphinx
In the past decade Deep Neural Networks (DNNs) have been be demonstrated to excel to wide range of learning tasks, including speech recognition. More...
Contributor
Hammad Abdullah
Mentor
Bhiksha Raj, Nikolay Shmyrev
Organization
CMU Sphinx
MLP incorporation in Sphinx
CMUsphinx is a fast and flexible open source speech recognition toolkit. Currently it uses GMM acoustic models and it gives reasonable accuracy. This...
Contributor
Brij
Mentor
lanceculnane, James Salsman, Patrick Morris-Suzuki, ronanki
Organization
CMU Sphinx
Pronunciation Intelligibility Remediation with Pocketsphinx.js
The proposed framework will implement a user interface capable of collecting audio signal from microphone corresponding to a prompted text and...
Contributor
Rishi Rajasekaran
Mentor
lanceculnane, James Salsman
Organization
CMU Sphinx
Transcriber Qualification
The aim of this project is to write support code for obtaining accurate transcriptions, exemplar pronunciations, and phonetic and part-of-speech...
Contributor
Mritunjay Goutam
Mentor
James Salsman, Carl Fredrik Sjöland
Organization
CMU Sphinx
Javascript simulation of the brain’s motor control of vocal tract-GSoC 2017 proposal
As mention in the title, it is a 3D animation model of the motor control of speech production. Our long range goal is to implement the model...
Contributor
Pankaj Baranwal
Mentor
Sarah Elliott, Arseniy Gorin
Organization
CMU Sphinx
ROS Integration for pocketsphinx
ROS has support for pocketsphinx, but it is very initial stage. So, this project aims to update the ROS package for pocketsphinx with a full-scale...