Contributor
Anuj Solanki

Enhance Speech Recognition Demo for AGL


Mentors
Walt Miner, Jan-Simon Möller, Scott Murray, Marius Vlad
Organization
The Linux Foundation
Technologies
python, linux, flutter, dart, grpc, NLU
Topics
machine learning, cloud, nlp
The goal of this project is to enhance the speech recognition demo in Automotive Grade Linux by integrating Whisper AI for improved accuracy and expanding the natural language understanding (NLU) capabilities. Key Deliverables: Implement Whisper AI for speech-to-text conversion in both online and offline modes. Expand NLU to support more voice commands by integrating APIs like soundmanager, weather etc. Modify the existing Flutter app to accommodate the new features. Implement audible feedback for command execution. Potentially integrate voice control with other apps/services like navigation, phone, Wi-Fi (stretch goal).