Enhance Speech Recognition Demo for AGL
- Mentors
- Walt Miner, Jan-Simon Möller, Scott Murray, Marius Vlad
- Organization
- The Linux Foundation
- Technologies
- python, linux, flutter, dart, grpc, NLU
- Topics
- machine learning, cloud, nlp
The goal of this project is to enhance the speech recognition demo in Automotive Grade Linux by integrating Whisper AI for improved accuracy and expanding the natural language understanding (NLU) capabilities.
Key Deliverables:
Implement Whisper AI for speech-to-text conversion in both online and offline modes.
Expand NLU to support more voice commands by integrating APIs like soundmanager, weather etc.
Modify the existing Flutter app to accommodate the new features.
Implement audible feedback for command execution.
Potentially integrate voice control with other apps/services like navigation, phone, Wi-Fi (stretch goal).