mgonzs13/whisper_ros
Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2
This project helps roboticists and engineers enable their robots to understand spoken commands and transcribe audio in real-time. It takes live audio input from a robot's microphone, processes it to detect when someone is speaking, and then converts the speech into text. The output is a stream of transcribed text that the robot can then use for interaction or task execution.
Use this if you are developing ROS 2-based robots that need to interpret human speech and respond to voice commands in real-time.
Not ideal if you need to analyze pre-recorded audio files or if your robotic system does not use the ROS 2 framework.
Stars
91
Forks
21
Language
C++
License
MIT
Category
Last pushed
Mar 06, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/mgonzs13/whisper_ros"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
ggml-org/whisper.cpp
Port of OpenAI's Whisper model in C/C++
vilassn/whisper_android
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
sandrohanea/whisper.net
Whisper.net. Speech to text made simple using Whisper Models
ChetanXpro/nodejs-whisper
NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as initially crafted in C++...
mybigday/whisper.rn
React Native binding of whisper.cpp.