mgonzs13/whisper_ros

Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2

/ 100

Established

This project helps roboticists and engineers enable their robots to understand spoken commands and transcribe audio in real-time. It takes live audio input from a robot's microphone, processes it to detect when someone is speaking, and then converts the speech into text. The output is a stream of transcribed text that the robot can then use for interaction or task execution.

Use this if you are developing ROS 2-based robots that need to interpret human speech and respond to voice commands in real-time.

Not ideal if you need to analyze pre-recorded audio files or if your robotic system does not use the ROS 2 framework.

robotics voice control speech recognition human-robot interaction robot programming

No Package No Dependents

Maintenance 10 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 19 / 25

How are scores calculated?

Stars

Forks

Language

C++

License

MIT

Featured in

Things AI Won't Tell You About Building a Voice App

Compare

whisper_ros and whisper.cpp

Related tools

ggml-org/whisper.cpp

Port of OpenAI's Whisper model in C/C++

vilassn/whisper_android

Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android

sandrohanea/whisper.net

Whisper.net. Speech to text made simple using Whisper Models

ChetanXpro/nodejs-whisper

NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as initially crafted in C++...

mybigday/whisper.rn

React Native binding of whisper.cpp.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights