jopedroliveira/speech_recog_uc
Speech processing ROS package. It performs speech recognition and estimates the direction of arrival of sound, both driven by a real-time voice activity detection mechanism.
This package lets robots understand spoken commands and locate where the sound is coming from in real time. It takes audio input and outputs transcribed speech together with an estimated direction of arrival for the sound source. Robotics engineers and researchers building interactive robots that must respond to voice commands are the intended audience.
No commits in the last 6 months.
Use this if you are developing a robot that needs to reliably detect speech, understand what is being said, and pinpoint the speaker's location with minimal computational effort.
Not ideal if your application does not involve robots, real-time audio processing, or the Robot Operating System (ROS).
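To make the voice-activity-gated pipeline in the description more concrete, here is a minimal sketch of the general idea for a two-microphone setup. It is not taken from the package: the energy-threshold detector, the brute-force cross-correlation, the far-field geometry, and every function name below are assumptions chosen for illustration, and the speech recognition step is left out entirely.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, approximate value in air at room temperature


def frame_energy(frame):
    """Mean squared amplitude of one frame of samples."""
    return sum(s * s for s in frame) / len(frame)


def is_voice(frame, threshold):
    """Energy-threshold voice activity check (an assumed stand-in detector)."""
    return frame_energy(frame) > threshold


def best_lag(left, right, max_lag):
    """Lag in samples that maximizes the cross-correlation of two channels.

    A positive result means the right channel is delayed relative to the left.
    """
    best, best_score = 0, float("-inf")
    n = len(left)
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i in range(n):
            j = i + lag
            if 0 <= j < n:
                score += left[i] * right[j]
        if score > best_score:
            best, best_score = lag, score
    return best


def doa_degrees(lag, sample_rate, mic_distance):
    """Angle from broadside for a far-field source and two microphones.

    Positive angles point toward the left microphone (sound reaches it first).
    """
    delay = lag / sample_rate
    # Clamp so rounding or noise cannot push the ratio outside asin's domain.
    ratio = max(-1.0, min(1.0, delay * SPEED_OF_SOUND / mic_distance))
    return math.degrees(math.asin(ratio))


def process_frame(left, right, sample_rate, mic_distance, threshold, max_lag):
    """Return an angle estimate only when the frame is judged to contain voice."""
    if not is_voice(left, threshold):
        return None  # skip the more expensive localization on silent frames
    return doa_degrees(best_lag(left, right, max_lag), sample_rate, mic_distance)
```

Gating on the detector first is what keeps the cost low: localization (and, in a full system, recognition) only runs on frames that pass the energy check. In this sketch `max_lag` would normally be set no larger than `mic_distance / SPEED_OF_SOUND * sample_rate`, since larger delays are not physically possible for that microphone spacing.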
Stars: 13
Forks: 9
Language: C++
License: MIT
Last pushed: Apr 09, 2019
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/jopedroliveira/speech_recog_uc"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
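The same request can be made from a script. The sketch below uses only the Python standard library; it assumes the endpoint returns JSON and that other repositories follow the same `category/owner/repo` path layout as the URL above, neither of which is confirmed here. The function names are hypothetical.

```python
import json
import urllib.request

# Base path copied from the curl example; the path layout beyond it is assumed.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the request URL for one repository."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch and decode the response, assuming the body is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality("voice-ai", "jopedroliveira", "speech_recog_uc")
```

Since keyless access is limited to 100 requests per day, a script that polls many repositories would want to cache responses rather than refetch them.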
Higher-rated alternatives
Uberi/speech_recognition: Speech recognition module for Python, supporting several engines and APIs, online and offline.
cmusphinx/pocketsphinx: A small speech recognizer
tensorflow/lingvo: Lingvo
modelscope/FunASR: A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models,...
PyThaiNLP/pythaiasr: Python Thai Automatic Speech Recognition