mravanelli/pySpeechRev
This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of acoustic impulse responses.
This tool helps researchers and engineers in speech technology create realistic reverberated speech datasets. It takes a collection of clean, close-talking speech recordings and applies various room impulse responses to them. The output is a new dataset of speech files that sound as if they were recorded in different acoustic environments, useful for training robust speech recognition systems.
No commits in the last 6 months.
Use this if you need to generate large amounts of acoustically diverse speech data to make your speech technology more robust to real-world conditions like echoing rooms.
Not ideal if you're looking for real-time speech processing or interactive sound design, as this tool is designed for offline dataset generation.
Stars
97
Forks
25
Language
Python
License
—
Category
Last pushed
May 30, 2020
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/mravanelli/pySpeechRev"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Featured in
Higher-rated alternatives
Uberi/speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
cmusphinx/pocketsphinx
A small speech recognizer
tensorflow/lingvo
Lingvo
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models,...
PyThaiNLP/pythaiasr
Python Thai Automatic Speech Recognition