modelscope/ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

/ 100

Emerging

This toolkit helps anyone working with audio clean up speech recordings. You can input noisy speech, mixed voices, or low-quality audio, and it will output clearer speech, separated voices, or high-resolution audio. It's designed for audio engineers, podcasters, transcribers, or anyone needing to improve speech clarity.

3,962 stars. No commits in the last 6 months.

Use this if you need to remove background noise, separate individual speakers from a conversation, or enhance the quality and bandwidth of recorded speech.

Not ideal if your primary goal is general audio editing or music processing, as this tool is specifically focused on speech.

audio-post-production speech-enhancement voice-separation audio-restoration podcast-production

Stale 6m, No Package, No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 19 / 25


Stars: 3,962

Forks: 325

Language: Python

License: Apache-2.0

Higher-rated alternatives

espnet/espnet

End-to-End Speech Processing Toolkit

yeyupiaoling/PPASR

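End-to-end Chinese speech recognition implemented with PaddlePaddle, from getting started to real-world use, with very simple introductory examples and highly practical enterprise projects. Supports the currently most popular DeepSpeech2, Conformer, and Squeezeformer models.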

yeyupiaoling/PaddlePaddle-DeepSpeech

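Chinese speech recognition implemented with PaddlePaddle. The project is complete and recognition accuracy is good. Supports training and inference on Windows and Linux, and inference on Nvidia Jetson development boards.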

flashlight/wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

pannous/tensorflow-speech-recognition

🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
