Audio-WestlakeU/VINP

Official PyTorch implementation of 'VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverberation and Blind RIR Identification' [IEEE TASLP]

Emerging

This project helps audio engineers and researchers improve the clarity of speech recordings and understand the acoustic properties of a recording environment. It takes reverberant speech audio as input and outputs dereverberated speech, along with an estimated room impulse response (RIR) that describes how sound behaves in that space. It is aimed at professionals working on audio analysis or speech recognition.

Use this if you need to clean up reverberant, echoey speech recordings to make them more intelligible or to improve the accuracy of automatic speech recognition systems.

Not ideal if your primary goal is general noise reduction from sources other than reverberation, or if you do not require detailed room acoustic identification.

audio-enhancement speech-processing acoustic-analysis speech-recognition room-acoustics

No Package, No Dependents

Maintenance 10 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 16 / 25

Language: Python

License: MIT

Higher-rated alternatives

TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....

dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

srvk/eesen

The official repository of the Eesen project
