popcornell/MicRank

MicRank is a learning-to-rank framework for neural channel selection, in which a DNN is trained to rank microphone channels.


Experimental

MicRank helps improve the accuracy of distant speech recognition systems by automatically identifying the most effective microphones in a multi-microphone setup. It takes audio streams from various microphones and outputs a ranked list, allowing you to select the best channels for processing. This tool is ideal for engineers and researchers developing voice assistants, meeting transcription systems, or other applications relying on robust speech detection from multiple audio sources.
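The rank-then-select flow described above can be sketched in a few lines. This is only an illustration and not MicRank's actual API: the names `placeholder_score`, `rank_channels`, and `select_top_k` are hypothetical, and mean signal power is used as a stand-in for the score that a trained ranking DNN would produce.

```python
from typing import Callable, List, Sequence, Tuple

Channel = Sequence[float]  # samples from one microphone


def placeholder_score(samples: Channel) -> float:
    """Hypothetical stand-in for a trained ranking model: mean signal power."""
    if not samples:
        return 0.0
    return sum(x * x for x in samples) / len(samples)


def rank_channels(
    channels: Sequence[Channel],
    score_fn: Callable[[Channel], float] = placeholder_score,
) -> List[Tuple[int, float]]:
    """Return (channel_index, score) pairs sorted from best to worst."""
    scored = [(i, score_fn(ch)) for i, ch in enumerate(channels)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


def select_top_k(
    channels: Sequence[Channel],
    k: int,
    score_fn: Callable[[Channel], float] = placeholder_score,
) -> List[int]:
    """Return the indices of the k highest-scoring channels."""
    return [i for i, _ in rank_channels(channels, score_fn)[:k]]


# Toy input: three microphones, four samples each (made-up values).
mics = [
    [0.01, -0.02, 0.01, 0.00],   # distant, weak signal
    [0.50, -0.40, 0.45, -0.50],  # close to the speaker
    [0.10, -0.10, 0.12, -0.09],  # mid-distance
]

ranking = rank_channels(mics)
best_two = select_top_k(mics, k=2)
```

In a real setup, `score_fn` would be replaced by the learned model's per-channel score, and only the selected channels would be passed on to the speech recognizer.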

No commits in the last 6 months.

Use this if you need to optimize speech recognition performance in environments with multiple, potentially unsynchronized microphones, where some channels might be noisy or distant from the speaker.

Not ideal if your application uses a single microphone or if you require sample-level synchronization between devices for other processing steps.

distant-speech-recognition audio-channel-selection voice-user-interface acoustic-signal-processing multi-microphone-arrays

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 14 / 25


Language: Python

License: none

Higher-rated alternatives

julius-speech/julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

rolczynski/Automatic-Speech-Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

tabahi/formantfeatures

Extract frequency, power, width and dissonance of formants from wav files

libdriver/ld3320

LD3320 full-featured driver library for general-purpose MCU and Linux.

awsaf49/audio_classification_models

Tensorflow Audio Classification Models
