andybi7676/reborn-uasr

REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR

/ 100

Experimental

This project helps speech researchers and language model developers transcribe speech into phoneme sequences without needing extensive labeled audio data. It takes raw audio recordings, such as those from the LibriSpeech datasets in English or various other languages, and outputs the underlying phonemic structure of the speech. This is useful for building speech recognition systems or analyzing speech sounds.

No commits in the last 6 months.

Use this if you need to analyze or transcribe audio data in a language where labeled speech data for traditional ASR training is scarce or unavailable.

Not ideal if you already have a large, labeled dataset for your target language and can train a supervised ASR model.

speech-recognition phoneme-transcription language-modeling unsupervised-learning audio-analysis

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 6 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....

dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

srvk/eesen

The official repository of the Eesen project

Explore Voice AI Tools

All categories Trending Voice AI directory Insights