RF5/transfusion-asr

Transcribing Speech with Multinomial Diffusion, training code and models.

33
/ 100
Emerging

This project offers a powerful tool for converting spoken audio into written text. You provide audio files (like recordings of meetings or interviews), and it outputs a highly accurate text transcription. It is designed for researchers, data scientists, or anyone working with large volumes of speech data who needs to automatically generate transcripts.

No commits in the last 6 months.

Use this if you need to transcribe spoken language from audio files into text with high accuracy, especially for research or data analysis purposes.

Not ideal if you're looking for a simple, off-the-shelf transcription service without any technical setup or if your main goal is real-time transcription.

speech-to-text audio-transcription natural-language-processing data-labeling computational-linguistics
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 8 / 25

How are scores calculated?

Stars

80

Forks

5

Language

Python

License

Last pushed

Sep 27, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/RF5/transfusion-asr"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.