anhvung/Capstone-Audio-Transcription

Exploring different ASR and language models for audio transcription

12
/ 100
Experimental

This project helps researchers and product developers create accurate text transcripts from audio recordings. It takes audio files, even those with background noise or different accents, and converts them into written text. It's ideal for anyone who needs to convert spoken language into text for analysis, search, or documentation.

No commits in the last 6 months.

Use this if you need to transcribe audio content and want to fine-tune a model for better accuracy on specific types of speech, accents, or languages.

Not ideal if you just need basic, off-the-shelf transcription and don't plan to customize or improve the model's performance with your own data.

audio-transcription speech-to-text research-analysis multilingual-support content-moderation
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 8 / 25
Community 0 / 25

How are scores calculated?

Stars

8

Forks

Language

Jupyter Notebook

License

Last pushed

Dec 18, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/anhvung/Capstone-Audio-Transcription"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.