techiaith/docker-huggingface-stt-cy
Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech Recognition for Welsh with HuggingFace
This tool helps convert spoken Welsh (and some English) into written text, useful for transcribing audio or video files. It takes your Welsh speech recordings and produces a text file of what was said. Anyone needing to process Welsh audio, such as researchers, content creators, or language educators, can use this for transcription.
No commits in the last 6 months.
Use this if you need to accurately transcribe spoken Welsh from audio files, including extracting text from videos, or if you're developing applications that require Welsh speech-to-text functionality.
Not ideal if you primarily need speech recognition for languages other than Welsh or English, or if you require extremely high accuracy for very noisy audio without further model training.
Stars
13
Forks
4
Language
Python
License
MIT
Category
Last pushed
Nov 29, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/techiaith/docker-huggingface-stt-cy"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
herimor/voxtream
VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control
EveryVoiceTTS/EveryVoice
The EveryVoice TTS Toolkit - Text To Speech for your language
thorstenMueller/Thorsten-Voice
Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be...
daswer123/xtts-webui
Webui for using XTTS and for finetuning it
kadirnar/VoiceHub
VoiceHub: A Unified Inference Interface for TTS Models