AppleHolic/2020AIChallengeSpeechRecognition
2020 AI Challenge 음성 인식 코드
This project provides pre-trained models for converting spoken language into text, specifically optimized for challenging audio. It takes audio recordings, especially those with background noise or children's voices, and outputs highly accurate written transcripts. This is designed for researchers or practitioners working on speech-to-text systems that need to handle difficult real-world audio environments.
No commits in the last 6 months.
Use this if you need to accurately transcribe speech from audio recordings that contain significant background noise or feature children's voices.
Not ideal if your primary need is for a general-purpose speech recognition system that doesn't focus on these specific challenging audio types.
Stars
8
Forks
—
Language
Python
License
MIT
Category
Last pushed
Nov 29, 2020
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/AppleHolic/2020AIChallengeSpeechRecognition"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
TensorSpeech/TensorFlowASR
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....
dangvansam/viet-asr
VietASR - Vietnamese Automatic Speech Recognition
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
xinjli/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
srvk/eesen
The official repository of the Eesen project