786440445/ASR_DFCNN_Transformer
1. ctc的DCNN声学模型+语言模型和 transformer的端到端模型
This project helps convert spoken Chinese into written text using advanced deep learning. It processes audio files from various Chinese speech corpuses and outputs transcribed text. This is designed for researchers or practitioners working on Chinese speech-to-text applications, such as voice assistants, transcription services, or language processing tools.
No commits in the last 6 months.
Use this if you need a robust, pre-trained model for transcribing spoken Chinese from several common datasets into text.
Not ideal if you are working with languages other than Chinese or require real-time, low-latency transcription without pre-trained models.
Stars
8
Forks
2
Language
Python
License
—
Category
Last pushed
Dec 08, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/786440445/ASR_DFCNN_Transformer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
githubharald/CTCDecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon...
githubharald/CTCWordBeamSearch
Connectionist Temporal Classification (CTC) decoder with dictionary and language model.
nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
athena-team/athena
an open-source implementation of sequence-to-sequence based speech processing engine
hirofumi0810/tensorflow_end2end_speech_recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)