786440445/ASR_DFCNN_Transformer

1. ctc的DCNN声学模型+语言模型和 transformer的端到端模型

25
/ 100
Experimental

This project helps convert spoken Chinese into written text using advanced deep learning. It processes audio files from various Chinese speech corpuses and outputs transcribed text. This is designed for researchers or practitioners working on Chinese speech-to-text applications, such as voice assistants, transcription services, or language processing tools.

No commits in the last 6 months.

Use this if you need a robust, pre-trained model for transcribing spoken Chinese from several common datasets into text.

Not ideal if you are working with languages other than Chinese or require real-time, low-latency transcription without pre-trained models.

Chinese-speech-recognition audio-transcription natural-language-processing voice-AI
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 8 / 25
Community 13 / 25

How are scores calculated?

Stars

8

Forks

2

Language

Python

License

Last pushed

Dec 08, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/786440445/ASR_DFCNN_Transformer"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.