liangstein/Chinese-speech-to-text
Chinese Speech To Text Using Wavenet
This project transcribes spoken Mandarin Chinese from audio files into written Chinese characters. You provide an audio recording of someone speaking Mandarin, and it outputs the corresponding text. This tool is for anyone who needs to quickly convert spoken Chinese into a readable, searchable text format, such as researchers working with spoken data or individuals needing to document conversations.
163 stars. No commits in the last 6 months.
Use this if you need to convert clear, recorded Mandarin speech into text, particularly common phrases and sentences.
Not ideal if you require highly accurate transcription in noisy environments or for specialized vocabulary, as it's trained on a relatively small dataset.
Stars: 163
Forks: 55
Language: Python
License: Apache-2.0
Category:
Last pushed: May 03, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/liangstein/Chinese-speech-to-text"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
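The curl request above can also be issued from Python. The following is a minimal sketch using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and the response format is an assumption: the page does not document it, so the code tries JSON and falls back to the raw text.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example; everything after it is
# category / owner / repository.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL in the same shape as the curl example."""
    # Percent-encode each path segment so unusual characters stay valid.
    path = "/".join(quote(part, safe="") for part in (category, owner, repo))
    return BASE_URL + "/" + path


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Request the record for one repository (hypothetical helper).

    Assumption: the body may be JSON. If it does not parse as JSON,
    the raw text is returned instead.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        body = response.read().decode("utf-8")
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body


# Example call (performs a network request, so it is left commented out):
# print(fetch_quality("voice-ai", "liangstein", "Chinese-speech-to-text"))
```

Without a key the listed limit is 100 requests per day, so a script that polls this endpoint for many repositories would need to cache results or throttle itself.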
Related tools
louiskirsch/speechT
Open-source speech-to-text software written in TensorFlow
Open-Speech-EkStep/vakyansh-models
Open source speech to text models for Indic Languages
oliverguhr/wav2vec2-live
Live speech recognition using Facebook's wav2vec 2.0 model.
Open-Speech-EkStep/vakyansh-wav2vec2-experimentation
Repository containing an experimentation platform for training and running inference on wav2vec2 models.
juliuskunze/speechless
Speech-to-text based on wav2letter built for transfer learning