speechio/BigCiDian
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
This project helps build Automatic Speech Recognition (ASR) systems that accurately understand both English and Chinese words, especially when they are mixed, like proper nouns or brand names. It takes lists of English and Chinese words and outputs a unified pronunciation guide (lexicon) for your ASR system. This is useful for speech engineers or linguists creating robust ASR models for bilingual users.
262 stars. No commits in the last 6 months.
Use this if you need a consistent pronunciation lexicon for building or improving ASR systems that handle both English and Chinese speech, particularly mixed language input.
Not ideal if your ASR application only processes a single language or if you require a lexicon for languages other than English or Chinese.
Stars
262
Forks
55
Language
Python
License
—
Category
Last pushed
Oct 11, 2019
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/speechio/BigCiDian"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
qianchang/zici
字词:收集国学/汉语字词拼音相关资源
gheyret/UQSpeechDataset
Uyghur Single Speaker Speech Dataset. ウイグル語音声データセット
apluka34/Bud500
Bud500: A Comprehensive Vietnamese ASR Dataset
harisbinzia/PronouncUR
PronouncUR: An Urdu Pronunciation Lexicon Generator
jonsafari/buckeye_dict
Buckeye Pronunciation Dictionary