ThetaOne-AI/HiKE

Hierarchical Korean-English Code-Switching Speech Recognition Benchmark (EACL Findings 2026, To Appear) | 한영 혼용 음성인식 벤치마크

/ 100

Emerging

This is a benchmark and evaluation tool for assessing how well Automatic Speech Recognition (ASR) models transcribe speech that mixes Korean and English. It takes audio files containing mixed-language speech and outputs detailed error rates, showing where models struggle with word, phrase, or sentence-level code-switching and loanwords. ASR developers and researchers can use this to rigorously test and improve their models' performance on challenging bilingual audio.

Use this if you are developing or evaluating ASR models and need a standardized, high-quality benchmark to measure their accuracy on Korean-English code-switching speech.

Not ideal if you are a casual user simply looking to transcribe Korean-English mixed speech without needing to evaluate model performance or develop new ASR systems.

speech-recognition-development bilingual-ai korean-english-language model-evaluation natural-language-processing-research

No Package No Dependents

Maintenance 6 / 25

Adoption 5 / 25

Maturity 15 / 25

Community 8 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....

dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

srvk/eesen

The official repository of the Eesen project

Explore Voice AI Tools

All categories Trending Voice AI directory Insights