githubharald/CTCWordBeamSearch
Connectionist Temporal Classification (CTC) decoder with dictionary and language model.
Given a model that recognizes sequences, such as text in an image or spoken words, this decoder turns the model's raw output into words and sentences. It takes the per-time-step character probabilities from your model and, using a dictionary and optionally a language model, produces a readable text string. It is aimed at AI practitioners, data scientists, and researchers who build or evaluate sequence recognition systems.
577 stars.
Use this if you need to convert your model's character-level predictions into a coherent string of words, especially when the input is noisy. Decoded words are constrained to the dictionary, so it is a weaker fit when many expected words are out of vocabulary.
Not ideal if your recognition task does not involve sequences of characters that form words, or if you do not have a dictionary or language model to leverage.
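To make the idea of dictionary-constrained decoding concrete, here is a minimal Python sketch. It is not the repository's word beam search algorithm or its API. It compares plain best-path CTC decoding with a much simpler lexicon search that scores every dictionary word with the CTC forward algorithm, and it only handles single-word input. The function names, the toy probability matrix, and the convention that the blank label is the last column are all assumptions made for illustration.

```python
from typing import List, Sequence


def best_path_decode(probs: Sequence[Sequence[float]], chars: str) -> str:
    """Greedy CTC decoding: argmax per time step, collapse repeats, drop blanks.

    Assumption: the blank label is the last column (index len(chars)).
    """
    blank = len(chars)
    best = [max(range(len(row)), key=row.__getitem__) for row in probs]
    out: List[str] = []
    prev = blank
    for label in best:
        if label != prev and label != blank:
            out.append(chars[label])
        prev = label
    return "".join(out)


def ctc_word_probability(probs: Sequence[Sequence[float]], chars: str, word: str) -> float:
    """Total probability of all CTC paths that collapse to `word` (forward algorithm)."""
    blank = len(chars)
    # Extended label sequence: blank, c1, blank, c2, ..., blank.
    ext = [blank]
    for ch in word:
        ext += [chars.index(ch), blank]  # raises ValueError for unknown characters
    alpha = [0.0] * len(ext)
    alpha[0] = probs[0][ext[0]]
    if len(ext) > 1:
        alpha[1] = probs[0][ext[1]]
    for row in probs[1:]:
        new = [0.0] * len(ext)
        for s, label in enumerate(ext):
            total = alpha[s]
            if s >= 1:
                total += alpha[s - 1]
            # Skipping a blank is allowed only between two different characters.
            if s >= 2 and label != blank and label != ext[s - 2]:
                total += alpha[s - 2]
            new[s] = total * row[label]
        alpha = new
    return alpha[-1] + (alpha[-2] if len(ext) > 1 else 0.0)


def decode_with_dictionary(probs: Sequence[Sequence[float]], chars: str, words: Sequence[str]) -> str:
    """Return the dictionary word with the highest CTC probability."""
    return max(words, key=lambda w: ctc_word_probability(probs, chars, w))


# Toy model output: 3 time steps, columns are 'a', 'b', blank (hypothetical values).
CHARS = "ab"
PROBS = [
    [0.6, 0.3, 0.1],
    [0.5, 0.4, 0.1],
    [0.1, 0.3, 0.6],
]

print(best_path_decode(PROBS, CHARS))                      # a
print(decode_with_dictionary(PROBS, CHARS, ["ab", "ba"]))  # ab
```

In this toy case best-path decoding returns "a", which is not a dictionary word, while the dictionary search returns "ab". Scoring every word this way scales poorly with dictionary size and does not handle multi-word text, which is the kind of limitation a beam search over a dictionary is meant to address.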
Stars: 577
Forks: 159
Language: C++
License: MIT
Category:
Last pushed: Jan 31, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/githubharald/CTCWordBeamSearch"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
githubharald/CTCDecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon...
nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
athena-team/athena
An open-source implementation of a sequence-to-sequence-based speech processing engine
hirofumi0810/tensorflow_end2end_speech_recognition
End-to-end speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)
robmsmt/KerasDeepSpeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation