TeaPoly/cat_tensorflow

CRF-based ASR toolkit implemented in TensorFlow

Experimental

This toolkit helps speech researchers and machine learning engineers train custom Automatic Speech Recognition (ASR) acoustic models. It takes audio data and the corresponding text transcripts and produces a trained model that converts spoken language into written text. It is designed for practitioners who need to develop and fine-tune ASR systems, particularly those working with Conditional Random Fields (CRF) and TensorFlow.

No commits in the last 6 months.

Use this if you are a speech researcher or machine learning engineer focused on developing high-performance ASR acoustic models using TensorFlow and CRF, and you are comfortable with configuring CUDA environments.

Not ideal if you are looking for an out-of-the-box ASR solution or a tool that doesn't require deep technical knowledge of TensorFlow, CUDA, and ASR model training pipelines.

Automatic Speech Recognition, ASR model training, speech processing, acoustic modeling, natural language processing

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 8 / 25

Community 16 / 25

Language: Python

License: none

Higher-rated alternatives

githubharald/CTCDecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon...

githubharald/CTCWordBeamSearch

Connectionist Temporal Classification (CTC) decoder with dictionary and language model.

nl8590687/ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System

athena-team/athena

An open-source implementation of a sequence-to-sequence based speech processing engine

hirofumi0810/tensorflow_end2end_speech_recognition

End-to-end speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)
