xingchensong/Speech-Transformer-tf2.0

transformer for ASR-systerm (via tensorflow2.0)

/ 100

Emerging

This project offers a way to convert spoken audio into written text. You feed it recordings of speech, and it produces the corresponding sequence of characters. It's designed for researchers and engineers working on automatic speech recognition systems who need a robust model for transcribing spoken language.

114 stars. No commits in the last 6 months.

Use this if you are developing an automatic speech recognition system and need a model that can directly translate acoustic features into text.

Not ideal if you're looking for a ready-to-use application for transcription without needing to integrate it into a larger system or work with acoustic features.

speech-recognition audio-transcription natural-language-processing linguistics voice-technology

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 8 / 25

Community 21 / 25

How are scores calculated?

Stars

114

Forks

Language

Python

License

—

Higher-rated alternatives

TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....

dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

srvk/eesen

The official repository of the Eesen project

Explore Voice AI Tools

All categories Trending Voice AI directory Insights