DeepSwissVoice/DeepVoice

A TensorFlow implementation of Baidu's DeepSpeech architecture

/ 100

Emerging

This project helps convert spoken audio into written text, making it easier to transcribe meetings, dictate documents, or create captions for videos. You feed it audio recordings, and it produces a text transcript. It's designed for anyone who needs to quickly and accurately convert speech to text without manual typing.

No commits in the last 6 months.

Use this if you need an automated way to transcribe audio files into text.

Not ideal if you need real-time speech-to-text for live conversations, or if your audio relies on highly specialized vocabulary that general speech models typically do not cover.

audio-transcription voice-to-text dictation captioning meeting-minutes

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 16 / 25

Community 13 / 25


Language C++

License MPL-2.0

Higher-rated alternatives

githubharald/CTCDecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon...

githubharald/CTCWordBeamSearch

Connectionist Temporal Classification (CTC) decoder with dictionary and language model.

nl8590687/ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System

athena-team/athena

An open-source implementation of a sequence-to-sequence-based speech processing engine

hirofumi0810/tensorflow_end2end_speech_recognition

End-to-end speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)
