tensorflow_end2end_speech_recognition and kaggle_speech_recognition

tensorflow_end2end_speech_recognition

Established

kaggle_speech_recognition

Emerging

Maintenance 0/25

Adoption 10/25

Maturity 16/25

Community 24/25

Maintenance 0/25

Adoption 9/25

Maturity 16/25

Community 19/25

Stars: 314

Forks: 119

Downloads: —

Commits (30d): 0

Language: Python

License: MIT

Stars: 72

Forks: 20

Downloads: —

Commits (30d): 0

Language: Python

License: MIT

Stale 6m No Package No Dependents

About tensorflow_end2end_speech_recognition

hirofumi0810/tensorflow_end2end_speech_recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

This project helps researchers and developers build custom speech recognition systems. It takes audio recordings from popular speech datasets like TIMIT, LibriSpeech, or CSJ, and processes them to output text transcripts. It's designed for someone specializing in machine learning or natural language processing who needs to experiment with advanced end-to-end speech recognition models.

speech-to-text natural-language-processing machine-learning-research audio-transcription voice-technology

About kaggle_speech_recognition

huschen/kaggle_speech_recognition

Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.

This project helps you automatically detect simple spoken commands, like "yes" or "no", from audio recordings. It takes sound wave files as input and outputs the recognized command as text. This tool is designed for anyone needing to build or evaluate a system that understands specific voice commands.

speech-recognition voice-commands audio-processing natural-language-processing human-computer-interaction

Related comparisons

tensorflow_end2end_speech_recognition and tensorflow-ctc-speech-recognition tensorflow_end2end_speech_recognition and ctc-asr tensorflow_end2end_speech_recognition and tensorflow-ctc-speech-recognition

Scores updated daily from GitHub, PyPI, and npm data. How scores work