tensorflow_end2end_speech_recognition and kaggle_speech_recognition
About tensorflow_end2end_speech_recognition
hirofumi0810/tensorflow_end2end_speech_recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
This project helps researchers and developers build custom speech recognition systems. It takes audio recordings from popular speech datasets like TIMIT, LibriSpeech, or CSJ, and processes them to output text transcripts. It's designed for someone specializing in machine learning or natural language processing who needs to experiment with advanced end-to-end speech recognition models.
About kaggle_speech_recognition
huschen/kaggle_speech_recognition
Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.
This project helps you automatically detect simple spoken commands, like "yes" or "no", from audio recordings. It takes sound wave files as input and outputs the recognized command as text. This tool is designed for anyone needing to build or evaluate a system that understands specific voice commands.
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work