tensorflow_end2end_speech_recognition and kaggle_speech_recognition

Maintenance 0/25
Adoption 10/25
Maturity 16/25
Community 24/25
Maintenance 0/25
Adoption 9/25
Maturity 16/25
Community 19/25
Stars: 314
Forks: 119
Downloads:
Commits (30d): 0
Language: Python
License: MIT
Stars: 72
Forks: 20
Downloads:
Commits (30d): 0
Language: Python
License: MIT
Stale 6m No Package No Dependents
Stale 6m No Package No Dependents

About tensorflow_end2end_speech_recognition

hirofumi0810/tensorflow_end2end_speech_recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

This project helps researchers and developers build custom speech recognition systems. It takes audio recordings from popular speech datasets like TIMIT, LibriSpeech, or CSJ, and processes them to output text transcripts. It's designed for someone specializing in machine learning or natural language processing who needs to experiment with advanced end-to-end speech recognition models.

speech-to-text natural-language-processing machine-learning-research audio-transcription voice-technology

About kaggle_speech_recognition

huschen/kaggle_speech_recognition

Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.

This project helps you automatically detect simple spoken commands, like "yes" or "no", from audio recordings. It takes sound wave files as input and outputs the recognized command as text. This tool is designed for anyone needing to build or evaluate a system that understands specific voice commands.

speech-recognition voice-commands audio-processing natural-language-processing human-computer-interaction

Scores updated daily from GitHub, PyPI, and npm data. How scores work