tensorflow_end2end_speech_recognition and ctc-asr

ctc-asr
47
Emerging
Maintenance 0/25
Adoption 10/25
Maturity 16/25
Community 24/25
Maintenance 0/25
Adoption 10/25
Maturity 16/25
Community 21/25
Stars: 314
Forks: 119
Downloads:
Commits (30d): 0
Language: Python
License: MIT
Stars: 123
Forks: 36
Downloads:
Commits (30d): 0
Language: Python
License: MIT
Stale 6m No Package No Dependents
Stale 6m No Package No Dependents

About tensorflow_end2end_speech_recognition

hirofumi0810/tensorflow_end2end_speech_recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

This project helps researchers and developers build custom speech recognition systems. It takes audio recordings from popular speech datasets like TIMIT, LibriSpeech, or CSJ, and processes them to output text transcripts. It's designed for someone specializing in machine learning or natural language processing who needs to experiment with advanced end-to-end speech recognition models.

speech-to-text natural-language-processing machine-learning-research audio-transcription voice-technology

About ctc-asr

mdangschat/ctc-asr

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

This is an automatic speech recognition (ASR) system that converts spoken audio into written text. You provide audio files (WAV format) and the system outputs their transcriptions. This tool is for researchers, linguists, or anyone needing to convert large audio datasets into text for analysis or further processing.

speech-to-text audio-transcription linguistics-research voice-data-processing

Scores updated daily from GitHub, PyPI, and npm data. How scores work