tensorflow_end2end_speech_recognition and ctc-asr
About tensorflow_end2end_speech_recognition
hirofumi0810/tensorflow_end2end_speech_recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
This project helps researchers and developers build custom speech recognition systems. It takes audio recordings from popular speech datasets like TIMIT, LibriSpeech, or CSJ, and processes them to output text transcripts. It's designed for someone specializing in machine learning or natural language processing who needs to experiment with advanced end-to-end speech recognition models.
About ctc-asr
mdangschat/ctc-asr
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
This is an automatic speech recognition (ASR) system that converts spoken audio into written text. You provide audio files (WAV format) and the system outputs their transcriptions. This tool is for researchers, linguists, or anyone needing to convert large audio datasets into text for analysis or further processing.
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work