manishdhakal/ASR-Nepali-using-CNN-BiLSTM-ResNet
Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet - IEEE (ICICT - 2022)
This project is an automatic speech recognition (ASR) system for the Nepali language. It converts spoken Nepali audio into written Nepali text, and is aimed at researchers and developers who want to build or improve Nepali speech-to-text applications.
No commits in the last 6 months.
Use this if you are a researcher or developer focused on building or enhancing speech-to-text capabilities for the Nepali language.
Not ideal if you need a production-ready, off-the-shelf Nepali ASR solution without any further development or research.
Stars: 24
Forks: 10
Language: Python
License: MIT
Last pushed: Nov 30, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/manishdhakal/ASR-Nepali-using-CNN-BiLSTM-ResNet"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
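The same request can be made from Python. The sketch below is a minimal illustration using only the standard library. It assumes the endpoint returns a JSON body, which the page does not state. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and the `voice-ai` path segment is taken directly from the curl command above.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example; everything else here is illustrative.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL from its three path segments (hypothetical helper)."""
    # Percent-encode each segment so unusual characters cannot break the path.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the data for one repository.

    Assumption: the response body is JSON. If it is not, json.loads raises
    a ValueError and the raw body should be inspected instead.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "manishdhakal", "ASR-Nepali-using-CNN-BiLSTM-ResNet")` issues the same GET request as the curl command. Unauthenticated use counts against the 100 requests/day limit, so it is worth caching the result rather than polling.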
Higher-rated alternatives
githubharald/CTCDecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon...
githubharald/CTCWordBeamSearch
Connectionist Temporal Classification (CTC) decoder with dictionary and language model.
nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System
athena-team/athena
An open-source implementation of a sequence-to-sequence based speech processing engine
hirofumi0810/tensorflow_end2end_speech_recognition
End-to-end speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)