shenasa-ai/speech2text

A Deep-Learning-Based Persian Speech Recognition System

/ 100

Emerging

This project offers tools and datasets for converting spoken Persian language into written text. It helps data scientists and machine learning engineers working with Persian audio, providing both code for an Automatic Speech Recognition (ASR) system and large datasets of Persian speech with transcriptions. You feed it audio files, and it outputs corresponding text, which can then be used for various applications.

234 stars. No commits in the last 6 months.

Use this if you are a machine learning engineer or data scientist looking to build or train a Persian speech-to-text system, and you need data or a starting point for implementation.

Not ideal if you are an end-user simply needing to transcribe audio without deep technical knowledge of machine learning, or if you need a ready-to-use commercial-grade ASR API.

Persian language processing speech recognition audio transcription dataset creation natural language processing

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 18 / 25

How are scores calculated?

Stars

234

Forks

Language

Jupyter Notebook

License

MIT

Higher-rated alternatives

julius-speech/julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

rolczynski/Automatic-Speech-Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

tabahi/formantfeatures

Extract frequency, power, width and dissonance of formants from wav files

libdriver/ld3320

LD3320 full-featured driver library for general-purpose MCU and Linux.

awsaf49/audio_classification_models

Tensorflow Audio Classification Models

Explore Voice AI Tools

All categories Trending Voice AI directory Insights