shenasa-ai/speech2text

A Deep-Learning-Based Persian Speech Recognition System

44 / 100 (Emerging)

This project offers tools and datasets for converting spoken Persian into written text. It is aimed at data scientists and machine learning engineers working with Persian audio, and provides both the code for an Automatic Speech Recognition (ASR) system and large datasets of Persian speech with transcriptions. You feed it audio files, and it outputs the corresponding text for use in downstream applications.

234 stars. No commits in the last 6 months.

Use this if you are a machine learning engineer or data scientist looking to build or train a Persian speech-to-text system, and you need data or a starting point for implementation.

Not ideal if you are an end-user simply needing to transcribe audio without deep technical knowledge of machine learning, or if you need a ready-to-use commercial-grade ASR API.

Persian language processing, speech recognition, audio transcription, dataset creation, natural language processing
Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 18 / 25


Stars: 234
Forks: 33
Language: Jupyter Notebook
License: MIT
Last pushed: May 22, 2023
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/shenasa-ai/speech2text"

Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
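
For use from a script or notebook, the same request can be sketched in Python with only the standard library. This is a minimal sketch, not an official client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, reading the three path segments as category, owner, and repository is an inference from the URL above, and the code assumes the endpoint returns JSON, which the page does not state.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL from its three path segments.

    Treating the segments as category/owner/repo is an assumption
    based on the shape of the example URL.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request and parse the body as JSON.

    Assumption: the response is JSON. No API key is sent, so the
    keyless daily limit applies.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "shenasa-ai", "speech2text")` issues the same request as the curl command. The response schema is not documented here, so inspect the returned object before relying on specific field names.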