Voice AI Learning Collections Voice AI Tools

Educational Python repositories and coding practice collections covering diverse domains (utilities, automation, tutorials). Does NOT include specialized voice-AI tools, production applications, or focused libraries for specific tasks like TTS/ASR.

There are 57 voice ai learning collections tools tracked. 4 score above 50 (established tier). The highest-rated is Spr-Aachen/Easy-Voice-Toolkit at 60/100 with 875 stars. 1 of the top 10 are actively maintained.

Get all 57 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=voice-ai-learning-collections&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 Spr-Aachen/Easy-Voice-Toolkit

A user-friendly audio toolkit for voice recognition, voice transcription,...

60
Established
2 PrzemyslawSwiderski/python-gradle-plugin

Gradle plugin to run Python projects.

52
Established
3 alphacep/awesome-russian-speech

Russian speech technology links

52
Established
4 ftyers/commonvoice-utils

Linguistic processing for Common Voice

51
Established
5 microsoft/UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

47
Emerging
6 microsoft/SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

46
Emerging
7 GuillaumeFalourd/formulas-python

Ritchie CLI formulas in Python 🐍

45
Emerging
8 inclusionAI/Ming-UniAudio

Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing...

44
Emerging
9 metame-ai/awesome-audio-plaza

Daily tracking of awesome audio papers, including music generation,...

43
Emerging
10 alsrb0607/KoreanSTT

kospeech를 활용한 한국어 음성 인식 모델 개발

41
Emerging
11 OpenMOSS/MOSS-Audio-Tokenizer

MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on...

41
Emerging
12 rtzr/Awesome-Korean-Speech-Recognition

한국어 음성인식 STT API 리스트. 각 성능 벤치마크.

41
Emerging
13 amitdev01/awesome-voice-ai

Awesome Voice Ai

41
Emerging
14 forfrt/SteerMoE

SteerMoE: Efficient Audio-Language Models with Preserved Reasoning Capabilities

41
Emerging
15 pymike00/YouTube-Tutorials

:open_file_folder: Source Code for (some of) the Programming Tutorials from...

40
Emerging
16 itspyguru/Tkinter-Applications

A collection of small tkinter apps made by me

40
Emerging
17 jefflai108/Semi-Supervsied-Spoken-Language-Understanding-PyTorch

Semi-supervised spoken language understanding (SLU) via self-supervised...

36
Emerging
18 syntithenai/opensnips

Open source projects related to Snips https://snips.ai/.

35
Emerging
19 aditya-joglekar/FS02_Scoring_Toolkit

Scoring Toolkit for the Fearless Steps Challenge Phase-02 Tasks

34
Emerging
20 Speech-to-text-Kafka-Airflow-Spark/StoTkas

Data engineering pipeline that allows recording millions of Amharic and...

34
Emerging
21 KathyReid/opensource-voice-tools

A repo listing known open source voice tools, ordered by where they sit in...

33
Emerging
22 seungwonpark/awesome-tts-samples

Awesome list of TTS papers with audio samples

33
Emerging
23 KennethanCeyer/awesome-audio-speech

Awesome list of Audio, Speech, and DSP(Digital signal processing)

32
Emerging
24 nuaazs/VAF_2

Aims to create a comprehensive voice toolkit for training, testing, and...

30
Emerging
25 DevTae/SpeechFeedback

Docker, 음성인식 AI, FastAPI 기반 한국어 발음 교정 시스템

30
Emerging
26 Mierdoso87/Step-Audio-R1.1

🎧 Unlock audio insights with Step-Audio-R1.1, the first model that scales...

27
Experimental
27 tjwodud04/Master-Course-Project

Master course team project code files (석사과정 참여과제 코드 파일)

27
Experimental
28 rafaotetra/awesome-coding-by-voice

A list of videos, papers, tools, APIs and projects about coding by voice

27
Experimental
29 YChenL/UniVR

An official implement of "UniVR: A Unified Framework for Pitch-Shifted Voice...

26
Experimental
30 wildminder/awesome-ai-voice

List of open-source TTS, voice cloning, and music generation models

26
Experimental
31 yaya-sy/speechscorer

unsupervised spoken utterances scoring

26
Experimental
32 34j/awesome-vits

List of repositories relevant to VITS.

26
Experimental
33 Ploscha/Awesome-Audio-Generation

Awesome-Audio-Generation is a collection of resources for Text-to-Audio...

25
Experimental
34 Bangla-Language-Processing/Bangla-Speech-Corpora

Bangla cleaned speech corpus, specially developed for Bangla Text to Speech

25
Experimental
35 danielrosehill/Speech-To-Text-System-Prompt-Library

An updated skeleton library of system prompts for using LLMs to refine STT output

24
Experimental
36 geniusrise/audio

Audio components for geniusrise framework

23
Experimental
37 auralshin/python

python tryout projects

23
Experimental
38 yepicaiaaron/awesome-audio-generation-2026

🎙️ Curated collection of open-source audio generation models released in...

23
Experimental
39 Ghalwash123/MiMo-Audio-Training

🔊 Train audio models efficiently with MiMo-Audio-Training, a toolkit...

22
Experimental
40 TreyDettmer/SitUpTracker

Python application that tracks how many sit-ups a person has done

21
Experimental
41 lhg96/stt-demo-korean

Korean Speech-to-Text app with Whisper & Vosk | 한국어 음성인식 데모 애플리케이션

20
Experimental
42 NatGr/annotate_audio

Helper scripts to split a large audio file into smaller chunks and annotate...

20
Experimental
43 ANVEAI/open-source-voice-ai

Open source voice AI tools, models, and libraries for speech recognition and...

19
Experimental
44 ANVEAI/voice-ai-resources

A curated collection of voice AI tools, libraries, datasets, and learning resources

19
Experimental
45 MargotUCD/LinguaTest

This repository contains the code and resources for the linguistically...

19
Experimental
46 patelritiq/CodeClause-Internship-Projects

A comprehensive collection of 4 Python applications developed during a...

19
Experimental
47 yash-srivastava19/TRINIT_EzDub_ML01

Generative Audio Synthesis Problem Statement(ML01) for TRI-NIT Hackathon 2023.

18
Experimental
48 bhigy/textual-supervision

Code for the paper "Textual supervision for visually grounded spoken...

18
Experimental
49 J0y-B0y/ProductiPy

A unified terminal integrates diverse APIs, streamlining information...

17
Experimental
50 etornam45/mmt-jepa

Using the JEPA architecture for multimodal language translation

17
Experimental
51 Emgicraft/DesProy_ExamenFinal

Desarrollo del Examen Final del curso de Desarrollo de Proyectos.

17
Experimental
52 r-shafi/bangla-speech-to-text

Automatic speech recognition for the Bangla language, one of the world's...

11
Experimental
53 Alex-Sintex/python-playground

Collection of Python practice projects

11
Experimental
54 Push4ck/Personalized

A curated collection of Python-based CLI tools to simplify daily tasks —...

11
Experimental
55 Tuhin-SnapD/Python-Projects

This repository contains a collection of Python projects for beginners to...

11
Experimental
56 macairececile/speech-to-pictograms

Code from the paper "Towards Speech-to-Pictograms Translation" (Interspeech 2024)

10
Experimental
57 AghaEssa/Python__Projects

Hands-on Python projects: Tkinter GUIs, games, file handling, web scraping,...

10
Experimental