georgygospodinov/speech_course
Deep Learning for Speech
This course provides comprehensive educational materials for understanding and implementing deep learning techniques in speech processing. It guides users through various tasks, taking raw audio or text input to produce processed speech, transcriptions, or synthesized voices. It's designed for students or researchers who want to gain expertise in building advanced speech technology.
109 stars.
Use this if you want to learn the core concepts and practical applications of deep learning in speech processing, from foundational signal processing to advanced generative models.
Not ideal if you're looking for a plug-and-play tool to solve a specific speech problem without diving into the underlying theory and implementation.
Stars
109
Forks
15
Language
Jupyter Notebook
License
—
Category
Last pushed
Dec 21, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/georgygospodinov/speech_course"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
felixbur/nkululeko
Machine learning speaker characteristics
claritychallenge/clarity
Clarity Challenge toolkit - software for building Clarity Challenge systems
juanmc2005/diart
A python package to build AI-powered real-time audio applications
astorfi/3D-convolutional-speaker-recognition
:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.