koudounasalkis/Audio-Speech-Tutorial
This repository contains a short introduction to audio and speech processing, from basics to applications.
This tutorial helps you understand how sound is represented digitally and how to prepare audio for advanced applications. You'll learn what goes into building systems that can identify speakers, recognize emotions from speech, or convert speech to text. It is aimed at anyone interested in applying artificial intelligence to audio data, such as data scientists, researchers, or sound engineers.
No commits in the last 6 months.
Use this if you are a data scientist or researcher new to audio processing and want a foundational understanding to build speech-related AI applications.
Not ideal if you are looking for a pre-built, production-ready tool or an in-depth academic textbook on advanced signal processing.
Stars: 21
Forks: 3
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: Dec 20, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/koudounasalkis/Audio-Speech-Tutorial"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
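The same request can be made from Python. The block below is a minimal sketch, not an official client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the URL pattern (category, owner, repository) is inferred only from the single curl example above, and the response is assumed to be JSON, which the page does not confirm.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper).

    Assumption: the path is <category>/<owner>/<repo>, as in the
    one curl example; other categories or repos are untested.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a keyless GET request and decode the body.

    Assumption: the body is JSON; its fields are not documented here.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to hit the network.
    print(build_quality_url("voice-ai", "koudounasalkis", "Audio-Speech-Tutorial"))
```

Calling `fetch_quality("voice-ai", "koudounasalkis", "Audio-Speech-Tutorial")` would count against the daily request limit mentioned above, so caching the result locally is worth considering.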
Higher-rated alternatives
Picovoice/rhino
On-device Speech-to-Intent engine powered by deep learning
yandexdataschool/speech_course
YSDA course in Speech Processing.
MycroftAI/adapt
Adapt Intent Parser
Picovoice/speech-to-intent-benchmark
Benchmark for Speech-to-Intent engines
IBM/BigLittleNet
Official repository for Big-Little Net