espnet/interspeech2019-tutorial
INTERSPEECH 2019 Tutorial Materials
These are hands-on educational materials on advanced methods in neural end-to-end speech processing. They cover Text-to-Speech (text in, synthesized speech out) and Automatic Speech Recognition (speech audio in, transcribed text out), and are aimed at researchers, engineers, and students who are learning about or implementing speech AI systems.
194 stars. No commits in the last 6 months.
Use this if you are an AI researcher or student wanting to learn practical implementations of state-of-the-art speech processing models like Text-to-Speech or Automatic Speech Recognition.
Not ideal if you are looking for a pre-built, production-ready speech AI service or a general introduction to machine learning.
Stars: 194
Forks: 39
Language: Jupyter Notebook
License: not listed
Category: not listed
Last pushed: Mar 30, 2021
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/espnet/interspeech2019-tutorial"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
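For scripted use, the curl call above can be wrapped in a small helper. This is a minimal sketch, not documented API behavior: the path layout `<domain>/<owner>/<repo>` is inferred from the single example URL on this page, and `quality_url` and `fetch_quality` are hypothetical helper names. The response format and the way an API key is supplied are not described here, so the sketch makes no assumptions about either.

```shell
#!/usr/bin/env bash
# Sketch only: the <domain>/<owner>/<repo> path layout is an assumption
# inferred from the one example URL shown on this page.

BASE_URL="https://pt-edge.onrender.com/api/v1/quality"

# Build the endpoint URL for a repository (hypothetical helper).
quality_url() {
  local domain="$1" owner="$2" repo="$3"
  printf '%s/%s/%s/%s\n' "$BASE_URL" "$domain" "$owner" "$repo"
}

# Fetch the record for a repository (hypothetical helper).
# --fail makes curl exit non-zero on an HTTP error status, so a caller
# can detect a failed request instead of parsing an error body.
fetch_quality() {
  curl --fail --silent --show-error "$(quality_url "$@")"
}
```

Calling `fetch_quality voice-ai espnet interspeech2019-tutorial` issues the same request as the curl command above. Keeping URL construction separate from the network call makes the helper easy to check without spending any of the daily request quota.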
Higher-rated alternatives
Picovoice/rhino: On-device Speech-to-Intent engine powered by deep learning
yandexdataschool/speech_course: YSDA course in Speech Processing.
MycroftAI/adapt: Adapt Intent Parser
Picovoice/speech-to-intent-benchmark: Benchmark for Speech-to-Intent engines
IBM/BigLittleNet: Official repository for Big-Little Net