espnet/interspeech2019-tutorial

INTERSPEECH 2019 Tutorial Materials

/ 100

Emerging

This provides hands-on educational materials for advanced methods in neural end-to-end speech processing. It takes in raw speech audio and generates either synthesized speech (Text-to-Speech) or transcribed text (Automatic Speech Recognition). This is for researchers, engineers, and students who are learning about or implementing speech AI systems.

194 stars. No commits in the last 6 months.

Use this if you are an AI researcher or student wanting to learn practical implementations of state-of-the-art speech processing models like Text-to-Speech or Automatic Speech Recognition.

Not ideal if you are looking for a pre-built, production-ready speech AI service or a general introduction to machine learning.

Speech AI research Text-to-Speech Automatic Speech Recognition Neural network speech processing Speech technology education

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 21 / 25

How are scores calculated?

Stars

194

Forks

Language

Jupyter Notebook

License

—

Higher-rated alternatives

Picovoice/rhino

On-device Speech-to-Intent engine powered by deep learning

yandexdataschool/speech_course

YSDA course in Speech Processing.

MycroftAI/adapt

Adapt Intent Parser

Picovoice/speech-to-intent-benchmark

benchmark for Speech-to-Intent engines

IBM/BigLittleNet

Official repository for Big-Little Net

Explore Voice AI Tools

All categories Trending Voice AI directory Insights