keonlee9420/DailyTalk
Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023
DailyTalk is a high-quality dataset of spoken dialogues designed for building more natural, context-aware text-to-speech (TTS) systems. Its baseline model takes text-based conversations and synthesizes speech that sounds closer to real human interaction. It suits researchers and developers working on voice assistants, virtual characters, or automated customer service.
252 stars. No commits in the last 6 months.
Use this if you are developing conversational AI and need a dataset and baseline model for generating speech that reflects the context of a dialogue.
Not ideal if you only need a basic text-to-speech system for individual, isolated sentences rather than full conversations.
Stars: 252
Forks: 14
Language: Python
License: MIT
Category:
Last pushed: Jun 05, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/keonlee9420/DailyTalk"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
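The same request can be made from Python. The following is a minimal sketch using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and the response format is an assumption: the listing does not say what the endpoint returns, so the code tries JSON and falls back to the raw text.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the quality URL for a repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters stay valid.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the quality record (hypothetical helper).

    Assumption: the body may be JSON. If it does not parse as JSON,
    the raw text is returned unchanged.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        body = response.read().decode("utf-8")
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body
```

Calling `fetch_quality("voice-ai", "keonlee9420", "DailyTalk")` issues the same GET request as the curl command. Unauthenticated calls count against the 100 requests/day limit.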
Higher-rated alternatives
hetpandya/youtube_tts_data_generator
A python library to generate speech dataset from Youtube videos
IS2AI/Kazakh_TTS
An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis...
taresh18/TTSizer
🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨
Hecate2/sukasuka-vocal-dataset-builder
1st anime vocal dataset. Extract audio (vocal) files from video based on .ass...
youmebangbang/TTS-dataset-tools
Automatically generates TTS dataset using audio and associated text. Make cuts under a custom...