IS2AI/Kazakh_TTS
An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis corpus. In KazakhTTS2, the overall size has increased from 93 hours to 271 hours, the number of speakers has risen from two to five (three females and two males), and the topic coverage has been diversified.
This project helps create natural-sounding spoken audio from written Kazakh text. You provide Kazakh text, and it produces an audio file of that text being spoken aloud. This is useful for content creators, educators, or businesses looking to generate Kazakh speech for various applications.
147 stars. No commits in the last 6 months.
Use this if you need to convert written Kazakh text into spoken audio, especially for applications like audiobooks, voice assistants, or e-learning materials.
Not ideal if you need to perform speech recognition (converting audio to text) or synthesize speech in languages other than Kazakh.
Stars
147
Forks
26
Language
Shell
License
CC-BY-4.0
Category
Last pushed
Aug 01, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/IS2AI/Kazakh_TTS"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
hetpandya/youtube_tts_data_generator
A python library to generate speech dataset from Youtube videos
taresh18/TTSizer
ποΈ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets β¨
Hecate2/sukasuka-vocal-dataset-builder
γγγγγ’γγ‘γγ«γγγΌγΏγ»γγγ1st anime vocal dataset. Extract audio (vocal) files from video based on .ass...
youmebangbang/TTS-dataset-tools
Automatically generates TTS dataset using audio and associated text. Make cuts under a custom...
souvikg544/TTS_Data_Maker
Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio...