karim23657/ParsiGoo
ParsiGoo is a Persian multispeaker dataset for text-to-speech purposes. It includes recordings from different speakers and is designed to be used for training and evaluating text-to-speech models.
This is a collection of recorded Persian speech from various individuals, specifically gathered to help create and test text-to-speech systems. It provides the audio recordings needed to train a computer to speak Persian with different voices. Speech synthesis researchers and developers working on Persian language applications would use this dataset.
No commits in the last 6 months.
Use this if you are developing or evaluating a system that converts Persian text into spoken audio.
Not ideal if you need a dataset for voice recognition, speech translation, or for languages other than Persian.
Stars
10
Forks
—
Language
—
License
—
Category
Last pushed
Jun 12, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/karim23657/ParsiGoo"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
pnnbao97/VieNeu-TTS
Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio...
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
babysor/MockingBird
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
r9y9/nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
Softcatala/open-dubbing
Open dubbing is an AI dubbing system which uses machine learning models to automatically...