rk-vashista/TTS-Story_Generator
A versatile app that converts images into short stories and lifelike audio locally. It combines Hugging Face's image captioning, Groq's story generation, and Parler TTS for local text-to-speech synthesis. Ideal for AI-driven projects with fast, reliable on-device TTS.
No commits in the last 6 months.
Stars
—
Forks
1
Language
Python
License
MIT
Category
Last pushed
Sep 29, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/rk-vashista/TTS-Story_Generator"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Blaizzy/mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's...
lenML/Speech-AI-Forge
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server...
fishaudio/fish-speech
SOTA Open Source TTS
sidharthrajaram/StyleTTS2
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
mlalma/kokoro-ios
Kokoro TTS for iOS and macOSX