PunithVT/ai-avatar-system
🎭 Open-source AI avatar platform — upload a photo, clone a voice, talk to any face in real time. Lip-sync video, voice cloning, WebSocket streaming. Powered by Claude, Whisper & SadTalker.
This platform helps you create lifelike AI-driven digital spokespeople. You provide a photo of a person and a short audio clip of their voice, and the system generates real-time video of that person speaking any text you input, with perfectly synchronized lip movements. It's designed for content creators, marketers, educators, and anyone needing a custom, dynamic virtual presenter.
Use this if you need to quickly generate video content with a custom, talking avatar that can speak in various languages and respond dynamically in real time.
Not ideal if you're looking for pre-rendered, high-fidelity animated characters with complex body language, as this focuses on realistic lip-sync and voice cloning from a static image.
Stars
7
Forks
1
Language
Python
License
—
Category
Last pushed
Mar 26, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/PunithVT/ai-avatar-system"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
met4citizen/TalkingHead
Talking Head (3D): A JavaScript class for real-time lip-sync using full-body 3D avatars.
livekit/livekit
End-to-end realtime stack for connecting humans and AI
Henry-23/VideoChat
实时交互数字人,可自定义形象与音色,支持音色克隆,对话延迟低至3s。Real-time voice interactive digital human, customizable...
dmisol/flexatar-virtual-webcam
Personalized Virtual Webcam for WebRTC
zslrmhb/Omniverse-Virtual-Assisstant
Audio2Face Avatar with Riva SDK functionality