Henry-23/VideoChat
Real-time voice-interactive digital human with customizable appearance and voice, voice cloning support, and dialogue latency (time to the first response packet) as low as 3 s.
This tool helps create real-time interactive digital humans that can converse naturally. You provide the avatar's visual appearance and either choose a preset voice or clone a new one from a short audio sample. The output is a digital human that can hold low-latency voice conversations, suitable for customer service, virtual assistants, or interactive media experiences.
Use this if you need to deploy a digital human for real-time voice interaction, with customizable appearance and voice, and quick response times.
Not ideal if your primary need is static video generation or highly complex, multi-modal human-like behaviors beyond real-time voice conversation.
Stars: 1,223
Forks: 158
Language: Python
License: MIT
Category:
Last pushed: Dec 18, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Henry-23/VideoChat"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
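The same request can be made from Python. The sketch below is a minimal illustration, not the service's documented client: it assumes the path follows the pattern category/owner/name inferred from the single curl example above, and that the response body is JSON, which this page does not confirm. The names `build_quality_url` and `fetch_quality` are hypothetical helpers introduced here.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for a repo given in 'owner/name' form.

    Assumption: the path layout is <category>/<owner>/<name>, inferred
    from the one example URL on this page.
    """
    owner, name = repo.split("/", 1)
    parts = (category, owner, name)
    return BASE_URL + "/" + "/".join(quote(part, safe="") for part in parts)


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch the endpoint and decode the body.

    Assumption: the response is JSON; its fields are not documented here,
    so the decoded object is returned as-is.
    """
    request = urllib.request.Request(
        build_quality_url(category, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to hit the network.
    print(build_quality_url("voice-ai", "Henry-23/VideoChat"))
```

Because the keyless tier is limited to 100 requests per day, it is worth caching the decoded result locally rather than calling `fetch_quality` on every run.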
Related tools
met4citizen/TalkingHead
Talking Head (3D): A JavaScript class for real-time lip-sync using full-body 3D avatars.
livekit/livekit
End-to-end realtime stack for connecting humans and AI
dmisol/flexatar-virtual-webcam
Personalized Virtual Webcam for WebRTC
zslrmhb/Omniverse-Virtual-Assisstant
Audio2Face Avatar with Riva SDK functionality
Sgvkamalakar/Azure-Talking-Avatar
Explore the power of Azure Text-to-Speech with interactive talking avatar, Lisa 👩🏻‍🦱. Choose...