Ikaros-521/RealtimeSTT_LLM_TTS
实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果
This tool helps you create a real-time voice assistant or conversational AI that can listen to spoken input, understand it, generate a response using advanced AI models like OpenAI or Zhipu AI, and then speak the response back to you. It takes your spoken words as input and provides spoken answers, creating a seamless, natural dialogue experience. This is ideal for anyone looking to build interactive voice applications, customer service bots, or personal assistants.
434 stars. No commits in the last 6 months.
Use this if you need to build a web-based application that allows users to have live, natural-sounding conversations with an AI agent using their voice.
Not ideal if you primarily need to process pre-recorded audio files or if your application doesn't require real-time spoken interaction.
Stars
434
Forks
53
Language
Python
License
MIT
Category
Last pushed
Dec 31, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/Ikaros-521/RealtimeSTT_LLM_TTS"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
bigsk1/voice-chat-ai
🎙️ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses SparkTTS,...
digiteinfotech/kairon
Agentic AI platform that harnesses Visual LLM Chaining to build proactive digital assistants
withcatai/catai
Run AI ✨ assistant locally! with simple API for Node.js 🚀
AmberSahdev/Open-Interface
Control Any Computer Using LLMs.
second-state/echokit_server
Open Source Voice Agent Platform