Ikaros-521/RealtimeSTT_LLM_TTS

实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果

44
/ 100
Emerging

This tool helps you create a real-time voice assistant or conversational AI that can listen to spoken input, understand it, generate a response using advanced AI models like OpenAI or Zhipu AI, and then speak the response back to you. It takes your spoken words as input and provides spoken answers, creating a seamless, natural dialogue experience. This is ideal for anyone looking to build interactive voice applications, customer service bots, or personal assistants.

434 stars. No commits in the last 6 months.

Use this if you need to build a web-based application that allows users to have live, natural-sounding conversations with an AI agent using their voice.

Not ideal if you primarily need to process pre-recorded audio files or if your application doesn't require real-time spoken interaction.

voice-assistant conversational-ai customer-service-automation interactive-voice-response language-tutoring
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 18 / 25

How are scores calculated?

Stars

434

Forks

53

Language

Python

License

MIT

Last pushed

Dec 31, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/Ikaros-521/RealtimeSTT_LLM_TTS"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.