FreedomIntelligence/EchoX
EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs
This project helps create speech-to-speech AI assistants that can accurately understand spoken questions and respond intelligently in natural-sounding speech. It takes spoken language as input and generates high-quality, relevant spoken answers. Anyone building or deploying AI systems for customer service, virtual assistants, or interactive voice applications would use this.
No commits in the last 6 months.
Use this if you need a robust speech-to-speech AI model that excels at knowledge-based question answering with efficient training.
Not ideal if your primary need is simple speech transcription or text-to-speech without complex spoken reasoning capabilities.
Stars
47
Forks
7
Language
Python
License
Apache-2.0
Category
Last pushed
Sep 19, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/FreedomIntelligence/EchoX"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
aalok-sathe/surprisal
A unified interface for computing surprisal (log probabilities) from language models! Supports...
EvolvingLMMs-Lab/lmms-engine
A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.
FunnySaltyFish/Better-Ruozhiba
【逐条处理完成】人为审核+修改每一条的弱智吧精选问题QA数据集
reasoning-machines/pal
PaL: Program-Aided Language Models (ICML 2023)
microsoft/monitors4codegen
Code and Data artifact for NeurIPS 2023 paper - "Monitor-Guided Decoding of Code LMs with Static...