MooreThreads/MooER
MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not limited to end-to-end speech interaction, end-to-end speech translation and speech recognition.
MooER helps you process spoken language by transforming speech into text or translating it directly into another language. This allows you to understand or communicate across language barriers using only spoken input and output. It's designed for developers building speech interaction applications, particularly those focused on multilingual communication and intelligent assistants.
218 stars. No commits in the last 6 months.
Use this if you need to build applications that can accurately convert spoken words into text (speech recognition) or translate spoken language from one language to another (speech-to-speech translation), especially with Mandarin Chinese support.
Not ideal if your primary need is for purely text-based translation or if you require advanced natural language processing beyond speech interaction, such as complex sentiment analysis or summarization from text.
Stars
218
Forks
17
Language
Python
License
—
Category
Last pushed
Jan 08, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/MooreThreads/MooER"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jingyaogong/minimind-v
🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!
SkyworkAI/Skywork-R1V
Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in...
roboflow/vision-ai-checkup
Take your LLM to the optometrist.
zai-org/GLM-TTS
GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning
NExT-GPT/NExT-GPT
Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model