khalooei/Voxtral-AI-Demo-Local-Interface
Voxtral is a state-of-the-art model developed to handle both speech transcription and audio understanding with remarkable accuracy and efficiency. This demo interface lets you run the Voxtral model on powerful GPUs to evaluate its performance and see how it can be used for transcription and deeper analysis.
This interface helps you analyze spoken content from audio files. You provide audio recordings, and it delivers accurate text transcriptions, summaries, or answers to questions about what was said. Anyone who needs to extract insights, meaning, or written records from spoken information would find this valuable.
No commits in the last 6 months.
Use this if you need to quickly and accurately transcribe, summarize, or extract information from audio recordings, especially long-form and multilingual speech.
Not ideal if you don't have access to powerful GPUs (like NVIDIA H100/A100 or RTX 3090/4090) to run the models efficiently.
Stars
29
Forks
2
Language
Python
License
MIT
Category
Last pushed
Jul 26, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/khalooei/Voxtral-AI-Demo-Local-Interface"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
supertone-inc/supertonic
Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.
roryeckel/wyoming_openai
OpenAI-Compatible Proxy Middleware for the Wyoming Protocol
PyThaiNLP/PyThaiTTS
Open Source Thai Text-to-speech library in Python
Ailln/cn2an
📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
i3thuan5/tai5-uan5_gian5-gi2_kang1-ku7
臺灣言語工具