khalooei/Voxtral-AI-Demo-Local-Interface

Voxtral is a model for speech transcription and audio understanding. This demo interface lets you run Voxtral locally on a powerful GPU to evaluate its performance on transcription and deeper audio analysis.

30 / 100 Emerging

This interface helps you analyze spoken content from audio files. You provide audio recordings, and it delivers accurate text transcriptions, summaries, or answers to questions about what was said. Anyone who needs to extract insights, meaning, or written records from spoken information would find this valuable.

No commits in the last 6 months.

Use this if you need to quickly and accurately transcribe, summarize, or extract information from audio recordings, especially long-form and multilingual speech.

Not ideal if you don't have access to powerful GPUs (like NVIDIA H100/A100 or RTX 3090/4090) to run the models efficiently.

audio-transcription speech-analysis meeting-minutes content-summarization multilingual-communication
Stale 6m, No Package, No Dependents
Maintenance 2 / 25
Adoption 7 / 25
Maturity 15 / 25
Community 6 / 25

Stars 29
Forks 2
Language Python
License MIT
Last pushed Jul 26, 2025
Commits (30d) 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/khalooei/Voxtral-AI-Demo-Local-Interface"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
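The same request can be made from Python. The sketch below uses only the standard library and rebuilds the URL from the curl example above. The function names `quality_url` and `fetch_quality` are hypothetical, reading the `voice-ai` path segment as a category is an assumption, and so is the response being JSON, since the page does not document the response format.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is
# <category>/<owner>/<repo> (the "category" reading is an assumption).
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the quality endpoint URL for one repository."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the quality record and parse it.

    Assumes the endpoint returns JSON; the schema is not documented
    here, so the parsed object is returned without interpretation.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "khalooei", "Voxtral-AI-Demo-Local-Interface")` issues the same GET request as the curl command. With the keyless limit of 100 requests/day, it is worth caching results rather than polling.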