khalooei/Voxtral-AI-Demo-Local-Interface

Voxtral is a state-of-the-art model developed to handle both speech transcription and audio understanding with remarkable accuracy and efficiency. This demo interface lets you run the Voxtral model on powerful GPUs to evaluate its performance and see how it can be used for transcription and deeper analysis.

Emerging

This interface helps you analyze spoken content from audio files. You provide audio recordings, and it delivers accurate text transcriptions, summaries, or answers to questions about what was said. Anyone who needs to extract insights, meaning, or written records from spoken information would find this valuable.

No commits in the last 6 months.

Use this if you need to quickly and accurately transcribe, summarize, or extract information from audio recordings, especially long-form and multilingual speech.

Not ideal if you don't have access to powerful GPUs (like NVIDIA H100/A100 or RTX 3090/4090) to run the models efficiently.

audio-transcription speech-analysis meeting-minutes content-summarization multilingual-communication

Stale 6m, No Package, No Dependents

Maintenance 2 / 25

Adoption 7 / 25

Maturity 15 / 25

Community 6 / 25

Language

Python

License

MIT

Higher-rated alternatives

supertone-inc/supertonic

Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.

roryeckel/wyoming_openai

OpenAI-Compatible Proxy Middleware for the Wyoming Protocol

PyThaiNLP/PyThaiTTS

Open Source Thai Text-to-speech library in Python

Ailln/cn2an

📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more)

i3thuan5/tai5-uan5_gian5-gi2_kang1-ku7

Taiwanese language tools
