tincans-ai/gazelle

Joint speech-language model - respond directly to audio!

41
/ 100
Emerging

This project offers a speech-language model that can directly understand and respond to spoken audio. You provide an audio input, and the model processes both the speech and language to generate a text response. This tool is designed for developers or researchers building applications that require a language model to react immediately to voice commands or spoken content, without needing a separate transcription step.

373 stars. No commits in the last 6 months.

Use this if you are a developer experimenting with advanced AI models that can directly process and respond to spoken audio inputs.

Not ideal if you need a robust, production-ready solution for real-world applications, as these initial versions are not optimized or secure against adversarial attacks.

speech-to-text-AI natural-language-processing AI-model-development voice-AI spoken-language-understanding
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 15 / 25

How are scores calculated?

Stars

373

Forks

33

Language

Python

License

Apache-2.0

Last pushed

Jul 01, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/tincans-ai/gazelle"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.