declare-lab/tango
A family of diffusion models for text-to-audio generation.
This tool helps content creators, marketers, and sound designers quickly generate various audio assets from simple text descriptions. You input a phrase like "An audience cheering and clapping," and it outputs realistic sound files like human sounds, animal sounds, or sound effects. It's ideal for anyone needing custom audio without the time or resources for recording or extensive sound library searches.
1,233 stars. No commits in the last 6 months.
Use this if you need to create diverse and realistic sound effects, ambient noises, or short audio clips from text prompts for creative projects, presentations, or digital content.
Not ideal if you require highly specific musical compositions, complex spoken dialogue with unique vocal characteristics, or extremely long audio tracks that demand precise structural control.
Stars
1,233
Forks
108
Language
Python
License
—
Category
Last pushed
Jul 29, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/declare-lab/tango"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ljleb/sd-mecha
Executable State Dict Recipes
SJTU-DENG-Lab/Discrete-Diffusion-Forcing
Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference
Li-Jinsong/DAEDAL
[ICLR 2026] Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for...
SalesforceAIResearch/CoDA
Salesforce AI Research's open diffusion language model
ZhanqiuHu/flash-dlm-experimental
Implementation of Flash-DLM (paper: FlashDLM: Accelerating Diffusion Language Models via...