Zheng-Chong/CatVTON
[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).
This project helps online retailers and fashion brands create photorealistic images of clothing on models without physical photoshoots. You provide an image of a person and an image of a garment, and it generates a new image showing the person "wearing" that garment. This is ideal for e-commerce, virtual try-on experiences, or digital fashion design, allowing marketers and product managers to quickly visualize new collections.
1,615 stars.
Use this if you need to generate high-quality images of clothing on diverse models quickly and cost-effectively, reducing the need for traditional photography.
Not ideal if you require real-time, interactive video try-on experiences, though a future version may offer this capability.
Stars
1,615
Forks
207
Language
Python
License
—
Category
Last pushed
Dec 16, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/Zheng-Chong/CatVTON"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related models
rizavelioglu/tryoffdiff
[CVPR'25-Demo] Official repository of "TryOffDiff: Virtual-Try-Off via High-Fidelity Garment...
muzishen/IMAGDressing
[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It...
muzishen/IMAGGarment
[TVCG 2026] 🎨 IMAGGarment🎨 : Fine-Grained Garment Generation with Controllable Structure,...
miccunifi/ladi-vton
[ACM MM 2023] - LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On
aimagelab/multimodal-garment-designer
This is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent...