visinf/lnfmm
Latent Normalizing Flows for Many-to-Many Cross Domain Mappings (ICLR 2020)
This project helps researchers and developers explore and implement advanced AI models that can generate descriptive captions for images or create images from text descriptions. It takes a collection of images and their corresponding text (like the COCO dataset) and outputs models capable of these cross-domain generations. It's intended for AI/ML researchers, data scientists, and engineers working on natural language processing and computer vision tasks.
No commits in the last 6 months.
Use this if you are an AI/ML researcher or practitioner looking to train and experiment with cutting-edge models for generating text from images or images from text.
Not ideal if you are an end-user seeking a ready-to-use application for image captioning or text-to-image generation without needing to engage with model training or code.
Stars
33
Forks
12
Language
Python
License
Apache-2.0
Category
Last pushed
May 12, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/visinf/lnfmm"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
milad1378yz/MOTFM
Flow Matching for Medical Image Synthesis: Bridging the Gap Between Speed and Quality
OpenImagingLab/FlashVSR
[CVPR 2026] Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient...
X-GenGroup/Flow-Factory
A unified framework for easy reinforcement learning in Flow-Matching models
fallenshock/FlowEdit
Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using...
haidog-yaqub/MeanFlow
Pytorch Implementation (unofficial) of the paper "Mean Flows for One-step Generative Modeling"...