visinf/lnfmm

Latent Normalizing Flows for Many-to-Many Cross Domain Mappings (ICLR 2020)

/ 100

Emerging

This project helps researchers and developers explore and implement advanced AI models that can generate descriptive captions for images or create images from text descriptions. It takes a collection of images and their corresponding text (like the COCO dataset) and outputs models capable of these cross-domain generations. It's intended for AI/ML researchers, data scientists, and engineers working on natural language processing and computer vision tasks.

No commits in the last 6 months.

Use this if you are an AI/ML researcher or practitioner looking to train and experiment with cutting-edge models for generating text from images or images from text.

Not ideal if you are an end-user seeking a ready-to-use application for image captioning or text-to-image generation without needing to engage with model training or code.

image-captioning text-to-image-generation computer-vision natural-language-processing deep-learning-research

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 18 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

milad1378yz/MOTFM

Flow Matching for Medical Image Synthesis: Bridging the Gap Between Speed and Quality

OpenImagingLab/FlashVSR

[CVPR 2026] Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient...

X-GenGroup/Flow-Factory

A unified framework for easy reinforcement learning in Flow-Matching models

fallenshock/FlowEdit

Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using...

haidog-yaqub/MeanFlow

Pytorch Implementation (unofficial) of the paper "Mean Flows for One-step Generative Modeling"...

Explore Diffusion Models

All categories Trending Diffusion directory Insights