ShivamDuggal4/UNITE-tokenization-generation

Single-stage End-to-End Training for Tokenization and Generation

25
/ 100
Experimental

This project helps machine learning researchers efficiently train generative models that create high-fidelity images or molecular structures. It takes raw image datasets (like ImageNet) or molecule data as input and produces a trained model capable of both converting images into a compressed 'latent' representation (tokenization) and generating new, realistic images or molecules from text descriptions or noise. This is ideal for researchers developing new latent diffusion models or exploring efficient training methods for generative AI.

Use this if you are a researcher focused on developing or improving large-scale image and molecule generation models, and you want to streamline the training process by unifying tokenization and generation.

Not ideal if you are a practitioner looking for an out-of-the-box tool to generate images without deep involvement in model architecture and training.

generative-ai image-synthesis computational-chemistry machine-learning-research deep-learning-training
No License No Package No Dependents
Maintenance 13 / 25
Adoption 8 / 25
Maturity 1 / 25
Community 3 / 25

How are scores calculated?

Stars

62

Forks

1

Language

Python

License

Last pushed

Mar 24, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/ShivamDuggal4/UNITE-tokenization-generation"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.