UCSC-VLAA/CLIPA

[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"

Score: 36/100 (Emerging)

This project offers a way to train advanced image and text recognition models, known as CLIP, much more efficiently and at a lower cost. It takes large datasets of images and their corresponding text descriptions as input, and outputs highly accurate CLIP models that can understand and connect visual and linguistic information. This is for machine learning researchers and practitioners who build and deploy AI models for tasks like image search or content moderation.

319 stars. No commits in the last 6 months.

Use this if you need to train high-performing image-text understanding models like CLIP, but are constrained by significant computing resources and high training costs.

Not ideal if you are looking for a pre-trained model to use directly without any further training or fine-tuning, or if you don't work with large-scale vision-language datasets.

deep-learning computer-vision natural-language-processing model-training AI-research
Stale (6m) · No Package · No Dependents
Maintenance: 0/25
Adoption: 10/25
Maturity: 16/25
Community: 10/25


Stars: 319
Forks: 14
Language: Python
License: Apache-2.0
Last pushed: Jun 03, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/UCSC-VLAA/CLIPA"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
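For scripted use, the same endpoint can be called from Python. A minimal sketch, assuming the API returns JSON (the response schema is not documented on this page, so the fetch helper only parses and returns the payload); the `quality_url` helper is illustrative, not part of any official client:

```python
import json
import urllib.request

BASE = "https://pt-edge.onrender.com/api/v1/quality"

def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality-API URL shown in the curl example above."""
    return f"{BASE}/{category}/{owner}/{repo}"

def fetch_quality(url: str, timeout: float = 10.0) -> dict:
    """Fetch and parse the (assumed JSON) response for a repository."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)

url = quality_url("ml-frameworks", "UCSC-VLAA", "CLIPA")
print(url)
# data = fetch_quality(url)  # no API key needed within the free-tier limit
# print(json.dumps(data, indent=2))
```

The fetch call is left commented out so the snippet runs offline; uncomment it to hit the live endpoint.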