Paranioar/Awesome_Matching_Pretraining_Transfering

The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.

46
/ 100
Emerging

This is a curated collection of research papers and tutorials focused on large multi-modality models, covering their perception, generation, and unification capabilities. It also includes resources on efficient finetuning techniques and methods for vision-language pretraining. Researchers and AI practitioners working on advanced AI models would use this to understand the current landscape and latest advancements in combining different data types like images and text.

445 stars. No commits in the last 6 months.

Use this if you are a researcher or AI practitioner looking for a comprehensive overview and resources on multi-modal AI models, including efficient training and pretraining techniques.

Not ideal if you are an end-user seeking a ready-to-use AI tool or application for a specific task rather than academic research and model development.

AI research multi-modal learning deep learning architectures model training computer vision
Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 18 / 25

How are scores calculated?

Stars

445

Forks

49

Language

License

MIT

Last pushed

Sep 25, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Paranioar/Awesome_Matching_Pretraining_Transfering"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.