JinXins/Awesome-Token-Merge-for-MLLMs

A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.

/ 100

Emerging

This is a curated list of research papers and associated codebases focused on optimizing how large multimodal language models (MLLMs) process visual information. It compiles different techniques like 'token merge' to make MLLMs more efficient, taking research papers as input and providing summaries, links to papers, and code as output. The primary users are AI researchers and practitioners working on MLLM development and efficiency.

Use this if you are a researcher or engineer looking for a comprehensive overview of recent advancements in vision token compression methods for multimodal large language models.

Not ideal if you are a non-technical user seeking a ready-to-use application or a general introduction to large language models.

AI-research multimodal-AI LLM-efficiency computer-vision deep-learning-optimization

No Package No Dependents

Maintenance 6 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

—

License

MIT

Higher-rated alternatives

PaddlePaddle/PaddleNLP

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

meta-llama/llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started...

arcee-ai/mergekit

Tools for merging pretrained large language models.

changyeyu/LLM-RL-Visualized

🌟100+ 原创 LLM / RL 原理图📚，《大模型算法》作者巨献！💥（100+ LLM/RL Algorithm Maps ）

mindspore-lab/step_into_llm

MindSpore online courses: Step into LLM

Explore Transformer Models

All categories Trending Transformer directory Insights