YYZhang2025/Pali-Gemma

Implements a multi-modality LLM and fine-tunes it using LoRA. Depends only on PyTorch; no other "fancy" libraries.

22 / 100 (Experimental)

This tool helps you build and fine-tune AI models that understand both images and text. You feed it images with corresponding text descriptions, and it produces a fine-tuned model that can process new multi-modal data and generate insights from it. It is aimed at AI practitioners and researchers who want to customize multi-modal large language models.

Use this if you need to adapt an AI model to interpret and reason about both visual and textual information for a specific application.

Not ideal if you want a ready-to-use, pre-trained multi-modal model with no customization or fine-tuning required.
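
For context: LoRA freezes the pretrained weights and trains only a low-rank update added to selected linear layers, which is what keeps the dependency footprint at plain PyTorch. Below is a minimal, illustrative sketch of that idea; the class name LoRALinear and the hyperparameters r and alpha are assumptions for illustration, not this repo's actual code.

import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    # Wraps a frozen nn.Linear with a trainable low-rank update:
    # y = W x + (alpha / r) * B (A x), where A is (r x in) and B is (out x r).
    def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        self.base.weight.requires_grad_(False)      # freeze pretrained weight
        if self.base.bias is not None:
            self.base.bias.requires_grad_(False)    # freeze bias too
        self.scale = alpha / r
        self.lora_A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(base.out_features, r))  # zero init: no change at step 0

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.scale * (x @ self.lora_A.T @ self.lora_B.T)

# Swap in the adapter and train only the LoRA parameters.
layer = nn.Linear(768, 768)
lora = LoRALinear(layer, r=8, alpha=16)
out = lora(torch.randn(2, 768))
n_trainable = sum(p.numel() for p in lora.parameters() if p.requires_grad)
print(out.shape, n_trainable)  # torch.Size([2, 768]) 12288

Training then optimizes only lora_A and lora_B, a small fraction of the full weight count.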

Tags: AI model customization · multi-modal AI · deep learning · fine-tuning · computer vision AI · natural language processing AI
No License · No Package · No Dependents
Maintenance: 10 / 25
Adoption: 5 / 25
Maturity: 7 / 25
Community: 0 / 25

The overall score is the sum of the four sub-scores: 10 + 5 + 7 + 0 = 22.

Stars: 9
Forks:
Language: Jupyter Notebook
License: None
Category: llm-fine-tuning
Last pushed: Feb 12, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/YYZhang2025/Pali-Gemma"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
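
The same request from Python, as a minimal sketch using only the standard library (the response is assumed to be JSON; its field names are not documented here):

import json
from urllib.request import urlopen

url = "https://pt-edge.onrender.com/api/v1/quality/transformers/YYZhang2025/Pali-Gemma"
with urlopen(url, timeout=10) as resp:
    data = json.load(resp)          # response is assumed to be JSON
print(json.dumps(data, indent=2))   # inspect whatever fields the endpoint returns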