deep-div/Fine-Tuning-LLMs-and-VisionModels
Fine-Tuning LLMs (Gemma, LLaMA, Mistral, etc.): a practical guide to fine-tuning various large language models using popular frameworks. Includes examples, scripts, and tips for efficient training on custom datasets.
This repository helps machine learning engineers and researchers adapt open-source language models such as Gemma or LLaMA to specific tasks or datasets. You provide your custom text data and an existing open-source language model, and the result is a specialized version of that model that generates responses tailored to your requirements. It suits anyone building AI applications that need to understand or generate highly domain-specific text.
No commits in the last 6 months.
Use this if you need to customize an open-source large language model to perform better on your specific text data or for a niche application, rather than using a general-purpose model out-of-the-box.
Not ideal if you are looking for a pre-trained model to use directly without any customization or if you are not comfortable working with machine learning frameworks and code.
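As a rough illustration of the workflow described above (custom text data plus an existing open-source model in, a specialized model out), here is a minimal sketch using Hugging Face `transformers`, `datasets`, and `peft`. It is not taken from the repository's notebooks. The prompt template, the LoRA hyperparameters, the `q_proj`/`v_proj` target modules, and the helper names `to_training_text` and `fine_tune` are all assumptions chosen for illustration.

```python
def to_training_text(prompt: str, response: str) -> str:
    """Join one prompt/response pair into a single training string.

    The "### Instruction / ### Response" template is an assumption,
    not a format taken from the repository.
    """
    return f"### Instruction:\n{prompt.strip()}\n\n### Response:\n{response.strip()}"


def fine_tune(model_name: str, pairs: list[tuple[str, str]], output_dir: str) -> None:
    """Attach LoRA adapters to a causal LM and train them on the given pairs."""
    # Third-party imports stay local so the helper above works without them.
    from datasets import Dataset
    from peft import LoraConfig, get_peft_model
    from transformers import (
        AutoModelForCausalLM,
        AutoTokenizer,
        DataCollatorForLanguageModeling,
        Trainer,
        TrainingArguments,
    )

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    if tokenizer.pad_token is None:
        # Some causal LM tokenizers ship without a padding token.
        tokenizer.pad_token = tokenizer.eos_token

    model = AutoModelForCausalLM.from_pretrained(model_name)
    lora_config = LoraConfig(
        r=8,                                  # adapter rank (assumed value)
        lora_alpha=16,                        # scaling factor (assumed value)
        lora_dropout=0.05,
        target_modules=["q_proj", "v_proj"],  # assumes LLaMA-style layer names
        task_type="CAUSAL_LM",
    )
    model = get_peft_model(model, lora_config)

    # Build a tokenized dataset from the formatted text.
    dataset = Dataset.from_list([{"text": to_training_text(p, r)} for p, r in pairs])
    dataset = dataset.map(
        lambda row: tokenizer(row["text"], truncation=True, max_length=512),
        remove_columns=["text"],
    )

    trainer = Trainer(
        model=model,
        args=TrainingArguments(
            output_dir=output_dir,
            num_train_epochs=1,
            per_device_train_batch_size=1,
            report_to="none",
        ),
        train_dataset=dataset,
        # mlm=False makes the collator copy input_ids into labels for causal LM loss.
        data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
    )
    trainer.train()
    model.save_pretrained(output_dir)  # writes only the adapter weights
```

Calling `fine_tune` with a model identifier, a list of `(prompt, response)` pairs, and an output directory would train only the adapter weights, which is one common way to keep memory use manageable; full-parameter fine-tuning is the alternative when hardware allows.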
Stars: 17
Forks: 4
Language: Jupyter Notebook
License: Apache-2.0
Category: (not listed)
Last pushed: Sep 06, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/deep-div/Fine-Tuning-LLMs-and-VisionModels"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
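The same request can be made from Python with only the standard library. This is a sketch under assumptions: the path layout is inferred from the single curl example above, the parameter name `collection` for the `transformers` path segment is hypothetical, and the response format is not documented here, so the body is returned as raw text (UTF-8 is assumed).

```python
from urllib.parse import quote
from urllib.request import urlopen

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(collection: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, mirroring the path shape of the curl example."""
    parts = (collection, owner, repo)
    return BASE_URL + "/" + "/".join(quote(part, safe="") for part in parts)


def fetch_quality(collection: str, owner: str, repo: str, timeout: float = 10.0) -> str:
    """Fetch the listing data and return the response body as text.

    The body is not parsed because its format is not stated on the page.
    """
    with urlopen(quality_url(collection, owner, repo), timeout=timeout) as response:
        return response.read().decode("utf-8")  # UTF-8 is an assumption


if __name__ == "__main__":
    print(fetch_quality("transformers", "deep-div", "Fine-Tuning-LLMs-and-VisionModels"))
```

With the stated limit of 100 requests per day without a key, it is worth caching the response locally rather than calling the endpoint in a loop.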
Higher-rated alternatives
unslothai/unsloth
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama,...
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
modelscope/ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5,...
oumi-ai/oumi
Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training