LLM Fine-Tuning Tools
Tools, frameworks, and techniques for fine-tuning Large Language Models using methods like LoRA, QLoRA, and instruction tuning on custom datasets. Does NOT include base model training, inference serving, or general LLM applications.
100 LLM fine-tuning tools are tracked. 2 score above 70 (Verified tier). The highest-rated is axolotl-ai-cloud/axolotl at 78/100 with 11,429 stars. 2 of the top 10 are actively maintained.
Get all 100 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=llm-tools&subcategory=llm-fine-tuning&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
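The same request can be sketched in Python using only the standard library. The endpoint and query parameters are copied from the curl command above; the helper names `build_url` and `fetch_projects` are hypothetical, the response schema is not documented here so the code returns the decoded JSON without assuming its shape, and whether `limit` may be raised above 20 is an unconfirmed assumption.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="llm-tools", subcategory="llm-fine-tuning", limit=20):
    """Build the dataset URL with the same query parameters as the curl command."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """GET the URL and decode the body as JSON.

    The structure of the returned object is not assumed; inspect it before use.
    """
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Example call (performs a network request, so it is left commented out):
# data = fetch_projects(build_url())
```

Keeping URL construction separate from the network call makes it easy to check the query string without spending any of the daily request quota.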
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 | axolotl-ai-cloud/axolotl: Go ahead and axolotl questions | | Verified |
| 2 | google/paxml: Pax is a Jax-based machine learning framework for training large scale... | | Verified |
| 3 | JosefAlbers/PVM: Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon | | Emerging |
| 4 | iamarunbrahma/finetuned-qlora-falcon7b-medical: Finetuning of Falcon-7B LLM using QLoRA on Mental Health Conversational Dataset | | Emerging |
| 5 | h2oai/h2o-wizardlm: Open-Source Implementation of WizardLM to turn documents into Q:A pairs for... | | Emerging |
| 6 | WangRongsheng/Aurora: The official codes for "Aurora: Activating chinese chat capability for... | | Emerging |
| 7 | unit-mesh/unit-minions: "Boosting AI R&D Efficiency: Train Your Own LoRA", covering Llama (Alpaca LoRA) models, ChatGLM (ChatGLM Tuning) and related... | | Emerging |
| 8 | MoHussein197/dgx-spark-finetune-llm: 🔧 Fine-tune large language models efficiently on NVIDIA DGX Spark with LoRA... | | Emerging |
| 9 | CrazyBoyM/phi3-Chinese: Phi3 Chinese post-training model repository | | Emerging |
| 10 | anakin87/qwen-scheduler-grpo: Train a Language Model with GRPO to create a schedule from a list of events... | | Emerging |
| 11 | ThomasRochefortB/bettercallbloom: Let's finetune BLOOM-3B on Pile of Law - r/legal_advice | | Emerging |
| 12 | WangRongsheng/MedQA-ChatGLM: 🛰️ Fine-tuning ChatGLM with LoRA, P-Tuning V2, Freeze, RLHF, and other methods on real medical dialogue data; our vision goes beyond medical Q&A | | Emerging |
| 13 | Breeze648/MedCoT-7B: This project fine-tunes Deepseek-R1-Distill-Qwen-7B on medical-domain CoT data, using QLoRA quantization and Unsloth... | | Emerging |
| 14 | prakash-aryan/qwen-arabic-project: This project fine-tunes the Qwen2-1.5B model for Arabic language tasks using... | | Emerging |
| 15 | Nano-Collective/nanotune: A simple, interactive CLI for fine-tuning small language models on Apple... | | Emerging |
| 16 | GURPREETKAURJETHRA/Phi-3-LLM-by-Microsoft: Phi-3 LLM by Microsoft | | Emerging |
| 17 | HomoScriptor-Project/HomoScriptor: Fuel innovation and advance language models with HomoScriptor: A vibrant,... | | Emerging |
| 18 | InternLM/Agent-FLAN: [ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent... | | Emerging |
| 19 | aws-samples/lambda-gen-ai-endpoint-blog: This repository guides you through the process of using transfer learning to... | | Emerging |
| 20 | alaradirik/finetune-phi-2: Fine tune Phi 2 for persona grounded chat | | Emerging |
| 21 | huawei-csl/AC-LoRA: Welcome to the official repository of AC-LORA: (Almost) Training-Free Access... | | Emerging |
| 22 | hyintell/BLOOM-fine-tuning: Finetune BLOOM | | Emerging |
| 23 | carbonz0/alpaca-chinese-dataset: Alpaca Chinese instruction fine-tuning dataset | | Emerging |
| 24 | graphcore/flan-t5: Notebook for Flan-T5 – an alternative to large language models like GPT-3 &... | | Emerging |
| 25 | niuwz/Mini-Chinese-Phi3: A small-parameter LLM based on the Phi3 model architecture, trained from scratch on common Chinese corpora. Covers tokenizer training, model pre-training, instruction fine-tuning, and direct preference optimization. | | Emerging |
| 26 | l11x0m7/LMPresent: Including pre-trained language models for fine-tuning on other NLP tasks | | Emerging |
| 27 | bupticybee/FastLoRAChat: Instruct-tune LLaMA on consumer hardware with shareGPT data | | Emerging |
| 28 | kevintsai/Finetuning-Large-Language-Models: Jupyter notebooks for course Finetuning Large Language Models, taught by... | | Emerging |
| 29 | evanatyourservice/llm-jax: Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers. | | Experimental |
| 30 | dvianna/LegalQA-bloomz-560m: Finetuning a small BLOOMZ model (bloomz-560m) on a small dataset and with... | | Experimental |
| 31 | AnnaValentinaHirsch/Web3CodeLLM: Finetuning Starcoder2 to assist the development of decentralised NEAR dApps | | Experimental |
| 32 | Victorletzelter/LoRA-MCL: Multiple Choice Learning of Low Rank Adapters for Language Modeling | | Experimental |
| 33 | gallen881/Physics_Master: Physics Master is a model fine-tuned from llama3-8B-Instruct. It can answer... | | Experimental |
| 34 | LittleLittleCloud/Torchsharp-phi: Torchsharp port of phi-series model | | Experimental |
| 35 | sachink1729/Finetuning-Mistral-7B-Chat-Doctor-Huggingface-LoRA-PEFT: Finetuning Mistral-7B into a Medical Chat Doctor using Huggingface 🤗+ QLoRA + PEFT. | | Experimental |
| 36 | uncase-ai/UNCASE: Open-source framework for turning expert knowledge into PII-free synthetic... | | Experimental |
| 37 | Lichang-Chen/AlpaGasus: A better Alpaca Model Trained with Less Data (only 9k instructions of the... | | Experimental |
| 38 | amjadmajid/llm_toaster: LLM Toaster enables you to train and fine-tune mini-GPTs. | | Experimental |
| 39 | daniau23/LoRAfrica_CPU: Deploying LoRAfrica on consumer CPU devices | | Experimental |
| 40 | mags0ft/simple-sft: Build functionally complete, extremely high-quality SFT datasets for... | | Experimental |
| 41 | robuno/Title-Generator-with-LLM-QLoRa: Fine-tuning LLMs with LoRA to generate titles from the given abstract,... | | Experimental |
| 42 | Siesher/Qwen3_LoRA_pet: 🐉 Fine-tuning Qwen3 with LoRA for custom tasks | | Experimental |
| 43 | faezeh-gholamrezaie/Fine-Tuning-Large-Language-Models-for-Sleep-Stage-Classification: Fine-tuning Large Language Models (LLMs) using QLoRA on EEG data for... | | Experimental |
| 44 | zamfir70/transxlab: Training architect CLI - validate and design LLM fine-tuning runs before you... | | Experimental |
| 45 | tomoeOOseven/gptoss120b-qlora-mathreasoning: KrackHack 3.0 submission - Domain: Gen AI \| PS: Open Innovation - ... | | Experimental |
| 46 | enoreese/mechanic-gpt: A fine-tuned LLM great at answering questions about car repairs and maintenance. | | Experimental |
| 47 | christinajoslin/faq-generation: CLiFF (Clustering & Language model integration for FAQ Formation) | | Experimental |
| 48 | Shreyash-Gaur/Nyaya-LLM: An ablation study adapting 4B-parameter LLMs (Qwen-2.5, Gemma-3, Phi-4) to... | | Experimental |
| 49 | fb3rasp/finetune-ingest: Ability to finetune LLMs and generate training data using provided documents... | | Experimental |
| 50 | bmaxdk/lightweight-fine-tuning-customer-support: PEFT Customer Support Chatbot | | Experimental |
| 51 | tonyreina/trl: Transformer Reinforcement Learning for Health Generative AI | | Experimental |
| 52 | anujsahani01/PyLoomer: Python Code Completion bot | | Experimental |
| 53 | Pavansomisetty21/Vision_Finetuning_Unsloth_Radiography-Image-Captioning: In this we fine tune Llama-3.2-11B-Vision-Instruct model on... | | Experimental |
| 54 | zekaouinoureddine/BioMed-LLaMa-3: BioMed-LLaMa-3: Instruction-Efficient Fine-Tuning of Large Language Models... | | Experimental |
| 55 | KayvanShah1/UniFAQ: Fine-Tuned LLM-Based FAQ Generation for University Admissions: A project... | | Experimental |
| 56 | shadynasrat/RDMM: RDMM: Fine-Tuned LLM Models for On-Device Robotic Decision Making with... | | Experimental |
| 57 | royxlead/autollmforge-python: Fine-tune any large language model with intelligent QLoRA optimization | | Experimental |
| 58 | xingmingxu/LiteSight: Efficient Chart Summarization with LoRA | | Experimental |
| 59 | apudasm10/region-aware-vlm-finetune: Pipeline for finetuning VLMs with region-aware inputs. Trains on custom... | | Experimental |
| 60 | AkhileshMalthi/selftune: A self-service platform that enables users to fine-tune Large Language... | | Experimental |
| 61 | bshtmichielsen/expert_chat: Using a LoRA to make a LLM talk about a subject I like. | | Experimental |
| 62 | SaniyaBekova/kazakh-llm-finetuning: LLM fine-tuning for Kazakh fairy tale generation using QLoRA, SFT, DPO | | Experimental |
| 63 | cre8vdj/cre8v-ai-finetune: Fine-tune Llama 2 / Mistral with LoRA & QLoRA using PEFT. Runs on free Colab... | | Experimental |
| 64 | Nihal108-bi/Emotion-Aware-Conversational-AI-QLoRA-Fine-Tuned-7B-LLM-: Fine-tuned 7B LLM for empathetic emotional-support dialogue using QLoRA.... | | Experimental |
| 65 | flaviengeoffray/loRa-reimplem: A practical reimplementation of the Low-Rank Adaptation (LoRA) paper for... | | Experimental |
| 66 | jasonjiang8866/peft-fine-tuning-recipes-classification: Working recipes for sequential classification finetuning using peft | | Experimental |
| 67 | fabiantoh98/finetune-llm: Fine-tuning LLMs with QLoRA on consumer GPUs - includes training,... | | Experimental |
| 68 | Dhwani-Chande/Natural-Language-to-Bash-Translation-using-LLMs: Fine-tuned Llama-3.2-1B & Qwen2.5-Coder on 40K NL→Bash pairs. Includes... | | Experimental |
| 69 | btboilerplate/Llama-2: Fine-tunes LLaMA-2 using QLoRA for instruction-style text generation,... | | Experimental |
| 70 | YanSte/NLP-LLM-Fine-tuning-QA-LoRA-T5: Natural Language Processing (NLP) and Large Language Models (LLM) with... | | Experimental |
| 71 | myatthukyaw/ft-llm: Finetuning LLMs using Hugging Face | | Experimental |
| 72 | Anonymous-user-00/FLoRIST: Official implementation of FLoRIST: efficient and accurate federated... | | Experimental |
| 73 | 123RohitVarshit/FINETUNED_DEEPSEEK-R1: Fine-tuning the DeepSeek-LLM to create a medical expert for advanced... | | Experimental |
| 74 | gamithasam/notion-qwen2.5-1.5B: Fine-tuning notebook for creating a Notion template generator using... | | Experimental |
| 75 | ShubhammS18/finetune-json-extractor: Fine-tuned Qwen2.5-7B on Fireworks AI for structured JSON extraction from... | | Experimental |
| 76 | 1nilx2/Deep-Learning: LLM, VLLM Models | | Experimental |
| 77 | BetikuOluwatobi/clinical-instruct-api: Fine-tuned GPT-2 (355M) language model for clinical reasoning tasks. | | Experimental |
| 78 | avishek04/MedLam: A Medical Assistant based on Llama 3.1 | | Experimental |
| 79 | 0x7o/ae: Scalable code for training and fine-tuning language models on TPUs | | Experimental |
| 80 | haturusinghe/subasa-llm: A task-specific fine-tuning framework for large language models (Llama,... | | Experimental |
| 81 | quamernasim/Fine-Tuning-Mistral-7B-Using-Llama-Factory: Fine-tuning of Mistral-7b using Llama-Factory | | Experimental |
| 82 | soheil-mp/Llama2: Fine-tuning the Llama2 model | | Experimental |
| 83 | vimarsh6739/DejaVu-llama: Exploring contextual sparsity in Llama2 | | Experimental |
| 84 | SinnieOnFire/jsonl-finetune: Python script to transform a set of localization .json files into a .jsonl... | | Experimental |
| 85 | NamrataThakur/Fine-tuning-LLMs-Strategies: Different Strategies to Fine-Tune a Large Language Model. We cover 4... | | Experimental |
| 86 | nsrinidhibhat/fine-tune-llama-2: This project streamlines the fine-tuning process, enabling you to leverage... | | Experimental |
| 87 | vritansh/talk-to-you-now-llm: LLM Finetuning: Falcon 7 Billion Model trained on Mental Health conversations | | Experimental |
| 88 | nayeem01/fine-tuning-llama: Fine tuning llama3.1 8b with unsloth | | Experimental |
| 89 | ajf1016/Fine-Tuning-Qwen1.5-0.5B: Fine Tuning Qwen1.5-0.5B LLM with India Law \| Indian Legal Acts \| Penal Code... | | Experimental |
| 90 | 0x11c11e/the-art-of-fine-tuning: This repository houses a wealth of resources on the fine-tuning of large... | | Experimental |
| 91 | thillai-c/MediQuill-llama2: A model fine tuned on llama-2 to solve medical queries | | Experimental |
| 92 | trjo1/genaiwithllms: Fine-tuned FLAN T-5 using Instruction Fine-Tuning (Full), LoRA-based PEFT,... | | Experimental |
| 93 | Maximo-Rulli/PoLLiBLOOM: Fine-tuning BLOOM to generate Polimi style physics exercises | | Experimental |
| 94 | Oyebamiji-Micheal/Llama-for-UTME-preparation: Fine-tuning Llama on past UTME questions using unsloth | | Experimental |
| 95 | Jaskirat-singh04/Tunewizard: This is the official Github Repo for Tunewizard-GUI Based Fine-Tuning of... | | Experimental |
| 96 | maidacundo/falcon-7b-sql: Implementation for fine-tuning a Falcon-7b model using QLoRA on the Spider... | | Experimental |
| 97 | naveen-v-v/LLM_fine_tune_lora: Fine tune a Large Language Model using LORA to perform Sentiment Analysis | | Experimental |
| 98 | Seanaaa0/GPT-CoT: Fine-tuning Phi-2 with LoRA for grid-based spatial reasoning and... | | Experimental |
| 99 | sdpetrides/t5x-train-and-test: Pre-training and fine-tuning experiments with T5 | | Experimental |
| 100 | Ravi-Teja-konda/TunedLlavaDelights: Explore the rich flavors of Indian desserts with TunedLlavaDelights.... | | Experimental |