LLM Fine-Tuning LLM Tools

Tools, frameworks, and techniques for fine-tuning Large Language Models using methods like LoRA, QLoRA, and instruction tuning on custom datasets. Does NOT include base model training, inference serving, or general LLM applications.

There are 100 llm fine-tuning tools tracked. 2 score above 70 (verified tier). The highest-rated is axolotl-ai-cloud/axolotl at 78/100 with 11,429 stars. 2 of the top 10 are actively maintained.

Get all 100 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=llm-tools&subcategory=llm-fine-tuning&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 axolotl-ai-cloud/axolotl

Go ahead and axolotl questions

78
Verified
2 google/paxml

Pax is a Jax-based machine learning framework for training large scale...

71
Verified
3 JosefAlbers/PVM

Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon

45
Emerging
4 iamarunbrahma/finetuned-qlora-falcon7b-medical

Finetuning of Falcon-7B LLM using QLoRA on Mental Health Conversational Dataset

41
Emerging
5 h2oai/h2o-wizardlm

Open-Source Implementation of WizardLM to turn documents into Q:A pairs for...

40
Emerging
6 WangRongsheng/Aurora

The official codes for "Aurora: Activating chinese chat capability for...

39
Emerging
7 unit-mesh/unit-minions

《AI 研发提效:自己动手训练 LoRA》,包含 Llama (Alpaca LoRA)模型、ChatGLM (ChatGLM Tuning)相关...

39
Emerging
8 MoHussein197/dgx-spark-finetune-llm

🔧 Fine-tune large language models efficiently on NVIDIA DGX Spark with LoRA...

39
Emerging
9 CrazyBoyM/phi3-Chinese

Phi3 中文后训练模型仓库

38
Emerging
10 anakin87/qwen-scheduler-grpo

Train a Language Model with GRPO to create a schedule from a list of events...

38
Emerging
11 ThomasRochefortB/bettercallbloom

Let's finetune BLOOM-3B on Pile of Law - r/legal_advice

38
Emerging
12 WangRongsheng/MedQA-ChatGLM

🛰️ 基于真实医疗对话数据在ChatGLM上进行LoRA、P-Tuning V2、Freeze、RLHF等微调,我们的眼光不止于医疗问答

37
Emerging
13 Breeze648/MedCoT-7B

本项目利用医学领域的 CoT 数据对 Deepseek-R1-Distill-Qwen-7B 进行微调,通过 QLoRA 量化和 Unsloth...

36
Emerging
14 prakash-aryan/qwen-arabic-project

This project fine-tunes the Qwen2-1.5B model for Arabic language tasks using...

35
Emerging
15 Nano-Collective/nanotune

A simple, interactive CLI for fine-tuning small language models on Apple...

35
Emerging
16 GURPREETKAURJETHRA/Phi-3-LLM-by-Microsoft

Phi-3 LLM by Microsoft

34
Emerging
17 HomoScriptor-Project/HomoScriptor

Fuel innovation and advance language models with HomoScriptor: A vibrant,...

34
Emerging
18 InternLM/Agent-FLAN

[ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent...

34
Emerging
19 aws-samples/lambda-gen-ai-endpoint-blog

This repository guides you through the process of using transfer learning to...

34
Emerging
20 alaradirik/finetune-phi-2

Fine tune Phi 2 for persona grounded chat

33
Emerging
21 huawei-csl/AC-LoRA

Welcome to the official repository of AC-LORA: (Almost) Training-Free Access...

32
Emerging
22 hyintell/BLOOM-fine-tuning

Finetune BLOOM

32
Emerging
23 carbonz0/alpaca-chinese-dataset

alpaca中文指令微调数据集

31
Emerging
24 graphcore/flan-t5

Notebook for Flan-T5 – an alternative to large language models like GPT-3 &...

31
Emerging
25 niuwz/Mini-Chinese-Phi3

基于Phi3模型结构,使用常见的中文预料从零训练的小参数量LLM。包括了tokenizer训练、模型预训练、指令微调和直接偏好优化等流程。

30
Emerging
26 l11x0m7/LMPresent

Including pre-trained language models for fine-tuning on other NLP tasks

30
Emerging
27 bupticybee/FastLoRAChat

Instruct-tune LLaMA on consumer hardware with shareGPT data

30
Emerging
28 kevintsai/Finetuning-Large-Language-Models

Jupyter notebooks for course Finetuning Large Language Models, taught by...

30
Emerging
29 evanatyourservice/llm-jax

Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.

29
Experimental
30 dvianna/LegalQA-bloomz-560m

Finetuning a small BLOOMZ model (bloomz-560m) on a small dataset and with...

29
Experimental
31 AnnaValentinaHirsch/Web3CodeLLM

Finetuning Starcoder2 to assist the development of decentralised NEAR dApps

29
Experimental
32 Victorletzelter/LoRA-MCL

Multiple Choice Learning of Low Rank Adapters for Language Modeling

28
Experimental
33 gallen881/Physics_Master

Physics Master is a model fine-tuned from llama3-8B-Instruct. It can answer...

27
Experimental
34 LittleLittleCloud/Torchsharp-phi

Torchsharp port of phi-series model

26
Experimental
35 sachink1729/Finetuning-Mistral-7B-Chat-Doctor-Huggingface-LoRA-PEFT

Finetuning Mistral-7B into a Medical Chat Doctor using Huggingface 🤗+ QLoRA + PEFT.

25
Experimental
36 uncase-ai/UNCASE

Open-source framework for turning expert knowledge into PII-free synthetic...

24
Experimental
37 Lichang-Chen/AlpaGasus

A better Alpaca Model Trained with Less Data (only 9k instructions of the...

24
Experimental
38 amjadmajid/llm_toaster

LLM Toaster enables you to train and fine-tune mini-GPTs.

24
Experimental
39 daniau23/LoRAfrica_CPU

Deploying LoRAfrica on consumer CPU devices

22
Experimental
40 mags0ft/simple-sft

Build functionally complete, extremely high-quality SFT datasets for...

22
Experimental
41 robuno/Title-Generator-with-LLM-QLoRa

Fine-tuning LLMs with LoRA to generate titles from the given abstract,...

22
Experimental
42 Siesher/Qwen3_LoRA_pet

🐉 Fine-tuning Qwen3 with LoRA for custom tasks

22
Experimental
43 faezeh-gholamrezaie/Fine-Tuning-Large-Language-Models-for-Sleep-Stage-Classification

Fine-tuning Large Language Models (LLMs) using QLoRA on EEG data for...

21
Experimental
44 zamfir70/transxlab

Training architect CLI — validate and design LLM fine-tuning runs before you...

21
Experimental
45 tomoeOOseven/gptoss120b-qlora-mathreasoning

KrackHack 3.0 submission — Domain: Gen AI | PS: Open Innovation — ...

21
Experimental
46 enoreese/mechanic-gpt

A fine-tuned LLM great at answering questions about car repairs and maintenance.

21
Experimental
47 christinajoslin/faq-generation

CLiFF (Clustering & Language model integration for FAQ Formation)

20
Experimental
48 Shreyash-Gaur/Nyaya-LLM

An ablation study adapting 4B-parameter LLMs (Qwen-2.5, Gemma-3, Phi-4) to...

19
Experimental
49 fb3rasp/finetune-ingest

Ability to finetune LLMs and generate training data using provided documents...

19
Experimental
50 bmaxdk/lightweight-fine-tuning-customer-support

PEFT Customer Support Chatbot

19
Experimental
51 tonyreina/trl

Transformer Reinforcement Learning for Health Generative AI

19
Experimental
52 anujsahani01/PyLoomer

Python Code Completion bot

18
Experimental
53 Pavansomisetty21/Vision_Finetuning_Unsloth_Radiography-Image-Captioning

In this we fine tune Llama-3.2-11B-Vision-Instruct model on...

18
Experimental
54 zekaouinoureddine/BioMed-LLaMa-3

BioMed-LLaMa-3: Instruction-Efficient Fine-Tuning of Large Language Models...

18
Experimental
55 KayvanShah1/UniFAQ

Fine-Tuned LLM-Based FAQ Generation for University Admissions: A project...

17
Experimental
56 shadynasrat/RDMM

RDMM:Fine-Tuned LLM Models for On-Device Robotic Decision Making with...

17
Experimental
57 royxlead/autollmforge-python

Fine-tune any large language model with intelligent QLoRA optimization

17
Experimental
58 xingmingxu/LiteSight

Efficient Chart Summarization with LoRA

17
Experimental
59 apudasm10/region-aware-vlm-finetune

Pipeline for finetuning VLMs with region-aware inputs. Trains on custom...

17
Experimental
60 AkhileshMalthi/selftune

A self-service platform that enables users to fine-tune Large Language...

16
Experimental
61 bshtmichielsen/expert_chat

Using a LoRA to make a LLM talk about a subject I like.

14
Experimental
62 SaniyaBekova/kazakh-llm-finetuning

LLM fine-tuning for Kazakh fairy tale generation using QLoRA, SFT, DPO

14
Experimental
63 cre8vdj/cre8v-ai-finetune

Fine-tune Llama 2 / Mistral with LoRA & QLoRA using PEFT. Runs on free Colab...

14
Experimental
64 Nihal108-bi/Emotion-Aware-Conversational-AI-QLoRA-Fine-Tuned-7B-LLM-

Fine-tuned 7B LLM for empathetic emotional-support dialogue using QLoRA....

14
Experimental
65 flaviengeoffray/loRa-reimplem

A practical reimplementation of the Low-Rank Adaptation (LoRA) paper for...

14
Experimental
66 jasonjiang8866/peft-fine-tuning-recipes-classification

A working recipes for sequential classification finetuning using peft

13
Experimental
67 fabiantoh98/finetune-llm

Fine-tuning LLMs with QLoRA on consumer GPUs — includes training,...

13
Experimental
68 Dhwani-Chande/Natural-Language-to-Bash-Translation-using-LLMs

Fine-tuned Llama-3.2-1B & Qwen2.5-Coder on 40K NL→Bash pairs. Includes...

13
Experimental
69 btboilerplate/Llama-2

Fine-tunes LLaMA-2 using QLoRA for instruction-style text generation,...

13
Experimental
70 YanSte/NLP-LLM-Fine-tuning-QA-LoRA-T5

Natural Language Processing (NLP) and Large Language Models (LLM) with...

13
Experimental
71 myatthukyaw/ft-llm

Finetuning LLMs using Hugging Face

13
Experimental
72 Anonymous-user-00/FLoRIST

Official implementation of FLoRIST: efficient and accurate federated...

13
Experimental
73 123RohitVarshit/FINETUNED_DEEPSEEK-R1

Fine-tuning the DeepSeek-LLM to create a medical expert for advanced...

13
Experimental
74 gamithasam/notion-qwen2.5-1.5B

Fine-tuning notebook for creating a Notion template generator using...

13
Experimental
75 ShubhammS18/finetune-json-extractor

Fine-tuned Qwen2.5-7B on Fireworks AI for structured JSON extraction from...

13
Experimental
76 1nilx2/Deep-Learning

LLM, VLLM Models

13
Experimental
77 BetikuOluwatobi/clinical-instruct-api

Fine-tuned GPT-2 (355M) language model for clinical reasoning tasks.

13
Experimental
78 avishek04/MedLam

A Medical Assistant based on Llama 3.1

11
Experimental
79 0x7o/ae

Scalable code for training and fine-tuning language models on TPUs

11
Experimental
80 haturusinghe/subasa-llm

A task-specific fine-tuning framework for large language models (Llama,...

11
Experimental
81 quamernasim/Fine-Tuning-Mistral-7B-Using-Llama-Factory

Fine-tuning of Mistral-7b using Llama-Factory

11
Experimental
82 soheil-mp/Llama2

Fine-tuning the Llama2 model

11
Experimental
83 vimarsh6739/DejaVu-llama

Exploring contextual sparsity in Llama2

11
Experimental
84 SinnieOnFire/jsonl-finetune

Python script to transform a set of localization .json files into a .jsonl...

11
Experimental
85 NamrataThakur/Fine-tuning-LLMs-Strategies

Different Strategies to Fine-Tune a Large Language Model. We cover 4...

11
Experimental
86 nsrinidhibhat/fine-tune-llama-2

This project streamlines the fine-tuning process, enabling you to leverage...

11
Experimental
87 vritansh/talk-to-you-now-llm

LLM Finetuning : falcon 7 Billion Model trained on Mental Health conversations

11
Experimental
88 nayeem01/fine-tuning-llama

Fine tuning llama3.1 8b with unsloth

11
Experimental
89 ajf1016/Fine-Tuning-Qwen1.5-0.5B

Fine Tuning Qwen1.5-0.5B LLM with India Law | Indian Legal Acts | Penal Code...

11
Experimental
90 0x11c11e/the-art-of-fine-tuning

This repository houses a wealth of resources on the fine-tuning of large...

11
Experimental
91 thillai-c/MediQuill-llama2

A model fine tuned on llama-2 to solve medical queries

11
Experimental
92 trjo1/genaiwithllms

Fine-tuned FLAN T-5 using Instruction Fine-Tuning (Full), LoRA-based PEFT,...

11
Experimental
93 Maximo-Rulli/PoLLiBLOOM

Fine-tuning BLOOM to generate Polimi style physics excercises

11
Experimental
94 Oyebamiji-Micheal/Llama-for-UTME-preparation

Fine-tuning Llama on past UTME questions using unsloth

11
Experimental
95 Jaskirat-singh04/Tunewizard

This is the official Github Repo for Tunewizard-GUI Based Fine-Tuning of...

11
Experimental
96 maidacundo/falcon-7b-sql

Implementation for fine-tuning a Falcon-7b model using QLoRA on the Spider...

10
Experimental
97 naveen-v-v/LLM_fine_tune_lora

Fine tune a Large Language Model using LORA to perform Sentiment Analysis

10
Experimental
98 Seanaaa0/GPT-CoT

Fine-tuning Phi-2 with LoRA for grid-based spatial reasoning and...

10
Experimental
99 sdpetrides/t5x-train-and-test

Pre-training and fine-tuning experiments with T5

10
Experimental
100 Ravi-Teja-konda/TunedLlavaDelights

Explore the rich flavors of Indian desserts with TunedLlavaDelights....

10
Experimental