LoRA QLoRA Fine-tuning Transformer Models

Tools and frameworks for parameter-efficient fine-tuning of LLMs using LoRA, QLoRA, and related subspace tuning methods on consumer hardware. Does NOT include general fine-tuning without these specific techniques, model compression, or task-specific applications unless they primarily demonstrate these adaptation methods.

There are 230 lora qlora fine-tuning models tracked. 5 score above 70 (verified tier). The highest-rated is unslothai/unsloth at 81/100 with 53,879 stars. 7 of the top 10 are actively maintained.

Get all 230 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=lora-qlora-fine-tuning&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 unslothai/unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss,...

81
Verified
2 huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

80
Verified
3 modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5,...

78
Verified
4 oumi-ai/oumi

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any...

77
Verified
5 linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

77
Verified
6 hiyouga/LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

67
Established
7 stochasticai/xTuring

Build, personalize and control your own LLMs. From data pre-processing to...

64
Established
8 h2oai/h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs....

63
Established
9 dvgodoy/FineTuningLLMs

Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with...

57
Established
10 roboflow/maestro

streamline the fine-tuning process for multimodal models: PaliGemma 2,...

55
Established
11 TUDB-Labs/mLoRA

An Efficient "Factory" to Build Multiple LoRA Adapters

48
Emerging
12 mallorbc/Finetune_LLMs

Repo for fine-tuning Casual LLMs

48
Emerging
13 predibase/lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

47
Emerging
14 Chongjie-Si/Subspace-Tuning

A generalized framework for subspace tuning methods in parameter efficient...

46
Emerging
15 ashishpatel26/LLM-Finetuning

LLM Finetuning with peft

45
Emerging
16 Bavest/fin-llama

LLAMA specialized on finance

45
Emerging
17 lxe/simple-llm-finetuner

Simple UI for LLM Model Finetuning

44
Emerging
18 jianzhnie/LLamaTuner

Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen,...

44
Emerging
19 kamalkraj/e5-mistral-7b-instruct

Finetune mistral-7b-instruct for sentence embeddings

44
Emerging
20 GAIR-NLP/MegaScience

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

43
Emerging
21 jasonvanf/llama-trl

LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA

43
Emerging
22 AviSoori1x/Tuning-the-Finetuning

Tuning the Finetuning: An exploration of achieving success with QLoRA

43
Emerging
23 ethicalabs-ai/kurtis

Kurtis is a fine-tuning, inference and evaluation tool built for SLMs (Small...

43
Emerging
24 ruimalheiro/training-custom-llama

Llama-style transformer in PyTorch with multi-node / multi-GPU training....

43
Emerging
25 ssbuild/deep_training

deep learning

42
Emerging
26 zetavg/LLaMA-LoRA-Tuner

UI tool for fine-tuning and testing your own LoRA models base on LLaMA,...

42
Emerging
27 Gunale0926/SORSA

SORSA: Singular Values and Orthonormal Regularized Singular Vectors...

42
Emerging
28 leehanchung/lora-instruct

Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA

42
Emerging
29 okuvshynov/slowllama

Finetune llama2-70b and codellama on MacBook Air without quantization

41
Emerging
30 Guitaricet/relora

Official code for ReLoRA from the paper Stack More Layers Differently:...

41
Emerging
31 Beomi/Gemma-EasyLM

Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)

41
Emerging
32 princeton-nlp/LESS

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

41
Emerging
33 wuwangzhang1216/prometheus

Fully automatic censorship removal for language models. LoRA abliteration +...

40
Emerging
34 google-deepmind/gemma_penzai

A JAX Research Toolkit for Visualizing, Manipulating, and Understanding...

40
Emerging
35 git-cloner/llama-lora-fine-tuning

llama fine-tuning with lora

40
Emerging
36 Shannon-Labs/shannon-control-unit

Shannon Control Unit: Adaptive regularization via control theory for LLM training

39
Emerging
37 misonsky/HiFT

memory-efficient fine-tuning; support 24G GPU memory fine-tuning 7B

39
Emerging
38 shrut2702/upasak

UI-based Fine-Tuning for Large Language Models (LLMs)

39
Emerging
39 MURUGESAN88709/mental-health-finetuned-llama

🧠 Fine-tune LLaMA for mental health applications, providing insights and...

39
Emerging
40 GAIR-NLP/OctoThinker

Revisiting Mid-training in the Era of Reinforcement Learning Scaling

39
Emerging
41 ariG23498/gemma3-object-detection

Fine tune Gemma 3 on an object detection task

39
Emerging
42 deep-div/Fine-Tuning-LLMs-and-VisionModels

Fine-Tuning LLMs (Gemma, LLaMA, Mistral, etc.) A practical guide to...

38
Emerging
43 minosvasilias/godot-dodo

Finetuning large language models for GDScript generation.

38
Emerging
44 NisaarAgharia/Indian-LawyerGPT

Fine-Tuning Falcon-7B, LLAMA 2 with QLoRA to create an advanced AI model...

38
Emerging
45 muhammad-fiaz/finetune-web-ui

Finetune Web UI is a user-interface for training and deploying pre-trained models.

37
Emerging
46 benitomartin/food-images-finetuning

Fine-tuning of LiquidAI LFM2-VL vision-language models on food image...

37
Emerging
47 YassWorks/Tuna

Python library that makes fine-tuning transformer-based models easier and faster.

37
Emerging
48 poteminr/instruct-ner

Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models...

37
Emerging
49 avocardio/Zicklein

Finetuning instruct-LLaMA on german datasets.

36
Emerging
50 frankluise5220/ComfyUI-Lorahelper

A professional automation toolkit for ComfyUI to prepare LoRA training data...

35
Emerging
51 jorgemunozl/Finetunning-Llama-Vision-11b

Inference and finnetunning of a VLM (LLama Vision 11b) using the Unsloth,...

34
Emerging
52 michaelnny/QLoRA-LLM

A simple custom QLoRA implementation for fine-tuning a language model (LLM)...

34
Emerging
53 adithya-s-k/CompanionLLM

CompanionLLM - A framework to finetune LLMs to be your own sentient...

33
Emerging
54 rabiloo/llm-finetuning

Sample for Fine-Tuning LLMs & VLMs

33
Emerging
55 UCDvision/NOLA

Code for NOLA, an implementation of "nola: Compressing LoRA using Linear...

33
Emerging
56 monk1337/NanoPeft

The simplest repository & Neat implementation of different Lora methods for...

33
Emerging
57 srsawant34/efficient_instruction_learning

Code base for the paper "Instruction Tuned Models are Quick Learners".

33
Emerging
58 babycommando/neuralgraffiti

Live-bending a foundation model’s output at neural network level.

32
Emerging
59 Simplifine-gamedev/Simplifine

🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud...

32
Emerging
60 MaxwellYaoNi/PACE

[NeurIPS 2024 Spotlight] Official implementation for "PACE: marrying...

32
Emerging
61 Joyce94/LLM-RLHF-Tuning

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

31
Emerging
62 ssbuild/llm_finetuning

Large language Model fintuning bloom , opt , gpt, gpt2...

30
Emerging
63 EvilFreelancer/MoDA

Is a framework designed to enhance the performance and flexibility of large...

30
Emerging
64 yunkai1841/recipe-generation

NLP Text generation task. Generate recipe by fine tuned LLaMA model.

30
Emerging
65 calhounpaul/LLaMA-PEFT-LoRa-subreddit-chatbot-colab

Parameter Efficient Fine Tuning (PEFT) to create a chatbot from Facebook's...

30
Emerging
66 fshnkarimi/Fine-tuning-an-LLM-using-LoRA

📚 Text Classification with LoRA (Low-Rank Adaptation) of Language Models -...

30
Emerging
67 CoffeeVampir3/ez-trainer

Train Llama Loras Easily

29
Experimental
68 awilliamson10/clipora

Clipora is a powerful toolkit for fine-tuning OpenCLIP models using Low Rank...

29
Experimental
69 gmongaras/Wizard_QLoRA_Finetuning

Finetuning Some Wizard Models With QLoRA

29
Experimental
70 Cyr-Ch/lora-qlora-101

Parameter-efficient fine-tuning toolkit for LLMs using LoRA and QLoRA....

29
Experimental
71 HemantBK/LLaMA-Sum-Fine-Tuning

Fine-tuned Meta's LLaMA 3.2 1B for text summarization using QLoRA (4-bit...

28
Experimental
72 LegendLeoChen/llm-finetune

使用trl、peft、transformers等库,实现对huggingface上模型的微调。

28
Experimental
73 Beomi/easy-lm-trainer

🤗 최소한의 세팅으로 LM을 학습하기 위한 샘플코드

28
Experimental
74 Cre4T3Tiv3/unsloth-llama3-alpaca-lora

Advanced 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with...

28
Experimental
75 plutonium-239/memsave_torch

Lowering PyTorch's Memory Consumption for Selective Differentiation

27
Experimental
76 AbineshSivakumar/Llama-2-7B-QLoRA-Vicuna

This repository contains code to fine-tune a Llama-7B-Uncensored model using...

27
Experimental
77 ArchitJ6/Llama2-FineTuning

🦙 Llama2-FineTuning: Fine-tune LLAMA 2 with Custom Datasets Using LoRA and...

27
Experimental
78 Miraclemarvel55/LLaMA-MOSS-RLHF-LoRA

用RLHF可选LoRA对LLaMA和MOSS进行训练|Training LLaMA or MOSS with RLHF [LoRA]

26
Experimental
79 illoonego/gemma-finetune-emails

LoRA fine-tuning pipeline for Google’s Gemma-2B language model to classify...

26
Experimental
80 RangiLyu/llama.mmengine

Training LLaMA language model with MMEngine! It supports LoRA fine-tuning!

26
Experimental
81 Jiacheng-Zhu-AIML/AsymmetryLoRA

Preprint: Asymmetry in Low-Rank Adapters of Foundation Models

25
Experimental
82 DrRuin/Lightweight-Fine-Tuning

Lightweight fine-tuning is one of the most important techniques for adapting...

25
Experimental
83 yuki-2025/llama3-8b-fine-tuning-math

Fine-Tuning Llama 3-8B for Structured Math Reasoning: Fine-tuning Llama3 8b...

25
Experimental
84 PRITHIVSAKTHIUR/Qwen-Image-LoRA-DLC

Qwen-Image model with various LoRA (Low-Rank Adaptation) styles. This tool...

24
Experimental
85 Nutanpatil06/Fine-Tuning-LLM-with-LLaMA-Factory

Complete LoRA/QLoRA implementation using LLaMA Factory. Fine-tune models...

24
Experimental
86 opencodeiiita/Finetuning_Llama

Fine-Tuning LLaMA for Indian Laws

24
Experimental
87 Josephrp/SmolFactory

finetune gpt-oss and smollm3 on your data easily and cheaply

24
Experimental
88 mirzayasirabdullahbaig07/Fine-Tuning-LLaMA-3.2-3B-Using-PEFT-LoRA

This project showcases parameter-efficient fine-tuning of the LLaMA 3.2 (3B)...

24
Experimental
89 Md-Emon-Hasan/Fine-Tuning

End-to-end fine-tuning of Hugging Face models using LoRA, QLoRA,...

23
Experimental
90 viktor-shcherb/llm-tool-call-sft

LoRA fine-tuning pipeline for tool-calling chat LLMs with config-driven...

23
Experimental
91 ethicalabs-ai/FlowerTune-Qwen2.5-Coder-0.5B-Instruct

FlowerTune LLM on Coding Dataset

23
Experimental
92 MakazhanAlpamys/Soup

Soup turns the pain of LLM fine-tuning into a simple workflow. One config,...

23
Experimental
93 nakjun/Llama-3.2-1B-Instruct-KorQuAD-Finetune

Llama-3.2-1B-Instruct model and KorQuAD dataset Finetune project

22
Experimental
94 Kentakoong/optimize-finetuning-llm

A repository where we Optimize the Fine-tuning process of LLMs in LANTA

22
Experimental
95 MarinaComotti/Finance_Specialist_AI

Fine-Tuning Llama 3 for Financial Question-Answering

22
Experimental
96 sandeeppanem/qwen3-resume-extraction

Fine-tune Qwen3-0.6B for resume parsing using LoRA

22
Experimental
97 AswaniSahoo/llama-task-agent

Fine-tuned LLaMA-3.1-8B task agent with LoRA for reliable tool execution

22
Experimental
98 NamelyCorp/NamelyCorp-LLM-Studio

Local-first LoRA fine-tuning studio with web UI for document-grounded LLM training.

22
Experimental
99 mamounyosef/commit-message-llm

Fine-tuning Qwen2.5-Coder-0.5B LLM using QLoRA (4-bit quantization + LoRA)...

22
Experimental
100 xHarshit/Self-Healing-Classification-DAG-with-Fine-Tuned-Model

A self-healing text classification pipeline built with LangGraph and a...

22
Experimental
101 matteo-stat/transformers-llm-llama3.1-fine-tuning-qlora

This repo offers scripts for fine-tuning LLaMA 3.1 models with QLoRA,...

22
Experimental
102 benitomartin/peft-gemma-2b

Fine Tuning Gemma 2B

22
Experimental
103 Geenukaneco/NamelyCorp-LLM-Studio

📄 Build document-grounded language models with ease using NamelyCorp LLM...

22
Experimental
104 Thariya13/medical-llm-lora

🧠 Fine-tuning a medical reasoning LLM with LoRA 🚀 — Step-by-step project to...

22
Experimental
105 kossisoroyce/Gemma-3n-local-training

A lightweight, GPU-focused framework to run inference and LoRA fine-tuning...

22
Experimental
106 artryazanov/nitrogen-finetuner

This project implements a Universal Fine-Tuning Pipeline for the NVIDIA...

22
Experimental
107 dasdristanta13/LLM-Lora-PEFT_accumulate

LLM-Lora-PEFT_accumulate explores optimizations for Large Language Models...

22
Experimental
108 aman-17/MediSOAP

FineTuning LLMs on conversational medical dataset.

22
Experimental
109 rizalsimb1/ml-monitoring

Fine-tune large language models (Llama 3, Mistral, Phi-3) with LoRA and...

21
Experimental
110 PRITHIVSAKTHIUR/Qwen-Image-Edit-2511-LoRAs-Fast-Single-Image-Rerun

Experimental demonstration for the Qwen/Qwen-Image-Edit-2511 model with...

21
Experimental
111 shuhulx/FineTuneCheck

Diagnostic tool for LLM fine-tuning — automated forgetting detection,...

21
Experimental
112 Abhijeet-ist/FineTunning

This is a short script based on fine tuning a open sourced LLM based on...

21
Experimental
113 zachdwight/lora-model-builder-for-AI-chef

Fine-tune any Hugging Face LLM on your own recipe dataset using LoRA + 4-bit...

21
Experimental
114 min234/se_game

LLaMA fine-tuned AI — YouTube script generator & code review assistant

21
Experimental
115 BUAADreamer/MLLM-Finetuning-Demo

使用LLaMA-Factory微调多模态大语言模型的示例代码 Demo of Finetuning Multimodal LLM with LLaMA-Factory

21
Experimental
116 ducnh279/LLMs-for-Text-Classification

Fine-tuning Large Language Models (LLMs) for Text Classification Task

21
Experimental
117 GeiserX/epub-and-vtt-to-llm

Given .epub and .vtt files, train and infere a new LLM

21
Experimental
118 alexsuw/easylora

Batteries-included toolkit for LoRA / QLoRA fine-tuning with Hugging Face...

21
Experimental
119 Saeed-Mahmoud/lora-peft-customer-support-chatbot

Parameter-efficient fine-tuning (LoRA/PEFT) for a customer support chatbot...

21
Experimental
120 harshitdhar9/gemma_peft_finetuned

Fine-tuning gemma model using peft techniques

21
Experimental
121 LoicSERRE/mistral-qlora-decp

Fine-tuning Mistral 7B v0.3 avec QLoRA sur données publiques territoriales...

21
Experimental
122 videogramme/Qwen-Image-Edit-2511-LoRAs-Fast-Single-Image-Rerun

🖼️ Edit images swiftly with the Qwen-Image-Edit-2511 model, featuring...

21
Experimental
123 danloi2/laesla-llm

laesla-LLM is a Fine-tunes Meta NLLB-200 with MASSIVE DATASETS (OPUS Bible +...

21
Experimental
124 keresifon/igbo-model-training

Igbo-English translation model training using AWS SageMaker. Fine-tuned...

21
Experimental
125 MahmoudAbusaqer/LLMs-Lora-Finetuning-vs-Zeroshot-Classification

Official implementation comparing parameter-efficient LoRA fine-tuning...

21
Experimental
126 A-SHOJAEI/MoLE-LoRA

MoLE: Mixture of LoRA Experts - Parameter-efficient mixture-of-experts using...

21
Experimental
127 lionajuanabel/Fine-Dllm

LoRA fine-tuning pipeline for tool-calling chat LLMs with config-driven...

21
Experimental
128 Muhammad-Hammad-59/Qwen05B-lora-qlora-finetuning-for-customer-support

Parameter-efficient fine-tuning (LoRA + QLoRA) of Qwen2.5-0.5B-Instruct for...

21
Experimental
129 kantkrishan0206-crypto/LoRAForge-

Build a production‑grade, modular pipeline for fine‑tuning large language...

20
Experimental
130 ShadowMonarchX/Finetuning-LLM-main

A practical repo for fine-tuning LLMs using QLoRA, PEFT, and other efficient...

20
Experimental
131 mazurkin/ptn

train own virtual "PTN" LLM model

20
Experimental
132 Spectrewolf8/PHi-3-SQL-generation-fine-tune-experiment

A fine-tuned version of Phi-3-mini-4k-instruct for generating SQL queries...

20
Experimental
133 Ashish-kharde1/Micro-Reasoner-Qwen

Lightweight reasoning-capable LLM built on Qwen3-4B using LoRA and 4-bit inference

20
Experimental
134 pramodkoujalagi/SmolLM2-360M-Instruct-Text-2-JSON

A fine-tuned version of SmolLM2-360M-Instruct-bnb-4bit specialized for...

20
Experimental
135 Pavansomisetty21/Qwen2-Vision-Finetuning-Unsloth---Maths-OCR-Formulae-Extraction-

we finetune unsloth llama model to extract mathematical fomulas in the...

19
Experimental
136 Shengwei-Peng/Classical-Chinese-Translation

A project for bidirectional translation between Classical Chinese and modern...

19
Experimental
137 lucaslingle/e-lra

Streamlined variant of Long-Range Arena with pinned dependencies, automated...

19
Experimental
138 Logisx/LLMath-QLoRA

🧮 End-to-end LLM instruction finetuning based on PEFT & QLoRA to solve math problems.

19
Experimental
139 arham-kk/llama2-qlora-sft

This model is a fine-tuned model based on the...

19
Experimental
140 Raxephion/loRA-Epoch-Analyser

A Python script to analyze images generated at different epochs of LoRA...

19
Experimental
141 ethicalabs-ai/BlossomTuneLLM

Federated Supervised Fine-Tuning for Small Language Models (SLMs)

19
Experimental
142 ankraj1234/MediGuide

Comparing QLoRA, Prompt & Prefix Tuning on Mistral-7B for medical...

19
Experimental
143 Pavansomisetty21/LlamaNLP-Unsloth-Next-Gen-Text-Processing-with-Llama-and-Unsloth

In this we generate NER ,Question Answering and text generation using...

18
Experimental
144 doem97/ICLR26_mtLoRA

[ICLR 2026] Official implementation (Claude Agent reproduce supported) of...

18
Experimental
145 Pavansomisetty21/Finetune-llama-model-for-Text-Generation-using-unsloth

In this we finetune Llama-3.2-3B-Instruct model for text generation using unsloth

18
Experimental
146 Souptik96/efficient-domain-tuning

Research paper on efficient fine tuning of small sized open source models...

18
Experimental
147 arjunravi26/mental-health-finetuned-llama

A LLM(llama) finetuned for work well with mental health assistance

18
Experimental
148 Vitgracer/LLM-lab

LLM research lab: exploring fine-tuning, base models, architectures and more

18
Experimental
149 prasanna00019/Fine-Tuning-LLMs

Fine-tuning GPT-2 and other models on tasks like classification, translation...

18
Experimental
150 Seanaaa0/QT-R1

STaR × S1 math pipeline on Qwen2.5-1.5B. LoRA, strict Final: format, ~20–30%...

18
Experimental
151 makr-code/VCC-Clara

Clara AI System - Machine learning with continuous training, multi-GPU...

17
Experimental
152 FlosMume/LLAMA-qLoRA-Unsloth-Starter

Fine-tuning Llama models with QLoRA using Unsloth for supervised instruction tasks

17
Experimental
153 North-Shore-AI/hf_peft_ex

Elixir port of HuggingFace's PEFT (Parameter-Efficient Fine-Tuning) library....

17
Experimental
154 gsmoon97/llm-semantic-understanding

A comprehensive framework for fine-tuning and evaluating Large Language...

17
Experimental
155 rplacucci/PEFT

PyTorch implementation of the most common Parameter Efficient Fine-Tuning...

17
Experimental
156 dean-brown1/sb_poc_V4

SchemaBank: 3x improvement over LoRA via sparse routing as training...

17
Experimental
157 hasanhalacli/llama-3.2-finetuning

Fine-tune Llama 3.2 with QLoRA - memory-efficient training, Alpaca/ShareGPT...

17
Experimental
158 yuchengml/Adaptation-Tuning-PEFT

Comparison of different adaptation methods on PEFT for fine-tuning...

17
Experimental
159 sunil-dhaka/finetuning-llms

an exercise in finetuning pythia suites of open source models for...

17
Experimental
160 green5321/efficient_tuning_llms_python

This is the repo for the Efficient Finetuning of Quantized LLMs project,...

17
Experimental
161 sagarvk24/Fine-Tune-LLMs-

This repository contains comprehensive notebooks, made by me, explaining how...

17
Experimental
162 alenzenx/WindowsEasyFinetuneLLM

Finetune LLM using Torchtune on Windows

17
Experimental
163 hubertik1/emotion-prediction-finetuning

Fine-tune Qwen2.5-VL-7B with LoRA to predict human-rated emotion intensity...

16
Experimental
164 Franekskc/gemma3-qa-finetuning

Comparing Full Fine-Tuning, LoRA, and Layer Freezing for extractive QA on...

16
Experimental
165 Samarth2001/LLM-Fine-tuning

Parameter-efficient fine-tuning experiments for 7B LLMs on consumer...

15
Experimental
166 murapadev/Phinetuning

A repository dedicated to finetuning phi2 models using advanced machine...

15
Experimental
167 quocnhut134/Finetuning-LLM-Model-for-Intent-Classification-in-Banking

Fine-tuning Large Language Models (LLMs) for precise customer intent...

15
Experimental
168 edersoncorbari/fine-tune-llm

Demonstrate how to fine-tune a pre-trained LLM

15
Experimental
169 krishnakoushik225/ecg-peft-benchmark

Benchmarking PEFT (LoRA vs adapters) for ECG segment classification using...

14
Experimental
170 Josh396s/BERT-Contrastive-LoRA

Optimizing BERT for intent classification on the Amazon Massive dataset...

14
Experimental
171 rawatshaurya/LORA-vs-QLORA

Reasoning-style fine-tuning of an instruction LLM using LoRA vs QLoRA,...

14
Experimental
172 nehamaheshh/Reasoning-style-fine-tuning-PEFT

LoRA vs QLoRA fine-tuning on CommonsenseQA measuring accuracy, GPU memory,...

13
Experimental
173 Accrame/finllm-sentiment

Fine-tuning LLMs for financial sentiment analysis with QLoRA

13
Experimental
174 shubham5027/LLM-Finetuning

This repository hands-on fine-tuning experiments on Large Language Models...

13
Experimental
175 BharathiDonku7/Fine_tune_with_Mistral_with_QLORA_PEFT

Fine-tuning Mistral-7B and LLaMA using QLoRA & PEFT for efficient LLM...

13
Experimental
176 krutarth3238/slm-lora-finetuning

LoRA-based fine-tuning of GPT-2 using multi-domain datasets with PEFT and...

13
Experimental
177 aneessaheba/peft-banking-classifier

Fine-tunes Google FLAN-T5-base with LoRA (PEFT) on Banking77 to classify 77...

13
Experimental
178 kaiser-data/llm-finetune-kit

🚀 Beginner-friendly Python library for fine-tuning LLMs. 3-line training,...

13
Experimental
179 Himanshu0508Raturi/Fine_Tuning-LLM

This repository shows how to fine tune Llama3.2-Instruct model on custom...

13
Experimental
180 nikolareljin/finetorch

Rust-native LLM finetuning toolkit for LoRA/QLoRA, dataset preparation,...

13
Experimental
181 swati-mishra07/mcq-rag-app

LoRA fine-tuned FLAN-T5 model for automatic MCQ generation with evaluation...

13
Experimental
182 Desire32/lora-ml-transfomers

LoRA / RAG fine-tuning

13
Experimental
183 azadero/Emotion-Classification-DistilBERT-LoRA

A deep learning project focused on fine-tuning DistilBERT for emotion...

13
Experimental
184 meleknurb/llm-finetuning-with-qlora

Codecademy project focused on fine-tuning an LLM using QLoRA with the...

13
Experimental
185 hsb943/lora-tinyllama-finetuning

End-to-end LoRA fine-tuning of TinyLlama (1.1B) using 4-bit quantization on...

13
Experimental
186 anishdulal/llm-cuad-eval

Evaluation of Llama 3.2 3B on CUAD Dataset - before and after finetuning

13
Experimental
187 PrateekKacham/mistral-7b-text2sql-finetuning

Fine-tuning Mistral 7B for Text-to-SQL generation using QLoRA — 200%...

13
Experimental
188 Travor278/SEED-LLaVA

Single-GPU reproduction of SEED for hallucination mitigation in...

13
Experimental
189 alfredang/finetuning-llm-huggingface

🤖 Fine-tune Qwen3-0.6B for IT support ticket routing using LoRA + Unsloth....

13
Experimental
190 johnayoung/eth-finetuning-cookbook

Educational cookbook for fine-tuning LLMs on Ethereum transaction data using QLoRA

13
Experimental
191 punyamodi/lora-finetune-studio

Full-stack LoRA fine-tuning studio for large language models with Gradio UI,...

13
Experimental
192 Dhyani2206/Domain_Specialized_LLaMA

Fine-tuning LLaMA-3, Mistral-7B, and Phi-3 using QLoRA on a curated Data...

13
Experimental
193 SCCSMARTCODE/Deep-Learning-03-LLM-FineTuning

Scalable and modular framework for fine-tuning large language models (LLMs)...

13
Experimental
194 mxagar/llm_peft_fine_tuning_example

Example project in which a Large Language Model is fine-tuned using PEFT.

13
Experimental
195 IlyyinKashaf/MarketingMuse

Fine-tuned TinyLlama-1.1B (Decoder-Only) via 3-phase training (domain...

13
Experimental
196 sdtrkl/lightweight-fine-tuning

This project is part of Generative AI Nanodegree by Udacity

13
Experimental
197 Mo-Shakib/llama3.2-3b-lora-finetuning-kit

Fast, memory-efficient LoRA fine-tuning toolkit for...

13
Experimental
198 SamsungSAILMontreal/mulo

μLO: Compute-Efficient Meta-Generalization of Learned Optimizers [to appear...

13
Experimental
199 rizalsimb1/context-manager

Fine-tune large language models (Llama 3, Mistral, Phi-3) with LoRA and...

13
Experimental
200 anilsrml/LLM-FineTuning-QLora

Kumru-2B büyük dil modelinin, tıbbi veriler (TUS sınavı) üzerinde QLoRA...

13
Experimental
201 Rishi625/LLM-Finetune-Pipeline

Production-grade ML pipeline for Llama 3.2 fine-tuning with LoRA/QLoRA,...

13
Experimental
202 ZeeetOne/bioinstruct-finetuning-experiment

LoRA fine-tuning experiment: Llama-3.2-1B-Instruct + BioInstruct dataset...

13
Experimental
203 soniatyburczy/llama2-qlora-sft-coverletter-project

Implementation of a task-specific QLoRA supervised fine-tuning pipeline for...

12
Experimental
204 aasherkamal216/15_Days_Fine_Tuning_Challenge

A collection of notebooks for fine-tuning LLMs using Unsloth AI

12
Experimental
205 azizdeniz890/gemma-medical-qa-lora

This project presents a medical question–answering language model built by...

12
Experimental
206 notdanna/GPoemsT

A modular Transformer-based architecture for constrained poetic generation,...

12
Experimental
207 nabeelshan78/math-vlm-finetune-pipeline

A production-ready, modular fine-tuning pipeline for converting handwritten...

12
Experimental
208 programindz/lora-vit-finetuning

Fine-Tuning Google's Vision Transformer LoRA technique. Two different LoRA...

12
Experimental
209 Sid3503/LoRA

A beginner-friendly guide to Low-Rank Adaptation (LoRA) - the efficient...

11
Experimental
210 mominalix/LLM-Finetuning-Pipeline-LoRA-QLoRA

Production-ready pipeline for fine-tuning Large Language Models using...

11
Experimental
211 benhaotang/LoRa-Finetune-Navi

finetune models with AMD navi cards like 7700/7800/7900XT

11
Experimental
212 hugoaslm/Mesh-Generation-Medical-Shapes

Efficient LLaMA-Mesh fine-tuning to produce printable medical meshes

11
Experimental
213 andresnowak/mnlp_mcqa_model

Finetuning of Qwen3 0.6B for MCQA tasks

11
Experimental
214 RATHOD-SHUBHAM/Finetuning-LLMs

This repository contains experiments on fine-tuning LLMs (Llama, Llama3.1,...

11
Experimental
215 AlbertChoo/NLP-Projects

Hmm, projects with, transformers, pre-trained model, finetuning LLM using...

11
Experimental
216 tien02/llm-math

Fine tune Large Language Model on Mathematic dataset

11
Experimental
217 Reason-Wang/InstructLLM

The official implementation of paper "Demystifying Instruction Mixing for...

11
Experimental
218 Morsinaldo/GAIND-Light-Weight-Fine-Tuning

This repository contains the Light Weight Fine Tuning project implementation...

11
Experimental
219 arjuntheprogrammer/RLHF-LoRA-PyTorch-Llama-3.1-8B

Full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware.

11
Experimental
220 thekaranacharya/llm-fine-tuning

Comparing popular Parameter Efficient Fine-Tuning (PEFT) techniques for...

11
Experimental
221 flozi00/simplepeft

An simple trainer for efficient finetuning large models on different tasks

11
Experimental
222 veydantkatyal/Llama-LoRA-FineTuning

Practical guide to fine-tuning the LLaMA model for dialogue summarisation...

11
Experimental
223 dannylee1020/panim

Generate manim code with LLM

10
Experimental
224 antonio-f/llama2_colab

Fine-Tune Your Own Llama 2 Model LOCALLY in a Colab Notebook

10
Experimental
225 chanupadeshan/LoRA-Fine-Tune

Fine-tuning TinyLlama-1.1B-Chat using LoRA on the GSM8K dataset for solving...

10
Experimental
226 KindYAK/kaggle_20q

Solution for Kaggle 20 Questions competetion

10
Experimental
227 Alessio2405/fine_tuning_llama_2_Xb

Python script to fine tune LLaMA-Xb with your custom data.

10
Experimental
228 dhdbsrlw/Instruct-Tune-LLaMA-with-PEFT-Techniques

COSE474 DL Project (Prof. Hyun-Woo Kim)

10
Experimental
229 Gholamrezadar/finetune-llama-3.2-qlora-text2sql

Finetuning Llama-3.2-3B on Text2SQL using QLoRA

10
Experimental
230 nebrelbug/llm_trainer

Comprehensible scripts to instruction-tune a LLaMA model

10
Experimental