LLM Fine-Tuning Transformer Models

There are 212 LLM fine-tuning models tracked. Four score above 50 (Established tier). The highest-rated is OptimalScale/LMFlow at 59/100, with 8,489 stars. One of the top 10 is actively maintained.

Get all 212 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-fine-tuning&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
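
The curl request above can also be issued from a script. The following is a minimal Python sketch using only the standard library; it assumes the endpoint returns a JSON body, whose exact shape is not documented here. The helper names `build_url` and `fetch_projects` are hypothetical, and whether `limit` accepts values larger than the 20 shown in the curl command is not stated, so the default mirrors that command.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken verbatim from the curl example in this listing.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(limit=20):
    """Build the query URL with the same parameters as the curl example."""
    params = {
        "domain": "transformers",
        "subcategory": "llm-fine-tuning",
        "limit": limit,
    }
    return f"{BASE_URL}?{urllib.parse.urlencode(params)}"


def fetch_projects(limit=20, timeout=30):
    """Fetch and decode the JSON response (hypothetical helper).

    The response schema is an unknown here, so the decoded object is
    returned as-is rather than assuming any particular keys.
    """
    with urllib.request.urlopen(build_url(limit), timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_projects()` performs one request, which counts against the 100 requests/day keyless allowance, so caching the result locally is probably sensible when iterating on a script.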

# Model Score Tier
1 OptimalScale/LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation...

59
Established
2 adithya-s-k/AI-Engineering.academy

Mastering Applied AI, One Concept at a Time

57
Established
3 jax-ml/jax-llm-examples

Minimal yet performant LLM examples in pure JAX

53
Established
4 young-geng/scalax

A simple library for scaling up JAX programs

52
Established
5 riyanshibohra/TuneKit

Upload your data → Get a fine-tuned SLM. Free.

49
Emerging
6 JIA-Lab-research/LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

46
Emerging
7 georgian-io/LLM-Finetuning-Toolkit

Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.

46
Emerging
8 kyegomez/Finetuning-Suite

Finetune any model on HF in less than 30 seconds

45
Emerging
9 MaximeRobeyns/bayesian_lora

Bayesian Low-Rank Adaptation for Large Language Models

45
Emerging
10 NVlabs/EoRA

[ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with...

45
Emerging
11 ZinYY/TreeLoRA

A pytorch implementation of the paper "TreeLoRA: Efficient Continual...

44
Emerging
12 SakanaAI/text-to-lora

Hypernetworks that adapt LLMs for specific benchmark tasks using only...

44
Emerging
13 rohan-paul/LLM-FineTuning-Large-Language-Models

LLM (Large Language Model) FineTuning

43
Emerging
14 SensAI-PT/LLaMa2lang

Convenience scripts to finetune (chat-)LLaMa3 and other models for any language

42
Emerging
15 A-baoYang/alpaca-7b-chinese

Finetune LLaMA-7B with Chinese instruction datasets

42
Emerging
16 VectorInstitute/vectorlm

LLM finetuning in resource-constrained environments.

41
Emerging
17 bigscience-workshop/xmtf

Crosslingual Generalization through Multitask Finetuning

41
Emerging
18 NVlabs/DoRA

[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed...

41
Emerging
19 liuqidong07/MOELoRA-peft

[SIGIR'24] The official implementation code of MOELoRA.

41
Emerging
20 punica-ai/punica

Serving multiple LoRA finetuned LLM as one

41
Emerging
21 sandy1990418/Finetune-Qwen2.5-VL

Fine-tuning Qwen2.5-VL for vision-language tasks | Optimized for Vision...

41
Emerging
22 architkaila/Fine-Tuning-LLMs-for-Medical-Entity-Extraction

Exploring the potential of fine-tuning Large Language Models (LLMs) like...

40
Emerging
23 molbal/llm-text-completion-finetune

Guide on text completion large language model fine-tuning, including example...

40
Emerging
24 rasbt/blog-finetuning-llama-adapters

Supplementary material for "Understanding Parameter-Efficient Finetuning of...

40
Emerging
25 metriccoders/one-line-llm-tuner

This repository is the source code for fine tuning any LLM in just one line 🔥

40
Emerging
26 AlexandrosChrtn/llama-fine-tune-guide

Fine-tune the newly released Llama-3.2 lightweight models.

40
Emerging
27 rasbt/dora-from-scratch

LoRA and DoRA from Scratch Implementations

39
Emerging
28 neuralwork/instruct-finetune-mistral

Fine-tune Mistral 7B to generate fashion style suggestions

39
Emerging
29 anchen1011/FireAct

FireAct: Toward Language Agent Fine-tuning

39
Emerging
30 EricLBuehler/xlora

X-LoRA: Mixture of LoRA Experts

39
Emerging
31 TrelisResearch/install-guides

Various installation guides for Large Language Models

38
Emerging
32 di37/finetuning-quantize-evaluate

Fine-Tune, Quantize, Evaluate: The Complete Guide — LLMs, VLMs, and Embedding Models

38
Emerging
33 ymoslem/Adaptive-MT-LLM-Fine-tuning

Fine-tuning Open-Source LLMs for Adaptive Machine Translation

38
Emerging
34 GiovanniGatti/socratic-llm

Training pipeline for fine tuning Phi-3-mini-instruct to follow the Socratic method

38
Emerging
35 readytensor/rt-llm-eng-cert-week3

Week 3 of LLM Engineering Certification: Learn to fine-tune large language...

38
Emerging
36 aws-samples/fine-tuning-llm-with-domain-knowledge

This repo walks you through how to use transfer learning to fine tune a LLM...

37
Emerging
37 zjohn77/lightning-mlflow-hf

Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflow

37
Emerging
38 promptslab/LLMtuner

FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)

37
Emerging
39 openmedlab/PULSE

PULSE: Pretrained and Unified Language Service Engine

37
Emerging
40 ksm26/Finetuning-Large-Language-Models

Unlock the potential of finetuning Large Language Models (LLMs). Learn from...

37
Emerging
41 poloclub/Fine-tuning-LLMs

Finetune Llama 2 on Colab for free on your own data: step-by-step tutorial

37
Emerging
42 NgJaBach/dark-kit

Collect and share guidance + code snippets for running LM-related tasks.

36
Emerging
43 SculptAI/GIMKit

Guided Infilling Modeling Toolkit

36
Emerging
44 Yog-Sotho/LLM-fine-tuner

Powerful no-code LLM fine-tuner: upload data → train → deploy in minutes....

35
Emerging
45 researchim-ai/models-at-home

training models at home

35
Emerging
46 ngoanpv/llama2_vietnamese

A fine-tuned Large Language Model (LLM) for the Vietnamese language based on...

35
Emerging
47 eliahuhorwitz/Spectral-DeTuning

Official PyTorch Implementation for the "Recovering the Pre-Fine-Tuning...

34
Emerging
48 MNoorFawi/curlora

The code repository for the CURLoRA research paper. Stable LLM continual...

34
Emerging
49 CristianCristanchoT/chivito

Implementation of a Llama-based LLM fine-tuned in Spanish using...

34
Emerging
50 rasbt/gradient-accumulation-blog

Finetuning BLOOM on a single GPU using gradient-accumulation

34
Emerging
51 Pengxin-Guo/FedSA-LoRA

Selective Aggregation for Low-Rank Adaptation in Federated Learning [ICLR 2025]

34
Emerging
52 GURPREETKAURJETHRA/Llama-3-ORPO-Fine-Tuning

Llama 3 ORPO Fine Tuning on A100 in Colab Pro.

34
Emerging
53 XavierSpycy/hands-on-lora

Explore practical fine-tuning of LLMs with Hands-on Lora. Dive into examples...

33
Emerging
54 DoubleVII/lithft

Pretrain, finetune any LLMs from huggingface on your own data.

33
Emerging
55 jianzhnie/LLMToolkit

LLMToolkit is a toolkit for NLP (Natural Language Processing) and LLM (Large...

33
Emerging
56 ramalamadingdong/onnx-rubikpi

ONNX LLM runtime on RUBIK-Pi with Gemma 1B and Llama 3.2 1B

32
Emerging
57 juzhengz/LoRI

[COLM 2025] LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation

32
Emerging
58 mddunlap924/PyTorch-LLM

Fine-tuning an LLM using a Generic Workflow and Best Practices with PyTorch

32
Emerging
59 BFCmath/FinetuneAI_Learning

How to effectively finetune CV/LLM models (without local gpu)

32
Emerging
60 samadon1/LLM-From-Scratch

Medical Language Model fine-tuned using pretraining, instruction tuning, and...

32
Emerging
61 naity/finetune-esm

Scalable Protein Language Model Finetuning with Distributed Learning and...

31
Emerging
62 j-webtek/Local-LLM_FineTune

Finetune Your Local LLM

31
Emerging
63 yangjianxin1/LongQLoRA

LongQLoRA: Extend Context Length of LLMs Efficiently

31
Emerging
64 serp-ai/LLaMA-8bit-LoRA

Repository for Chat LLaMA - training a LoRA for the LLaMA (1 or 2) models on...

31
Emerging
65 graphcore-research/jax-scalify

JAX Scalify: end-to-end scaled arithmetics

31
Emerging
66 Followb1ind1y/Medical-LLM-Fine-tuning

Fine-tunes LLaMA-3-8B on PubMedQA with QLoRA, optimized via DeepSpeed and...

30
Emerging
67 ambideXtrous9/GRPO-and-SFT-Finetune-Qwen3-using-Unsloth-Reasoning-and-Non-Reasoning-Dataset

GRPO and SFT Finetune Qwen3 using Unsloth: Reasoning and Non-Reasoning Dataset

30
Emerging
68 sukanyabag/Finetuning-Qwen2-7B-VQA-on-Radiology-Scans

This repository is doing the finetuning of the Qwen2 7B VLM for performing...

30
Emerging
69 DianaDorobantu/legal-llm

Develop a Romanian legal domain Large Language Model (LLM) using pre-trained...

30
Emerging
70 francoislanc/midistral

LLM finetuned for generating symbolic music

29
Experimental
71 Atomic-man007/falcon-7b-lora-fine-tuning

falcon-7b-lora-fine-tuning

29
Experimental
72 mehdihosseinimoghadam/AVA-Llama-3

Fine-Tuned Llama 3 Persian Large Language Model LLM / Persian Llama 3

29
Experimental
73 PardhuSreeRushiVarma20060119/OpenLoRA

"OpenLoRa" is designed to streamline and elevate the fine-tuning of large...

29
Experimental
74 Abhi0323/Fine-Tuning-LLaMA-2-with-QLORA-and-PEFT

This project enhances the LLaMA-2 model using Quantized Low-Rank Adaptation...

28
Experimental
75 roy-sub/LLM-FineTuning

Fine-Tuned Language Models Exploration using LoRA and Hugging Face's...

28
Experimental
76 YanSte/NLP-LLM-Fine-tuning-Llame-2-QLoRA-2024

Natural Language Processing (NLP) and Large Language Models (LLM) with...

28
Experimental
77 YuanheZ/LoRA-One

LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large ...

28
Experimental
78 TobyYang7/Llava_Qwen2

Visual Instruction Tuning for Qwen2 Base Model

28
Experimental
79 MusfiqDehan/Llama2-Finetuned-for-Translation

Fine-Tuned Llama-2 For Machine Translation

27
Experimental
80 Marker-Inc-Korea/KO-Platypus

[KO-Platy🥮] KO-platypus model: llama-2-ko fine-tuned using Korean-Open-platypus

27
Experimental
81 jkanalakis/finetuning-llama-model-for-text-generation-using-unsloth

Fine-tuning Llama 3.2 3B Instruct model for text generation using Unsloth AI

26
Experimental
82 rambodazimi/KD-LoRA

KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge...

26
Experimental
83 GURPREETKAURJETHRA/LLMs-Inference-and-Fine-Tuning

Estimate Memory Consumption of LLMs Inference and Fine Tuning

26
Experimental
84 aniquetahir/JORA

JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)

26
Experimental
85 adithya-s-k/Indic-llm

An open-source framework designed to adapt pre-trained Language Models...

26
Experimental
86 Rs-py/HowToFineTuneLlama3.1

Quick tutorial showing how to fine-tune Llama3.1 with nothing but free tools...

26
Experimental
87 LimDoHyeon/EEG-LLM

Fine-tuned LLM for electroencephalography (EEG) data classification

25
Experimental
88 ambideXtrous9/Finetune-Qwen3-using-Unsloth

Finetune Qwen3 using Unsloth: Reasoning and Non-Reasoning Dataset

25
Experimental
89 HenryNdubuaku/super-lazy-autograd

Hand-derived memory-efficient VJPs for tuning LLMs on laptops.

25
Experimental
90 daniau23/LoRAfrica

LoRAfrica: Scaling LLM Fine Tuning for African History

24
Experimental
91 strickvl/isafpr_finetune

Finetuning an LLM for structured data extraction from press releases

24
Experimental
92 krishnaplwl/Homework_Solver_LLM

A fine-tuned LLM to solve homework questions ranging from maths to science...

24
Experimental
93 inuwamobarak/Meta-Llama-3-8B

Experiments with the Meta-Llama-3-8B

24
Experimental
94 heyisula/infosage-13b

LLM pretraining pipeline using the FineWeb-Edu Dataset

23
Experimental
95 nikisetti01/MTL-LORA-for-PubMedQA-and-Riddle

🚀 Fine-tuning LLaMA 1B for a medical chatbot using LoRA and a custom...

23
Experimental
96 sovit-123/lm_sft

Various LMs/LLMs below 3B parameters (for now) trained using SFT (Supervised...

23
Experimental
97 mattialoszach/LoRA-Agentic-Output-Format

Fine-tuning LLMs for structured agent-style outputs (e.g. JSON), built for...

23
Experimental
98 Emart29/phi4-finance-finetuning

Fine-tuning Microsoft Phi-4 Mini 3.8B on SEC 10-K financial Q&A using QLoRA...

23
Experimental
99 mbeps/llama3.1_fine-tuning_mult-it

Fine-tuning various Llama 3.1 family of models on the Mult-It dataset

22
Experimental
100 YYZhang2025/Pali-Gemma

Implement Multi-Modality-LLM and fine tuning the model using LoRA. Only...

22
Experimental
101 mbeps/magistral_mult-it_fine-tuning

Parameter Efficient Fine-Tuning of Magistral Small model on the Mult-It...

22
Experimental
102 SergiuDeveloper/yoro-finetuning

YORO (You-Only-Reason-Once) - a novel LLM architecture that runs the main...

22
Experimental
103 fkuhne/doctune

A fine-tuning pipeline for SLMs

22
Experimental
104 paulocoutinhox/mini-llm

Simple and lightweight tool to fine-tune GPT models (like GPT-2 and GPT-Neo)...

22
Experimental
105 arifme071/llm-finetuning-engineering-domain

Fine-tuned BERT (94.2% accuracy) + LoRA Mistral-7B on railroad AI domain...

22
Experimental
106 anmolg1997/Domain-Adaptive-LLM

Domain-specialized LLM fine-tuning — medical, legal, finance, code domains...

22
Experimental
107 stperrakis/ULM-fit

This repository contains an implementation of the ULMfit (Universal Language...

22
Experimental
108 TLILIFIRAS/Efficient-Fine-Tuning-of-Vision-Language-Models-with-LoRA-Quantization

This project demonstrates parameter-efficient fine-tuning of large...

22
Experimental
109 EN10/BabyLlama

Train and run a small Llama 2 model from scratch on the TinyStories dataset.

22
Experimental
110 Abdur-azure/xlmtec

xlmtec is a powerful, modular, and interactive command-line tool for...

22
Experimental
111 Arlchoose-code/Indonesian-LLM-Finetune

Fine-tune your Indonesian LLM with LoRA — instruction tuning kit designed to...

22
Experimental
112 Abeshith/FineTuning_LanguageModels

🎯 Fine-tune large language models and use them for text-related tasks. This...

22
Experimental
113 garystafford/duke-fine-tuning-llama

DUKE (Document Understanding and Knowledge Extraction) along with...

22
Experimental
114 PriyaDas258/llm-biomedical-finetuning-lab

Fine-tune TinyLlama, Phi-2, and Mistral on PubMedQA using LoRA/QLoRA —...

22
Experimental
115 mbeps/qwen3_fine-tune_mult-it

Parameter Efficient Fine-Tuning of various Qwen3 family of models on the...

22
Experimental
116 renaldiangsar/Medical-LLM-Fine-Tuning

Fine-tuning Large Language Models (LLMs) for medical reasoning to enhance...

22
Experimental
117 khadimhussain0/kllm

Fine-tune state-of-the-art LLMs with LoRA/QLoRA on consumer hardware.

21
Experimental
118 louisc-s/QLoRA-Fine-tuning-for-Film-Character-Styled-Responses-from-LLM

Code for fine-tuning Llama2 LLM with custom text dataset to produce film...

21
Experimental
119 DNLab2024/BGP_LLaMA

BGP-LLaMA: Fine-tuning Open-Source LLM on BGP Routing Knowledge and Analysis

21
Experimental
120 jmaczan/c-137

🦙 Llama 2 7B fine-tuned to revive Rick

21
Experimental
121 YanSte/NLP-LLM-Fine-tuning-DeepSpeed

Natural Language Processing (NLP) and Large Language Models (LLM) with...

21
Experimental
122 Tommaso-Sgroi/VojoLe-LM

DL24-25 project. The goal is Fine-Tuning a LLM on Italian Dialect.

21
Experimental
123 Abu-Sameer-66/ChemLLM-Tox-OLMo

Fine-tuning OLMo-7B with QLoRA & DeepChem for Molecular Toxicity Prediction...

21
Experimental
124 r-kovalch/omnigec-models

Reproducible QLoRA recipes and configs that fine‑tune Aya‑Expanse‑8B and...

21
Experimental
125 ph-ausseil/llm-training-dataset-builder

Streamlines the creation of dataset to train a Large Language Model with...

21
Experimental
126 AIdventures/flora

Fine-tuning LLMs with LoRA

21
Experimental
127 nv-legate/multimesh-jax

PjRt plugin and Python APIs for MPMD workflows in Jax

21
Experimental
128 HEMANGANI/Fine-Tuning-LLM-for-QA

Fine-Tuning Large Language Models for Question Answering

20
Experimental
129 arunpshankar/VAI-FineTuning-LLMs

"Clean and comprehensive examples for fine-tuning LLMs supported by Vertex...

20
Experimental
130 Eric-he-cn/Qwen3-QLoRA-News

This project enables the model to directly generate structured summaries...

20
Experimental
131 SauravMaheshkar/nanollm

JAX LLM playground

20
Experimental
132 zufeshan12/fine-tuning-and-reinforcement-learning-on-llms

supervised fine tuning and RLAIF on DeepSeek-math-7b-base using LoRA...

20
Experimental
133 neoheartbeats/neoheartbeats-kernel

An architecture for LLMs' continual-learning and long-term memories

20
Experimental
134 chaithanyasai18/LLMs-finetuning

This repository consists of python scripts for LLM finetuning (SFT, LoRA,...

20
Experimental
135 priyam-hub/LLM-Fine-Tuning-Pipeline

A comprehensive pipeline for Different Fine-Tuning Methods for Large...

19
Experimental
136 Pavansomisetty21/Visual-Question-Answering-Pixtral_Vision_Finetuning_Unsloth

In this we finetune Pixtral-12B-2409 model using unsloth for visual Question...

19
Experimental
137 RenaudGaudron/llm-quantisation-performance-study

Code and data accompanying the article "The impact of quantising a small...

19
Experimental
138 jwliao1209/Taiwan-LLaMa-Instruction-Tuning

2023 NTU CSIE ADL Homework 3

19
Experimental
139 OutllierRejects/Intellihack_OutlierRejects_Task3

LLM Fine-tuning Challenge Enhancing Qwen 2.5 3B for AI Research QA

19
Experimental
140 c4dt/pitfalls_in_fine_tuning_llms

Jupyter notebooks for the LLM fine-tuning pitfalls hands-on workshop

19
Experimental
141 manufactai/finetuning-cookbook

A collection of practical examples and tutorials for fine-tuning large...

19
Experimental
142 giankev/Ancient-to-Modern-Italian-Automatic-Translation

Finetuning and evaluating LLMs on Ancient-to-Modern Italian translation task.

18
Experimental
143 atasoglu/turkish-llava-notebooks

A useful collection of notebooks for quantization, fine-tuning, and...

18
Experimental
144 erraji-jo/LLM-Finutune-based-on-customData

The project aims to showcase the process of fine-tuning LLMs on...

18
Experimental
145 jo-valer/machine-translation-ladin-fascian

Repository of our paper Nesciun Lengaz Lascià Endò: Machine Translation for...

18
Experimental
146 monadicarts/mistral-7b-trainer

Mistral 7b v0.3 LLM Model Trainer

18
Experimental
147 garyfanhku/Galore-pytorch

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

18
Experimental
148 sjsayedkader/FineTuning-paris2024-olympics

End-to-end LLM fine-tuning: Paris 2024 Olympics Q&A using Databricks, AWS...

17
Experimental
149 NgJaBach/Language-Models-Utilities

Collect and share guidance + code snippets for running LM-related tasks.

17
Experimental
150 rvats20/LLM-Classification-Finetuning

Welcome to the LLM Classification Finetuning repository! This project...

17
Experimental
151 mehrdadalmasi2020/microsoft_MiniLM_L12_H384_uncased

A library that leverages the pre-trained microsoft_MiniLM-L12-H384-uncased...

17
Experimental
152 shizheng-rlfresh/llm-opt

Fine-tuning LLMs with LoRA and Hessian-free optimizers

17
Experimental
153 PrathamLearnsToCode/Fine-tuning-FLAN-T5-with-LoRA-WandB

Fine tune an LLM for summarization task using Low rank adaptation

17
Experimental
154 slv-ai/Fine-Tune-LLMs-with-DPO

Fine-tuning Microsoft’s Phi-2 Machine Learning Model with DPO

17
Experimental
155 sanskaryo/LLM-Finetuning-Projects

This repository contains various projects focused on fine-tuning Large...

17
Experimental
156 clement-cvll/AIMO-Math-Finetuning

Fine tuning of a model for AIMO 2 math competition on Kaggle

17
Experimental
157 sahilfaizal01/Kaggle-Contest---Fine-tuning-Llama-3.1-LLM-

We used the Llama-3.1 8B (LLM) model to verify math problem solutions via...

17
Experimental
158 mfaizan-ai/NewsQA

News QA generation and fine tuning an LLM for QA generation (under development)

17
Experimental
159 Rishabh9559/medical-llama-3.2-3B-model

This is all about fine-tuning the Llama3.2-3B model on your medical textbook.

16
Experimental
160 dineshsoudagar/llm-lab-from-scratch-to-fine-tuning

Comprehensive resources and scripts for training and fine-tuning Large...

15
Experimental
161 sparkup/medical-llm-finetuning-alignment

Medical LLM fine-tuning and preference alignment using SFT and DPO, with...

15
Experimental
162 spatialft/spatialft.github.io

LoRA fine-tuning of LFM2.5-1.2B to improve spatial reasoning on StepGame —...

14
Experimental
163 igna-s/QLoRA-Experiments

A collection of SFT and distillation pipelines to train specialized medical...

14
Experimental
164 Gholamrezadar/finetuning_llm_on_letter_counting

Fine-tuning Gemma-3 4B on the letter-counting dataset

14
Experimental
165 YounesBensafia/Algeria-2-0-FineTuning-workshop

This repository contains resources and examples used in my workshop for...

14
Experimental
166 Pects1949/LLM-Fine-tuning-Toolkit

A comprehensive toolkit for fine-tuning and deploying Large Language Models...

14
Experimental
167 Witurpred64/LLM-FineTuning-Toolkit

A comprehensive toolkit for fine-tuning Large Language Models (LLMs) with...

14
Experimental
168 di37/full-fine-tuning-nvidia-question-and-answering

Flan-t5-base model was fine-tuned on Nvidia Question and Answer Pair Dataset...

14
Experimental
169 Isha1600/LLM-Finetuning

Fine-tuning Large Language Models (LLMs) using custom datasets for improved...

14
Experimental
170 codershiyar/llama-google-colab-tutorial

Step-by-step tutorial on loading and using Llama 3.1 8B Instruct in Google...

14
Experimental
171 aakarsh31/qlora-llm-finetuning

QLoRA fine-tuning of Llama 3.2 3B on MedQA with full LoRA rank ablation...

14
Experimental
172 ahmad-albasha/Frankenstein-LLM-Model-fine-tuning-code

Fine-tuning Mistral-7B-v0.1 on Mary Shelley's Frankenstein using LoRA/QLoRA...

14
Experimental
173 jinda-liu/R-LoRA

This repository contains the source code and related resources for R-LoRA.

14
Experimental
174 Gyldenn/storywriter

Fine-tuning Mistral 7B with LoRA (QLoRA 4-bit) to generate Shakespearean...

14
Experimental
175 gazelle93/llm-fine-tuning-sft-lora-qlora

Practical examples for fine-tuning large language models (LLMs) with SFT,...

13
Experimental
176 alinourian/Fine-tuning-Mistral-7b-QA

Fine-tuning Mistral-7b with PEFT (Parameter-Efficient Fine-Tuning) and...

13
Experimental
177 alvi75/MultiTask-QLoRA-NFAnalysis

Official implementation of "Parameter-Efficient Multi-Task Fine-Tuning in...

13
Experimental
178 Akarsh1/Exploring-Unsloth-Library-for-Fine-Tuning

This is a sample notebook that can be used for exploring the fine-tuning of...

13
Experimental
179 ayushtiwari134/llm_fine_tuning

This model is fine-tuned to respond like Michael Gary Scott, Regional...

13
Experimental
180 FlorinAndrei/llm-social-media-cheap

LLMs fine-tuned with social media comments on cheap hardware

13
Experimental
181 MSWagner/qwen-lora-grpo-letter-counting

Fine-tuning Qwen2.5-3B-Instruct model with LoRA (Low-Rank Adaptation) and...

13
Experimental
182 Yousefbadr0/GPT-Neo_Medical_Fine-Tuning_using_LoRA

Fine-tuning GPT-Neo-125M using LoRA on a medical QA dataset, achieving...

13
Experimental
183 AparnaRoy76/LLM-finetuning

A comprehensive toolkit for fine-tuning Large Language Models (LLMs) using...

13
Experimental
184 nglguarino/code-completion

Fine-tuned 3 LLMs (Phi-2, Gemma, Llama2) on 100K+ instruction CodeInstruct...

13
Experimental
185 Sahar-Sheikhi/CRM-Data-Automation-Llama-3.2-Finetuned-

A memory-efficient fine-tuning pipeline using Llama-3.2-3B and QLoRA to...

13
Experimental
186 adityanaranje/FineTune-LLM

Fine-tuned a pretrained language model using Unsloth to specialize domain...

12
Experimental
187 Utshav-paudel/Finetuning-Mistral7B-on-google-colab

Finetuning Mistral 7B on google colab

12
Experimental
188 AbdulSametTurkmenoglu/unsloth_llama_news

Llama 2 7B - Turkish News Dataset Fine-Tuning

12
Experimental
189 Atomheart-Father/LoRA-SFT-vs-LoRA-DPO-A-Comparative-Study-of-Small-Factual-Updates-in-LLMs

This paper studies small factual updates: updates that preserve the subject...

12
Experimental
190 AvinashBolleddula/Domain-Adaptive-LLM-Fine-Tuning-for-Enterprise-Policy-QA

Production-grade pipeline for domain-adaptive fine-tuning of a small LLM...

12
Experimental
191 mltrev23/flan-t5-fine-tune

Flan-t5 model fine tune LoRA and Langchain

12
Experimental
192 luochang212/sft-note

Three ways to implement supervised fine-tuning (SFT): LLaMA Factory, trl, and unsloth

12
Experimental
193 chatterjeesaurabh/Dialogue-Summarization-with-Large-Language-Model

Explored In-Context prompt learning, Full Fine-Tuning, Parameter-Efficient...

11
Experimental
194 jistiak/finetune-gpt-deepspeed

Sample codes and guidelines on how to finetune any opensource GPT models...

11
Experimental
195 Muneeb1030/FineTune-Tiny-Llama

Fine-tuning the Tiny Llama model to mimic my professor's writing style using...

11
Experimental
196 serkanars/llm-fine-tuning-with-lora

Fine-tuning the Mistral-7b-v0.1 model for a specific task using the LoRA approach

11
Experimental
197 tensor-fusion/sophia-jax

JAX implementation of 'Sophia: A Scalable Stochastic Second-order Optimizer...

11
Experimental
198 Shreyash-Gaur/TensorFlow_Python_Code_Generation

Fine-tuning CodeT5 for Python code generation on the MBPP dataset. Features...

11
Experimental
199 pracheeeeez/Fine_tuning_Llama2

This project focuses on fine-tuning the powerful Llama2 language model and...

11
Experimental
200 Holy-Morphism/VLM

Fine-Tuning a Generative VLM for Image Describing

11
Experimental
201 leodeveloper/phi3-vision-multimodel

Microsoft Phi-3 Vision, the first multimodal model by Microsoft: demo with Hugging Face

11
Experimental
202 ako1983/Llama2-finetuned-mindsdb

Llama2 7-b-hf Fine-tuned on MindsDB Docs

11
Experimental
203 aloobun/llama2-7b-openhermes-15k

A 4-bit qlora refinement of llama-v2-guanaco, fine tuned on the 15k rows of...

11
Experimental
204 thatomaelane/Building-a-Domain-Expert-Model

This project aims to fine-tune the Meta Llama 2 7B foundation model to...

10
Experimental
205 mayur-kun/finetuning-llama2-7b-chat

This repository demonstrates fine-tuning a Large Language Model (LLM) on...

10
Experimental
206 NavodPeiris/Vulnerability-Analyst-Qwen2.5-1.5B-Instruct

Fine-tune Qwen2.5-1.5B-Instruct model for code vulnerability analysis

10
Experimental
207 Siddhesh19991/Llama-3-8B-Fine-tune

This project demonstrates how to Fine-Tune Llama-3-8B model on medical data...

10
Experimental
208 Thiraput01/QwenMed

Qwen3 fine-tuned on medical datasets with reasoning data

10
Experimental
209 AbdulHadi806/LLM_fune_tuning_Hackathon

In the recent competition, we were challenged to finetune a model that can...

10
Experimental
210 khaouitiabdelhakim/llm_fine_tuning

Fine-tuning essentially involves taking a pre-trained LLM, already equipped...

10
Experimental
211 ahmadalsharef994/Langchain_LlamaCPP_Mistral_7B_Fine_Tuning_Example

A comprehensive example of fine-tuning Mistral 7B models with Langchain and...

10
Experimental
212 Pragateeshwaran/LoRA-From-Scratch

This project implements a Low-Rank Adaptation (LoRA) technique from scratch...

10
Experimental