LLM Fine-Tuning Transformer Models

There are 212 LLM fine-tuning models tracked. 4 score above 50 (Established tier). The highest-rated is OptimalScale/LMFlow at 59/100 with 8,489 stars. 1 of the top 10 is actively maintained.

Get all 212 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-fine-tuning&limit=20"

Open to everyone: 100 requests/day with no key needed, or 1,000 requests/day with a free key.
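The same request can be issued from a script. The following is a minimal Python sketch using only the standard library; the helper names `build_url` and `fetch_quality` are hypothetical, and the shape of the JSON response is not documented on this page, so the sketch returns the decoded payload without assuming any field names.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint copied from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="transformers", subcategory="llm-fine-tuning", limit=20):
    """Build the query URL with the same parameters as the curl example."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_quality(url, timeout=10.0):
    """Fetch and decode the JSON payload (hypothetical helper).

    The response schema is an unknown here, so the decoded object is
    returned as-is rather than indexed by assumed keys.
    """
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # Only prints the URL; call fetch_quality(build_url()) to hit the network.
    print(build_url())
```

Keeping URL construction separate from the network call makes it easy to check the query string without spending one of the 100 keyless requests per day.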

#	Model	Score	Tier	Stars	Language
1	OptimalScale/LMFlow An Extensible Toolkit for Finetuning and Inference of Large Foundation...	59	Established	8,489	Python
2	adithya-s-k/AI-Engineering.academy Mastering Applied AI, One Concept at a Time	57	Established	2,140	Jupyter Notebook
3	jax-ml/jax-llm-examples Minimal yet performant LLM examples in pure JAX	53	Established	244	Python
4	young-geng/scalax A simple library for scaling up JAX programs	52	Established	146	Python
5	riyanshibohra/TuneKit Upload your data → Get a fine-tuned SLM. Free.	49	Emerging	138	Python
6	JIA-Lab-research/LongLoRA Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)	46	Emerging	2,694	Python
7	georgian-io/LLM-Finetuning-Toolkit Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.	46	Emerging	870	Python
8	kyegomez/Finetuning-Suite Finetune any model on HF in less than 30 seconds	45	Emerging	56	Jupyter Notebook
9	MaximeRobeyns/bayesian_lora Bayesian Low-Rank Adaptation for Large Language Models	45	Emerging	37	Python
10	NVlabs/EoRA [ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with...	45	Emerging	29	Python
11	ZinYY/TreeLoRA A pytorch implementation of the paper "TreeLoRA: Efficient Continual...	44	Emerging	347	Python
12	SakanaAI/text-to-lora Hypernetworks that adapt LLMs for specific benchmark tasks using only...	44	Emerging	1,214	Python
13	rohan-paul/LLM-FineTuning-Large-Language-Models LLM (Large Language Model) FineTuning	43	Emerging	566	Jupyter Notebook
14	SensAI-PT/LLaMa2lang Convenience scripts to finetune (chat-)LLaMa3 and other models for any language	42	Emerging	313	Python
15	A-baoYang/alpaca-7b-chinese Finetune LLaMA-7B with Chinese instruction datasets	42	Emerging	137	Python
16	VectorInstitute/vectorlm LLM finetuning in resource-constrained environments.	41	Emerging	55	Python
17	bigscience-workshop/xmtf Crosslingual Generalization through Multitask Finetuning	41	Emerging	537	Jupyter Notebook
18	NVlabs/DoRA [ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed...	41	Emerging	942	Python
19	liuqidong07/MOELoRA-peft [SIGIR'24] The official implementation code of MOELoRA.	41	Emerging	189	Python
20	punica-ai/punica Serving multiple LoRA finetuned LLM as one	41	Emerging	1,145	Python
21	sandy1990418/Finetune-Qwen2.5-VL Fine-tuning Qwen2.5-VL for vision-language tasks | Optimized for Vision...	41	Emerging	154	Python
22	architkaila/Fine-Tuning-LLMs-for-Medical-Entity-Extraction Exploring the potential of fine-tuning Large Language Models (LLMs) like...	40	Emerging	89	Python
23	molbal/llm-text-completion-finetune Guide on text completion large language model fine-tuning, including example...	40	Emerging	87	Python
24	rasbt/blog-finetuning-llama-adapters Supplementary material for "Understanding Parameter-Efficient Finetuning of...	40	Emerging	48	Jupyter Notebook
25	metriccoders/one-line-llm-tuner This repository is the source code for fine tuning any LLM in just one line 🔥	40	Emerging	4	Python
26	AlexandrosChrtn/llama-fine-tune-guide Fine-tune the newly released Llama-3.2 lightweight models.	40	Emerging	22	Python
27	rasbt/dora-from-scratch LoRA and DoRA from Scratch Implementations	39	Emerging	217	Jupyter Notebook
28	neuralwork/instruct-finetune-mistral Fine-tune Mistral 7B to generate fashion style suggestions	39	Emerging	35	Python
29	anchen1011/FireAct FireAct: Toward Language Agent Fine-tuning	39	Emerging	292	Python
30	EricLBuehler/xlora X-LoRA: Mixture of LoRA Experts	39	Emerging	267	Python
31	TrelisResearch/install-guides Various installation guides for Large Language Models	38	Emerging	77	Jupyter Notebook
32	di37/finetuning-quantize-evaluate Fine-Tune, Quantize, Evaluate: The Complete Guide — LLMs, VLMs, and Embedding Models	38	Emerging	13	Typst
33	ymoslem/Adaptive-MT-LLM-Fine-tuning Fine-tuning Open-Source LLMs for Adaptive Machine Translation	38	Emerging	92	Jupyter Notebook
34	GiovanniGatti/socratic-llm Training pipeline for fine tuning Phi-3-mini-instruct to follow the Socratic method	38	Emerging	31	Python
35	readytensor/rt-llm-eng-cert-week3 Week 3 of LLM Engineering Certification: Learn to fine-tune large language...	38	Emerging	1	Jupyter Notebook
36	aws-samples/fine-tuning-llm-with-domain-knowledge This repo walks you through how to use transfer learning to fine tune a LLM...	37	Emerging	42	Jupyter Notebook
37	zjohn77/lightning-mlflow-hf Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflow	37	Emerging	65	Python
38	promptslab/LLMtuner FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)	37	Emerging	247	Python
39	openmedlab/PULSE PULSE: Pretrained and Unified Language Service Engine	37	Emerging	494	Python
40	ksm26/Finetuning-Large-Language-Models Unlock the potential of finetuning Large Language Models (LLMs). Learn from...	37	Emerging	68	Jupyter Notebook
41	poloclub/Fine-tuning-LLMs Finetune Llama 2 on Colab for free on your own data: step-by-step tutorial	37	Emerging	74	Jupyter Notebook
42	NgJaBach/dark-kit Collect and share guidance + code snippets for running LM-related tasks.	36	Emerging	4	Python
43	SculptAI/GIMKit Guided Infilling Modeling Toolkit	36	Emerging	2	Python
44	Yog-Sotho/LLM-fine-tuner Powerful no-code LLM fine-tuner: upload data → train → deploy in minutes....	35	Emerging	13	Python
45	researchim-ai/models-at-home training models at home	35	Emerging	34	Python
46	ngoanpv/llama2_vietnamese A fine-tuned Large Language Model (LLM) for the Vietnamese language based on...	35	Emerging	17	Python
47	eliahuhorwitz/Spectral-DeTuning Official PyTorch Implementation for the "Recovering the Pre-Fine-Tuning...	34	Emerging	85	Python
48	MNoorFawi/curlora The code repository for the CURLoRA research paper. Stable LLM continual...	34	Emerging	53	Jupyter Notebook
49	CristianCristanchoT/chivito Implementation of a Llama-based LLM fine-tuned in Spanish using...	34	Emerging	10	Jupyter Notebook
50	rasbt/gradient-accumulation-blog Finetuning BLOOM on a single GPU using gradient-accumulation	34	Emerging	31	Python
51	Pengxin-Guo/FedSA-LoRA Selective Aggregation for Low-Rank Adaptation in Federated Learning [ICLR 2025]	34	Emerging	60	Python
52	GURPREETKAURJETHRA/Llama-3-ORPO-Fine-Tuning Llama 3 ORPO Fine Tuning on A100 in Colab Pro.	34	Emerging	4	Jupyter Notebook
53	XavierSpycy/hands-on-lora Explore practical fine-tuning of LLMs with Hands-on Lora. Dive into examples...	33	Emerging	8	—
54	DoubleVII/lithft Pretrain, finetune any LLMs from huggingface on your own data.	33	Emerging	4	Python
55	jianzhnie/LLMToolkit LLMToolkit is a toolkit for NLP(Natural Language Processing) and LLM(Large...	33	Emerging	6	Python
56	ramalamadingdong/onnx-rubikpi ONNX LLM runtime on RUBIK-Pi with Gemma 1B and Llama 3.2 1B	32	Emerging	2	Python
57	juzhengz/LoRI [COLM 2025] LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation	32	Emerging	171	Python
58	mddunlap924/PyTorch-LLM Fine-tuning an LLM using a Generic Workflow and Best Practices with PyTorch	32	Emerging	28	Jupyter Notebook
59	BFCmath/FinetuneAI_Learning How to effectively finetune CV/LLM models (without local gpu)	32	Emerging	38	Jupyter Notebook
60	samadon1/LLM-From-Scratch Medical Language Model fine-tuned using pretraining, instruction tuning, and...	32	Emerging	29	Jupyter Notebook
61	naity/finetune-esm Scalable Protein Language Model Finetuning with Distributed Learning and...	31	Emerging	34	Jupyter Notebook
62	j-webtek/Local-LLM_FineTune Finetune Your Local LLM	31	Emerging	18	Python
63	yangjianxin1/LongQLoRA LongQLoRA: Extent Context Length of LLMs Efficiently	31	Emerging	168	Python
64	serp-ai/LLaMA-8bit-LoRA Repository for Chat LLaMA - training a LoRA for the LLaMA (1 or 2) models on...	31	Emerging	150	Python
65	graphcore-research/jax-scalify JAX Scalify: end-to-end scaled arithmetics	31	Emerging	18	Python
66	Followb1ind1y/Medical-LLM-Fine-tuning Fine-tunes LLaMA-3-8B on PubMedQA with QLoRA, optimized via DeepSpeed and...	30	Emerging	2	Jupyter Notebook
67	ambideXtrous9/GRPO-and-SFT-Finetune-Qwen3-using-Unsloth-Reasoning-and-Non-Reasoning-Dataset GRPO and SFT Finetune Qwen3 using Unsloth : Reasoning and Non-Reasoning Dataset	30	Emerging	5	Jupyter Notebook
68	sukanyabag/Finetuning-Qwen2-7B-VQA-on-Radiology-Scans This repository is doing the finetuning of the Qwen2 7B VLM for performing...	30	Emerging	6	Jupyter Notebook
69	DianaDorobantu/legal-llm Develop a Romanian legal domain Large Language Model (LLM) using pre-trained...	30	Emerging	5	Python
70	francoislanc/midistral LLM finetuned for generating symbolic music	29	Experimental	42	Python
71	Atomic-man007/falcon-7b-lora-fine-tuning falcon-7b-lora-fine-tuning	29	Experimental	1	Jupyter Notebook
72	mehdihosseinimoghadam/AVA-Llama-3 Fine-Tuned Llama 3 Persian Large Language Model LLM / Persian Llama 3	29	Experimental	36	Jupyter Notebook
73	PardhuSreeRushiVarma20060119/OpenLoRA "OpenLoRa" is designed to streamline and elevate the fine-tuning of large...	29	Experimental	1	TypeScript
74	Abhi0323/Fine-Tuning-LLaMA-2-with-QLORA-and-PEFT This project enhances the LLaMA-2 model using Quantized Low-Rank Adaptation...	28	Experimental	13	Jupyter Notebook
75	roy-sub/LLM-FineTuning Fine-Tuned Language Models Exploration using LoRA and Hugging Face's...	28	Experimental	11	Jupyter Notebook
76	YanSte/NLP-LLM-Fine-tuning-Llame-2-QLoRA-2024 Natural Language Processing (NLP) and Large Language Models (LLM) with...	28	Experimental	9	Jupyter Notebook
77	YuanheZ/LoRA-One LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large ...	28	Experimental	28	Python
78	TobyYang7/Llava_Qwen2 Visual Instruction Tuning for Qwen2 Base Model	28	Experimental	41	Python
79	MusfiqDehan/Llama2-Finetuned-for-Translation Fine-Tuned Llama-2 For Machine Translation	27	Experimental	10	Jupyter Notebook
80	Marker-Inc-Korea/KO-Platypus [KO-Platy🥮] KO-platypus model: llama-2-ko fine-tuned using Korean-Open-platypus	27	Experimental	73	Jupyter Notebook
81	jkanalakis/finetuning-llama-model-for-text-generation-using-unsloth Fine-tuning Llama 3.2 3B Instruct model for text generation using Unsloth AI	26	Experimental	9	Jupyter Notebook
82	rambodazimi/KD-LoRA KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge...	26	Experimental	22	Python
83	GURPREETKAURJETHRA/LLMs-Inference-and-Fine-Tuning Estimate Memory Consumption of LLMs Inference and Fine Tuning	26	Experimental	3	Jupyter Notebook
84	aniquetahir/JORA JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)	26	Experimental	35	Python
85	adithya-s-k/Indic-llm A open-source framework designed to adapt pre-trained Language Models...	26	Experimental	23	Python
86	Rs-py/HowToFineTuneLlama3.1 Quick tutorial showing how to fine-tune Llama3.1 with nothing but free tools...	26	Experimental	9	Jupyter Notebook
87	LimDoHyeon/EEG-LLM Fine-tuned LLM for electroencephalography(EEG) data classification	25	Experimental	14	Jupyter Notebook
88	ambideXtrous9/Finetune-Qwen3-using-Unsloth Finetune Qwen3 using Unsloth : Reasoning and Non-Reasoning Dataset	25	Experimental	5	Jupyter Notebook
89	HenryNdubuaku/super-lazy-autograd Hand-derived memory-efficient VJPs for tuning LLMs on laptops.	25	Experimental	38	Python
90	daniau23/LoRAfrica LoRAfrica: Scaling LLM Fine Tuning for African History	24	Experimental	4	Jupyter Notebook
91	strickvl/isafpr_finetune Finetuning an LLM for structured data extraction from press releases	24	Experimental	5	Jupyter Notebook
92	krishnaplwl/Homework_Solver_LLM A fine-tuned LLM to solve homework questions ranging from maths to science...	24	Experimental	2	—
93	inuwamobarak/Meta-Llama-3-8B Experiments with the Meta-Llama-3-8B	24	Experimental	4	Jupyter Notebook
94	heyisula/infosage-13b LLM pretraining pipeline using the FineWeb-Edu Dataset	23	Experimental	2	Jupyter Notebook
95	nikisetti01/MTL-LORA-for-PubMedQA-and-Riddle 🚀 Fine-tuning LLaMA 1B for a medical chatbot using LoRA and a custom...	23	Experimental	1	Jupyter Notebook
96	sovit-123/lm_sft Various LMs/LLMs below 3B parameters (for now) trained using SFT (Supervised...	23	Experimental	4	Jupyter Notebook
97	mattialoszach/LoRA-Agentic-Output-Format Fine-tuning LLMs for structured agent-style outputs (e.g. JSON), built for...	23	Experimental	2	Jupyter Notebook
98	Emart29/phi4-finance-finetuning Fine-tuning Microsoft Phi-4 Mini 3.8B on SEC 10-K financial Q&A using QLoRA...	23	Experimental	2	Jupyter Notebook
99	mbeps/llama3.1_fine-tuning_mult-it Fine-tuning various Llama 3.1 family of models on the Mult-It dataset	22	Experimental	1	Jupyter Notebook
100	YYZhang2025/Pali-Gemma Implement Multi-Modality-LLM and fine tuning the model using LoRA. Only...	22	Experimental	9	Jupyter Notebook
101	mbeps/magistral_mult-it_fine-tuning Parameter Efficient Fine-Tuning of Magistral Small model on the Mult-It...	22	Experimental	1	Python
102	SergiuDeveloper/yoro-finetuning YORO (You-Only-Reason-Once) - a novel LLM architecture that runs the main...	22	Experimental	—	Jupyter Notebook
103	fkuhne/doctune A fine-tuning pipeline for SLMs	22	Experimental	—	Python
104	paulocoutinhox/mini-llm Simple and lightweight tool to fine-tune GPT models (like GPT-2 and GPT-Neo)...	22	Experimental	35	Python
105	arifme071/llm-finetuning-engineering-domain Fine-tuned BERT (94.2% accuracy) + LoRA Mistral-7B on railroad AI domain...	22	Experimental	—	Jupyter Notebook
106	anmolg1997/Domain-Adaptive-LLM Domain-specialized LLM fine-tuning — medical, legal, finance, code domains...	22	Experimental	—	Python
107	stperrakis/ULM-fit This repository contains an implementation of the ULMfit (Universal Language...	22	Experimental	2	Jupyter Notebook
108	TLILIFIRAS/Efficient-Fine-Tuning-of-Vision-Language-Models-with-LoRA-Quantization This project demonstrates parameter-efficient fine-tuning of large...	22	Experimental	—	Jupyter Notebook
109	EN10/BabyLlama Train and run a small Llama 2 model from scratch on the TinyStories dataset.	22	Experimental	5	Jupyter Notebook
110	Abdur-azure/xlmtec xlmtec is a powerful, modular, and interactive command-line tool for...	22	Experimental	—	HTML
111	Arlchoose-code/Indonesian-LLM-Finetune Fine-tune your Indonesian LLM with LoRA — instruction tuning kit designed to...	22	Experimental	1	Python
112	Abeshith/FineTuning_LanguageModels 🎯 Fine-tune large language models and use them for text-related tasks. This...	22	Experimental	5	Jupyter Notebook
113	garystafford/duke-fine-tuning-llama DUKE (Document Understanding and Knowledge Extraction) along with...	22	Experimental	1	Jupyter Notebook
114	PriyaDas258/llm-biomedical-finetuning-lab Fine-tune TinyLlama, Phi-2, and Mistral on PubMedQA using LoRA/QLoRA —...	22	Experimental	—	Python
115	mbeps/qwen3_fine-tune_mult-it Parameter Efficient Fine-Tuning of various Qwen3 family of models on the...	22	Experimental	1	Jupyter Notebook
116	renaldiangsar/Medical-LLM-Fine-Tuning Fine-tuning Large Language Models (LLMs) for medical reasoning to enhances...	22	Experimental	2	Jupyter Notebook
117	khadimhussain0/kllm Fine-tune state-of-the-art LLMs with LoRA/QLoRA on consumer hardware.	21	Experimental	—	Python
118	louisc-s/QLoRA-Fine-tuning-for-Film-Character-Styled-Responses-from-LLM Code for fine-tuning Llama2 LLM with custom text dataset to produce film...	21	Experimental	9	Python
119	DNLab2024/BGP_LLaMA BGP-LLaMA: Fine-tuning Open-Source LLM on BGP Routing Knowledge and Analysis	21	Experimental	7	Jupyter Notebook
120	jmaczan/c-137 🦙 Llama 2 7B fine-tuned to revive Rick	21	Experimental	1	Jupyter Notebook
121	YanSte/NLP-LLM-Fine-tuning-DeepSpeed Natural Language Processing (NLP) and Large Language Models (LLM) with...	21	Experimental	1	Jupyter Notebook
122	Tommaso-Sgroi/VojoLe-LM DL24-25 project. The goal is Fine-Tuning a LLM on Italian Dialect.	21	Experimental	3	Jupyter Notebook
123	Abu-Sameer-66/ChemLLM-Tox-OLMo Fine-tuning OLMo-7B with QLoRA & DeepChem for Molecular Toxicity Prediction...	21	Experimental	—	Python
124	r-kovalch/omnigec-models Reproducible QLoRA recipes and configs that fine‑tune Aya‑Expanse‑8B and...	21	Experimental	4	Jupyter Notebook
125	ph-ausseil/llm-training-dataset-builder Streamlines the creation of dataset to train a Large Language Model with...	21	Experimental	13	Python
126	AIdventures/flora Fine-tuning LLMs with LoRA	21	Experimental	1	Jupyter Notebook
127	nv-legate/multimesh-jax PjRt plugin and Python APIs for MPMD workflows in Jax	21	Experimental	8	C++
128	HEMANGANI/Fine-Tuning-LLM-for-QA Fine-Tuning Large Language Models for Question Answering	20	Experimental	8	Jupyter Notebook
129	arunpshankar/VAI-FineTuning-LLMs "Clean and comprehensive examples for fine-tuning LLMs supported by Vertex...	20	Experimental	5	Python
130	Eric-he-cn/Qwen3-QLoRA-News This project enables the model to directly generate structured summaries...	20	Experimental	6	Python
131	SauravMaheshkar/nanollm JAX LLM playground	20	Experimental	3	Jupyter Notebook
132	zufeshan12/fine-tuning-and-reinforcement-learning-on-llms supervised fine tuning and RLAIF on DeepSeek-math-7b-base using LoRA...	20	Experimental	1	Jupyter Notebook
133	neoheartbeats/neoheartbeats-kernel An architecture for LLMs' continual-learning and long-term memories	20	Experimental	6	Jupyter Notebook
134	chaithanyasai18/LLMs-finetuning This repository consists of python scripts for LLM finetuning (SFT, LoRA,...	20	Experimental	3	Python
135	priyam-hub/LLM-Fine-Tuning-Pipeline A comprehensive pipeline for Different Fine-Tuning Methods for Large...	19	Experimental	1	Python
136	Pavansomisetty21/Visual-Question-Answering-Pixtral_Vision_Finetuning_Unsloth In this we finetune Pixtral-12B-2409 model using unsloth for visual Question...	19	Experimental	4	Jupyter Notebook
137	RenaudGaudron/llm-quantisation-performance-study Code and data accompanying the article "The impact of quantising a small...	19	Experimental	2	Python
138	jwliao1209/Taiwan-LLaMa-Instruction-Tuning 2023 NTU CSIE ADL Homework 3	19	Experimental	3	Jupyter Notebook
139	OutllierRejects/Intellihack_OutlierRejects_Task3 LLM Fine-tuning Challenge Enhancing Qwen 2.5 3B for AI Research QA	19	Experimental	4	Jupyter Notebook
140	c4dt/pitfalls_in_fine_tuning_llms Jupyter notebooks for the LLM fine-tuning pitfalls hands-on workshop	19	Experimental	4	Jupyter Notebook
141	manufactai/finetuning-cookbook A collection of practical examples and tutorials for fine-tuning large...	19	Experimental	2	Jupyter Notebook
142	giankev/Ancient-to-Modern-Italian-Automatic-Translation Finetuning and evaluating LLMs on Ancient-to-Modern Italian translation task.	18	Experimental	1	Jupyter Notebook
143	atasoglu/turkish-llava-notebooks A useful collection of notebooks for quantization, fine-tuning, and...	18	Experimental	1	Jupyter Notebook
144	erraji-jo/LLM-Finutune-based-on-customData The project aims to showcase the process of fine-tuning LLMs on...	18	Experimental	1	Jupyter Notebook
145	jo-valer/machine-translation-ladin-fascian Repository of our paper Nesciun Lengaz Lascià Endò: Machine Translation for...	18	Experimental	2	Jupyter Notebook
146	monadicarts/mistral-7b-trainer Mistral 7b v0.3 LLM Model Trainer	18	Experimental	1	Python
147	garyfanhku/Galore-pytorch GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection	18	Experimental	22	Python
148	sjsayedkader/FineTuning-paris2024-olympics End-to-end LLM fine-tuning: Paris 2024 Olympics Q&A using Databricks, AWS...	17	Experimental	—	Jupyter Notebook
149	NgJaBach/Language-Models-Utilities Collect and share guidance + code snippets for running LM-related tasks.	17	Experimental	3	Python
150	rvats20/LLM-Classification-Finetuning Welcome to the LLM Classification Finetuning repository! This project...	17	Experimental	1	Jupyter Notebook
151	mehrdadalmasi2020/microsoft_MiniLM_L12_H384_uncased A library that leverages the pre-trained microsoft_MiniLM-L12-H384-uncased...	17	Experimental	1	Python
152	shizheng-rlfresh/llm-opt Fine-tuning LLMs with LoRA and Hessian-free optimizers	17	Experimental	1	Python
153	PrathamLearnsToCode/Fine-tuning-FLAN-T5-with-LoRA-WandB Fine tune an LLM for summarization task using Low rank adaptation	17	Experimental	1	Jupyter Notebook
154	slv-ai/Fine-Tune-LLMs-with-DPO Fine-tuning Microsoft’s Phi-2 Machine Learning Model with DPO	17	Experimental	1	Jupyter Notebook
155	sanskaryo/LLM-Finetuning-Projects This repository contains various projects focused on fine-tuning Large...	17	Experimental	1	Jupyter Notebook
156	clement-cvll/AIMO-Math-Finetuning Fine tuning of a model for AIMO 2 math competition on Kaggle	17	Experimental	1	Jupyter Notebook
157	sahilfaizal01/Kaggle-Contest---Fine-tuning-Llama-3.1-LLM- We used the Llama-3.1 8B (LLM) model to verify math problem solutions via...	17	Experimental	1	Jupyter Notebook
158	mfaizan-ai/NewsQA News QA generation and fine tuning an LLM for QA generation (under development)	17	Experimental	1	Jupyter Notebook
159	Rishabh9559/medical-llama-3.2-3B-model This is all about fine-tuning the Llama3.2-3B model on your medical textbook.	16	Experimental	1	Jupyter Notebook
160	dineshsoudagar/llm-lab-from-scratch-to-fine-tuning Comprehensive resources and scripts for training and fine-tuning Large...	15	Experimental	—	Jupyter Notebook
161	sparkup/medical-llm-finetuning-alignment Medical LLM fine-tuning and preference alignment using SFT and DPO, with...	15	Experimental	2	Jupyter Notebook
162	spatialft/spatialft.github.io LoRA fine-tuning of LFM2.5-1.2B to improve spatial reasoning on StepGame —...	14	Experimental	—	HTML
163	igna-s/QLoRA-Experiments A collection of SFT and distillation pipelines to train specialized medical...	14	Experimental	—	Jupyter Notebook
164	Gholamrezadar/finetuning_llm_on_letter_counting Fine-tuning Gemma-3 4B on the letter-counting dataset	14	Experimental	1	Jupyter Notebook
165	YounesBensafia/Algeria-2-0-FineTuning-workshop This repository contains resources and examples used in my workshop for...	14	Experimental	3	Jupyter Notebook
166	Pects1949/LLM-Fine-tuning-Toolkit A comprehensive toolkit for fine-tuning and deploying Large Language Models...	14	Experimental	—	—
167	Witurpred64/LLM-FineTuning-Toolkit A comprehensive toolkit for fine-tuning Large Language Models (LLMs) with...	14	Experimental	—	Python
168	di37/full-fine-tuning-nvidia-question-and-answering Flan-t5-base model was fine-tuned on Nvidia Question and Answer Pair Dataset...	14	Experimental	22	Jupyter Notebook
169	Isha1600/LLM-Finetuning Fine-tuning Large Language Models (LLMs) using custom datasets for improved...	14	Experimental	—	Jupyter Notebook
170	codershiyar/llama-google-colab-tutorial Step-by-step tutorial on loading and using Llama 3.1 8B Instruct in Google...	14	Experimental	—	Jupyter Notebook
171	aakarsh31/qlora-llm-finetuning QLoRA fine-tuning of Llama 3.2 3B on MedQA with full LoRA rank ablation...	14	Experimental	—	Jupyter Notebook
172	ahmad-albasha/Frankenstein-LLM-Model-fine-tuning-code Fine-tuning Mistral-7B-v0.1 on Mary Shelley's Frankenstein using LoRA/QLoRA...	14	Experimental	—	Jupyter Notebook
173	jinda-liu/R-LoRA This repository contains the source code and related resources for R-LoRA.	14	Experimental	20	Python
174	Gyldenn/storywriter Fine-tuning Mistral 7B with LoRA (QLoRA 4-bit) to generate Shakespearean...	14	Experimental	—	Jupyter Notebook
175	gazelle93/llm-fine-tuning-sft-lora-qlora Practical examples for fine-tuning large language models (LLMs) with SFT,...	13	Experimental	5	Python
176	alinourian/Fine-tuning-Mistral-7b-QA Fine tuning Mistral-7b with PEFT(Parameter Efficient Fine-Tuning) and...	13	Experimental	13	Jupyter Notebook
177	alvi75/MultiTask-QLoRA-NFAnalysis Official implementation of "Parameter-Efficient Multi-Task Fine-Tuning in...	13	Experimental	—	R
178	Akarsh1/Exploring-Unsloth-Library-for-Fine-Tuning This is a sample notebook that can be used for exploring the fine-tuning of...	13	Experimental	—	Jupyter Notebook
179	ayushtiwari134/llm_fine_tuning This model is fine-tuned to respond like Michael Gary Scott, Regional...	13	Experimental	—	Jupyter Notebook
180	FlorinAndrei/llm-social-media-cheap LLMs fine-tuned with social media comments on cheap hardware	13	Experimental	5	Jupyter Notebook
181	MSWagner/qwen-lora-grpo-letter-counting Fine-tuning Qwen2.5-3B-Instruct model with LoRa (Low-Rank Adaptation) and...	13	Experimental	—	Jupyter Notebook
182	Yousefbadr0/GPT-Neo_Medical_Fine-Tuning_using_LoRA Fine-tuning GPT-Neo-125M using LoRA on a medical QA dataset, achieving...	13	Experimental	—	Jupyter Notebook
183	AparnaRoy76/LLM-finetuning A comprehensive toolkit for fine-tuning Large Language Models (LLMs) using...	13	Experimental	—	Jupyter Notebook
184	nglguarino/code-completion Fine-tuned 3 LLMs (Phi-2, Gemma, Llama2) on 100K+ instruction CodeInstruct...	13	Experimental	—	Jupyter Notebook
185	Sahar-Sheikhi/CRM-Data-Automation-Llama-3.2-Finetuned- A memory-efficient fine-tuning pipeline using Llama-3.2-3B and QLoRA to...	13	Experimental	—	Jupyter Notebook
186	adityanaranje/FineTune-LLM Fine-tuned a pretrained language model using Unsloth to specialize domain...	12	Experimental	1	Jupyter Notebook
187	Utshav-paudel/Finetuning-Mistral7B-on-google-colab Finetuning Mistral 7B on google colab	12	Experimental	5	Jupyter Notebook
188	AbdulSametTurkmenoglu/unsloth_llama_news Llama 2 7B - Turkish News Dataset Fine-Tuning	12	Experimental	1	Jupyter Notebook
189	Atomheart-Father/LoRA-SFT-vs-LoRA-DPO-A-Comparative-Study-of-Small-Factual-Updates-in-LLMs This paper studies small factual updates: updates that preserve the subject...	12	Experimental	1	Jupyter Notebook
190	AvinashBolleddula/Domain-Adaptive-LLM-Fine-Tuning-for-Enterprise-Policy-QA Production-grade pipeline for domain-adaptive fine-tuning of a small LLM...	12	Experimental	1	Python
191	mltrev23/flan-t5-fine-tune Flan-t5 model fine tune LoRA and Langchain	12	Experimental	8	Python
192	luochang212/sft-note Three ways to implement supervised fine-tuning (SFT): LLaMA Factory, trl, and unsloth	12	Experimental	4	Jupyter Notebook
193	chatterjeesaurabh/Dialogue-Summarization-with-Large-Language-Model Explored In-Context prompt learning, Full Fine-Tuning, Parameter-Efficient...	11	Experimental	—	Jupyter Notebook
194	jistiak/finetune-gpt-deepspeed Sample codes and guidelines on how to finetune any opensource GPT models...	11	Experimental	—	—
195	Muneeb1030/FineTune-Tiny-Llama Fine-tuning the Tiny Llama model to mimic my professor's writing style using...	11	Experimental	3	Jupyter Notebook
196	serkanars/llm-fine-tuning-with-lora Fine-tuning the Mistral-7b-v0.1 model for a specific task using the LoRA approach	11	Experimental	4	Jupyter Notebook
197	tensor-fusion/sophia-jax JAX implementation of 'Sophia: A Scalable Stochastic Second-order Optimizer...	11	Experimental	3	Python
198	Shreyash-Gaur/TensorFlow_Python_Code_Generation Fine-tuning CodeT5 for Python code generation on the MBPP dataset. Features...	11	Experimental	—	Jupyter Notebook
199	pracheeeeez/Fine_tuning_Llama2 This project focuses on fine-tuning the powerful Llama2 language model and...	11	Experimental	—	Jupyter Notebook
200	Holy-Morphism/VLM Fine-Tuning a Generative VLM for Image Describing	11	Experimental	2	Jupyter Notebook
201	leodeveloper/phi3-vision-multimodel Microsoft Phi-3 Vision-the first Multimodal model By Microsoft- Demo With Huggingface	11	Experimental	—	—
202	ako1983/Llama2-finetuned-mindsdb Llama2 7-b-hf Fine-tuned on MindsDB Docs	11	Experimental	—	Jupyter Notebook
203	aloobun/llama2-7b-openhermes-15k A 4-bit qlora refinement of llama-v2-guanaco, fine tuned on the 15k rows of...	11	Experimental	—	Jupyter Notebook
204	thatomaelane/Building-a-Domain-Expert-Model This project aims to fine-tune the Meta Llama 2 7B foundation model to...	10	Experimental	1	Jupyter Notebook
205	mayur-kun/finetuning-llama2-7b-chat This repository demonstrates fine-tuning an Large Language Model (LLM) on...	10	Experimental	2	Jupyter Notebook
206	NavodPeiris/Vulnerability-Analyst-Qwen2.5-1.5B-Instruct Fine-tune Qwen2.5-1.5B-Instruct model for code vulnerability analysis	10	Experimental	1	Jupyter Notebook
207	Siddhesh19991/Llama-3-8B-Fine-tune This project demonstrates how to Fine-Tune Llama-3-8B model on medical data...	10	Experimental	2	Jupyter Notebook
208	Thiraput01/QwenMed Qwen3 fine-tuned on medical datasets with reasoning data	10	Experimental	1	Jupyter Notebook
209	AbdulHadi806/LLM_fune_tuning_Hackathon In the recent competition, we were challenged to finetune a model that can...	10	Experimental	2	Jupyter Notebook
210	khaouitiabdelhakim/llm_fine_tuning Fine-tuning essentially involves taking a pre-trained LLM, already equipped...	10	Experimental	2	Jupyter Notebook
211	ahmadalsharef994/Langchain_LlamaCPP_Mistral_7B_Fine_Tuning_Example A comprehensive example of fine-tuning Mistral 7B models with Langchain and...	10	Experimental	2	Jupyter Notebook
212	Pragateeshwaran/LoRA-From-Scratch This project implements a Low-Rank Adaptation (LoRA) technique from scratch...	10	Experimental	2	Jupyter Notebook
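The Tier column in the table above appears to follow the Score column. As a rough sketch, the mapping can be written as a small Python function. The cut-offs are an assumption inferred from the listed rows (52 and above are Established, 30 to 49 are Emerging, 29 and below are Experimental) and from the summary's "above 50"; the page does not state the official thresholds, and no row shows how a score of exactly 50 or 51 is treated. The function name `tier_for_score` is hypothetical.

```python
def tier_for_score(score: int) -> str:
    """Map a 0-100 quality score to a tier label.

    Assumed boundaries, inferred from the table rows rather than
    from any documented rule: >50 Established, 30-50 Emerging,
    <30 Experimental.
    """
    if score > 50:
        return "Established"
    if score >= 30:
        return "Emerging"
    return "Experimental"


# Boundary rows copied from the table: (project, score, listed tier).
SAMPLE_ROWS = [
    ("OptimalScale/LMFlow", 59, "Established"),
    ("young-geng/scalax", 52, "Established"),
    ("riyanshibohra/TuneKit", 49, "Emerging"),
    ("DianaDorobantu/legal-llm", 30, "Emerging"),
    ("francoislanc/midistral", 29, "Experimental"),
    ("Pragateeshwaran/LoRA-From-Scratch", 10, "Experimental"),
]

if __name__ == "__main__":
    for name, score, listed in SAMPLE_ROWS:
        print(name, score, tier_for_score(score), listed)
```

The sample rows are the ones sitting at each tier boundary in the table, so they are the most useful cases for checking the inferred cut-offs against the listing.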