git-cloner/llama-lora-fine-tuning
llama fine-tuning with lora
This project helps AI engineers and researchers customize large language models (LLMs) like Llama for specific conversational tasks. You provide a Llama-7b base model and your own conversation data (e.g., ShareGPT), and it produces a fine-tuned model capable of multi-round dialogues. It's designed for individuals or teams who need to adapt an existing LLM to a unique dataset or dialogue style.
140 stars. No commits in the last 6 months.
Use this if you need to fine-tune a Llama-7b model for multi-round conversations on a single GPU with 16GB of memory or more.
Not ideal if you don't have access to a GPU or prefer to fine-tune smaller models with lower computational requirements.
Stars
140
Forks
16
Language
Python
License
MIT
Category
Last pushed
May 08, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/git-cloner/llama-lora-fine-tuning"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
unslothai/unsloth
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama,...
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
modelscope/ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5,...
oumi-ai/oumi
Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training