mallorbc/Finetune_LLMs
Repo for fine-tuning Casual LLMs
This project helps AI developers fine-tune large language models (LLMs) to perform specific tasks or generate text in a particular style. You provide a pre-trained LLM and a custom dataset, and it outputs a more specialized LLM. This is for machine learning engineers, data scientists, and AI researchers who need to adapt existing LLMs for unique applications.
458 stars. No commits in the last 6 months.
Use this if you are an AI developer looking to specialize a large language model using methods like DeepSpeed, LoRA, or QLoRA.
Not ideal if you are a non-technical user who wants to use an LLM without coding or managing infrastructure.
Stars
458
Forks
86
Language
Python
License
AGPL-3.0
Category
Last pushed
Mar 27, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/mallorbc/Finetune_LLMs"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
unslothai/unsloth
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama,...
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
modelscope/ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5,...
oumi-ai/oumi
Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training