yangjianxin1/Firefly
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
This project helps AI practitioners fine-tune large language models (LLMs) for specific tasks. You can take existing popular LLMs and enhance them using your own data, yielding custom models tailored to your needs. This is for AI engineers or researchers who want to adapt off-the-shelf LLMs to perform specialized functions or improve their performance on particular datasets.
6,644 stars. No commits in the last 6 months.
Use this if you need to customize a large language model with your own dataset for better performance or specific tasks, especially with limited GPU resources.
Not ideal if you're looking for an out-of-the-box, pre-trained LLM without any customization or fine-tuning requirements.
Stars
6,644
Forks
589
Language
Python
License
—
Category
Last pushed
Oct 24, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/yangjianxin1/Firefly"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline....
lyogavin/airllm
AirLLM 70B inference with single 4GB GPU
GradientHQ/parallax
Parallax is a distributed model serving framework that lets you build your own AI cluster anywhere
CrazyBoyM/llama3-Chinese-chat
Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。
CLUEbenchmark/CLUE
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained...