yangjianxin1/Firefly

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

/ 100

Emerging

This project helps AI practitioners fine-tune large language models (LLMs) for specific tasks. You can take existing popular LLMs and enhance them using your own data, yielding custom models tailored to your needs. This is for AI engineers or researchers who want to adapt off-the-shelf LLMs to perform specialized functions or improve their performance on particular datasets.

6,644 stars. No commits in the last 6 months.

Use this if you need to customize a large language model with your own dataset for better performance or specific tasks, especially with limited GPU resources.

Not ideal if you're looking for an out-of-the-box, pre-trained LLM without any customization or fine-tuning requirements.

LLM-customization natural-language-processing AI-model-training computational-linguistics generative-AI

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 19 / 25

How are scores calculated?

Stars

6,644

Forks

589

Language

Python

License

—

Higher-rated alternatives

shibing624/MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline....

lyogavin/airllm

AirLLM 70B inference with single 4GB GPU

GradientHQ/parallax

Parallax is a distributed model serving framework that lets you build your own AI cluster anywhere

CrazyBoyM/llama3-Chinese-chat

Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。

CLUEbenchmark/CLUE

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained...

Explore Transformer Models

All categories Trending Transformer directory Insights