FareedKhan-dev/train-deepseek-r1

Building DeepSeek R1 from Scratch

/ 100

Emerging

This project helps AI researchers and machine learning engineers understand and replicate the training process for DeepSeek R1, a reasoning-focused large language model. It takes a smaller base language model and reasoning-centric datasets as input, and outputs a fine-tuned LLM with enhanced problem-solving capabilities. It's designed for those who want to explore and implement advanced reinforcement learning techniques for improving LLM reasoning.

749 stars. No commits in the last 6 months.

Use this if you are an AI researcher or machine learning engineer looking to implement and understand the detailed training methodology of reasoning-oriented large language models, particularly DeepSeek R1.

Not ideal if you are an end-user simply looking to use an existing powerful language model for tasks, rather than building or fine-tuning one yourself.

large-language-models reinforcement-learning AI-research LLM-fine-tuning reasoning-AI

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 22 / 25

How are scores calculated?

Stars

749

Forks

121

Language

Jupyter Notebook

License

MIT

Higher-rated alternatives

LLM-Red-Team/metaso-free-api

🚀 秘塔AI搜索逆向API【特长：超强检索超长输出】，支持高速流式输出、超强联网搜索（全网or学术以及简洁、深入、研究三种模式），零配置部署，多路token支持，仅供测试，如需商用请前往官方开放平台。

MaxiDonkey/DelphiDeepseek

The Deepseek API wrapper for Delphi leverages Deepseek’s advanced models to deliver powerful...

LLM-Red-Team/deepseek-free-api

🚀 DeepSeek-V3 &...

deepseek-php/deepseek-laravel

Laravel wrapper for Deepseek PHP client, to seamless deepseek API integration with laravel applications.

xiaoY233/DeepSeek-Free-API

🚀 DeepSeek-V3.2 &...

Explore LLM Tools

All categories Trending LLM Tool directory Insights