FareedKhan-dev/train-deepseek-r1
Building DeepSeek R1 from Scratch
This project helps AI researchers and machine learning engineers understand and replicate the training process for DeepSeek R1, a reasoning-focused large language model. It takes a smaller base language model and reasoning-centric datasets as input, and outputs a fine-tuned LLM with enhanced problem-solving capabilities. It's designed for those who want to explore and implement advanced reinforcement learning techniques for improving LLM reasoning.
749 stars. No commits in the last 6 months.
Use this if you are an AI researcher or machine learning engineer looking to implement and understand the detailed training methodology of reasoning-oriented large language models, particularly DeepSeek R1.
Not ideal if you are an end-user simply looking to use an existing powerful language model for tasks, rather than building or fine-tuning one yourself.
Stars: 749
Forks: 121
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: Mar 21, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/FareedKhan-dev/train-deepseek-r1"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
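The same request can be made from a script. The following is a minimal Python sketch using only the standard library. It assumes the path layout `/<category>/<owner>/<repo>`, which is inferred from the single URL shown above and is not documented here. The names `quality_url` and `fetch_quality` are hypothetical helpers. The response format is not stated on this page, so the body is returned as raw text rather than parsed.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(owner, repo, category="llm-tools"):
    """Build the endpoint URL for one repository.

    Assumption: the path is <category>/<owner>/<repo>, as in the
    single example URL on this page.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(owner, repo, category="llm-tools", timeout=10.0):
    """Fetch the endpoint and return the response body as text.

    The response format is not documented here, so no parsing is done.
    """
    with urlopen(quality_url(owner, repo, category), timeout=timeout) as resp:
        return resp.read().decode("utf-8")
```

Calling `fetch_quality("FareedKhan-dev", "train-deepseek-r1")` issues the same GET request as the curl command. Each call counts against the daily request limit.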
Higher-rated alternatives
LLM-Red-Team/metaso-free-api
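🚀 Reverse-engineered API for Metaso AI Search (strengths: powerful retrieval and very long output). Supports high-speed streaming output, powerful web search (whole-web or academic, with concise, in-depth, and research modes), zero-configuration deployment, and multiple tokens. For testing only; for commercial use, please go to the official open platform.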
MaxiDonkey/DelphiDeepseek
The Deepseek API wrapper for Delphi leverages Deepseek’s advanced models to deliver powerful...
LLM-Red-Team/deepseek-free-api
🚀 DeepSeek-V3 &...
deepseek-php/deepseek-laravel
Laravel wrapper for the Deepseek PHP client, for seamless Deepseek API integration with Laravel applications.
xiaoY233/DeepSeek-Free-API
🚀 DeepSeek-V3.2 &...