Reinforcement Learning Frameworks

Complete RL algorithm implementations and educational resources for training agents using policy gradient, Q-learning, actor-critic, and other methods. Does NOT include game-playing agents, robotics simulators, or domain-specific RL applications—only the core algorithmic frameworks and tutorials.

There are 304 reinforcement learning frameworks tracked. 3 score above 70 (verified tier). The highest-rated is DLR-RM/stable-baselines3 at 76/100 with 12,878 stars. 6 of the top 10 are actively maintained.

Get all 304 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=reinforcement-learning-frameworks&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Framework	Score	Tier	Stars	Language
1	DLR-RM/stable-baselines3 PyTorch version of Stable Baselines, reliable implementations of...	76	Verified	12,878	Python
2	google-deepmind/dm_control Google DeepMind's software stack for physics-based simulation and...	73	Verified	4,494	Python
3	Denys88/rl_games RL implementations	71	Verified	1,310	Jupyter Notebook
4	pytorch/rl A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.	68	Established	3,335	Python
5	yandexdataschool/Practical_RL A course in reinforcement learning in the wild	64	Established	6,460	Jupyter Notebook
6	Stable-Baselines-Team/stable-baselines3-contrib Contrib package for Stable-Baselines3 - Experimental reinforcement learning...	64	Established	693	Python
7	flatland-association/flatland-rl The Flatland Framework is a multi-purpose environment to tackle problems...	61	Established	58	Jupyter Notebook
8	keras-rl/keras-rl Deep Reinforcement Learning for Keras.	60	Established	5,554	Python
9	MushroomRL/mushroom-rl Python library for Reinforcement Learning.	60	Established	921	Python
10	takuseno/d3rlpy An offline deep reinforcement learning library	60	Established	1,644	Python
11	lucidrains/streaming-deep-rl Explorations into the proposed Streaming Deep Reinforcement Learning, from...	58	Established	24	Python
12	MyoHub/myosuite MyoSuite is a collection of environments/tasks to be solved by...	58	Established	1,116	Python
13	google-research/batch-ppo Efficient Batched Reinforcement Learning in TensorFlow	58	Established	975	Python
14	qzed/irl-maxent Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning...	57	Established	312	Jupyter Notebook
15	tensorlayer/RLzoo A Comprehensive Reinforcement Learning Zoo for Simple Usage 🚀	57	Established	644	Python
16	PKU-Alignment/omnisafe JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.	57	Established	1,077	Python
17	upb-lea/reinforcement_learning_course_materials Lecture notes, tutorial tasks including solutions as well as online videos...	57	Established	1,017	Jupyter Notebook
18	inoryy/reaver Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft...	56	Established	562	Python
19	huggingface/deep-rl-class This repo contains the Hugging Face Deep Reinforcement Learning Course.	55	Established	4,803	MDX
20	rl-tools/rl-tools The Fastest Deep Reinforcement Learning Library	54	Established	933	C++
21	TuragaLab/flybody MuJoCo fruit fly body model and locomotion RL tasks	54	Established	503	Python
22	SforAiDl/genrl A PyTorch reinforcement learning library for generalizable and reproducible...	54	Established	412	Python
23	danijar/mindpark Testbed for deep reinforcement learning	54	Established	162	Python
24	iffiX/machin Reinforcement learning library(framework) designed for PyTorch, implements...	53	Established	419	Python
25	fracapuano/robot-learning-tutorial All the source code for "Robot Learning: A Tutorial". Get involved to be...	53	Established	477	TeX
26	danaugrs/huskarl Deep Reinforcement Learning Framework + Algorithms	53	Established	415	Python
27	AdamStelmaszczyk/learning2run Our NIPS 2017: Learning to Run source code	52	Established	55	Python
28	icoxfog417/baby-steps-of-rl-ja Pythonで学ぶ強化学習 -入門から実践まで- サンプルコード	51	Established	452	Jupyter Notebook
29	keon/deep-q-learning Minimal Deep Q Learning (DQN & DDQN) implementations in Keras	51	Established	1,316	Python
30	danijar/embodied Fast reinforcement learning research	51	Established	61	Python
31	MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning This is the homepage of a new book entitled "Mathematical Foundations of...	51	Established	14,922	MATLAB
32	ikostrikov/pytorch-a3c PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from...	51	Established	1,317	Python
33	ikostrikov/pytorch-a2c-ppo-acktr-gail PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy...	51	Established	3,879	Python
34	ankonzoid/LearningX Deep & Classical Reinforcement Learning + Machine Learning Examples in Python	51	Established	370	Python
35	sweetice/Deep-reinforcement-learning-with-pytorch PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO,...	51	Established	4,589	Python
36	vmayoral/basic_reinforcement_learning An introductory series to Reinforcement Learning (RL) with comprehensive...	51	Established	1,213	Jupyter Notebook
37	mimoralea/gdrl Grokking Deep Reinforcement Learning	51	Established	1,005	Jupyter Notebook
38	stanfordnmbl/osim-rl Reinforcement learning environments with musculoskeletal models	51	Established	944	Python
39	heronsystems/adeptRL Reinforcement learning framework to accelerate research	51	Established	206	Python
40	rlcode/reinforcement-learning Minimal and Clean Reinforcement Learning Examples	51	Established	3,621	Python
41	ShangtongZhang/reinforcement-learning-an-introduction Python Implementation of Reinforcement Learning: An Introduction	51	Established	14,587	Python
42	MorvanZhou/Reinforcement-learning-with-tensorflow Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学	51	Established	9,435	Python
43	dalmia/David-Silver-Reinforcement-learning Notes for the Reinforcement Learning course by David Silver along with...	51	Established	849	Jupyter Notebook
44	miyosuda/async_deep_reinforce Asynchronous Methods for Deep Reinforcement Learning	51	Established	591	Python
45	RLE-Foundation/rllte Long-Term Evolution Project of Reinforcement Learning	51	Established	475	Python
46	pathak22/noreward-rl [ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep...	51	Established	1,471	Python
47	archsyscall/DeepRL-TensorFlow2 🐋 Simple implementations of various popular Deep Reinforcement Learning...	51	Established	606	Python
48	rail-berkeley/softlearning Softlearning is a reinforcement learning framework for training maximum...	50	Established	1,413	Python
49	Kaixhin/Rainbow Rainbow: Combining Improvements in Deep Reinforcement Learning	50	Established	1,661	Python
50	jingweiz/pytorch-rl Deep Reinforcement Learning with pytorch & visdom	50	Established	804	Python
51	nikhilbarhate99/PPO-PyTorch Minimal implementation of clipped objective Proximal Policy Optimization...	50	Established	2,320	Python
52	mimoralea/applied-reinforcement-learning Reinforcement Learning and Decision Making tutorials explained at an...	50	Established	331	Jupyter Notebook
53	Fraunhofer-IIS/fmugym Interface to connect Reinforcement Learning libraries to Functional Mock-up...	50	Established	29	Python
54	pat-coady/trpo Trust Region Policy Optimization with TensorFlow and OpenAI Gym	50	Established	361	Jupyter Notebook
55	udacity/reinforcement-learning Reinforcement learning material, code and exercises for Udacity Nanodegree programs.	50	Established	89	Jupyter Notebook
56	ikostrikov/pytorch-trpo PyTorch implementation of Trust Region Policy Optimization	50	Established	450	Python
57	opendilab/DI-engine-docs DI-engine docs (Chinese and English)	49	Emerging	321	Python
58	nrontsis/PILCO Bayesian Reinforcement Learning in Tensorflow	49	Emerging	335	Python
59	seungeunrho/minimalRL Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)	49	Emerging	3,164	Python
60	denisyarats/pytorch_sac PyTorch implementation of Soft Actor-Critic (SAC)	49	Emerging	591	Jupyter Notebook
61	rmst/ddpg TensorFlow implementation of the DDPG algorithm from the paper Continuous...	49	Emerging	215	Jupyter Notebook
62	Stable-Baselines-Team/stable-baselines Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of...	48	Emerging	307	Python
63	andri27-ts/Reinforcement-Learning Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python....	48	Emerging	4,696	Jupyter Notebook
64	lucidrains/metacontroller Implementation of the MetaController proposed in "Emergent temporal...	48	Emerging	93	Jupyter Notebook
65	binary-husky/hmp2g Multiagent Reinforcement Learning Research Project	48	Emerging	228	Python
66	vwxyzjn/cleanrl High-quality single file implementation of Deep Reinforcement Learning...	48	Emerging	9,286	Python
67	alessiodm/drl-zh Deep Reinforcement Learning: Zero to Hero!	48	Emerging	2,265	Jupyter Notebook
68	rl-language/rlc Bringing reinforcement learning to every day programmers	48	Emerging	62	C++
69	zyxue/sutton-barto-rl-exercises 📖Learning reinforcement learning by implementing the algorithms from...	47	Emerging	84	Jupyter Notebook
70	ericyangyu/PPO-for-Beginners A simple and well styled PPO implementation. Based on my Medium series:...	47	Emerging	1,219	Python
71	gordicaleksa/pytorch-learn-reinforcement-learning A collection of various RL algorithms like policy gradients, DQN and PPO....	47	Emerging	161	Python
72	HewlettPackard/dc-rl SustainDC is a set of Python environments for Data Center simulation and...	47	Emerging	95	HTML
73	Cloudslab/DLSF [TMC'20] Deep Learning based Scheduler for Stochastic Fog-Cloud computing...	47	Emerging	126	Java
74	CarperAI/trlx A repo for distributed training of language models with Reinforcement...	46	Emerging	4,738	Python
75	TianhongDai/reinforcement-learning-algorithms This repository contains most of pytorch implementation based classic deep...	46	Emerging	693	Python
76	SuhailSama/MR_RL Gym Simulator for Magnetic Micro Robots	46	Emerging	6	Python
77	godka/Pensieve-PPO The simplest implementation of Pensieve (SIGCOMM' 17) via state-of-the-art...	46	Emerging	87	DIGITAL Command Language
78	medipixel/rl_algorithms Structural implementation of RL key algorithms	45	Emerging	516	Python
79	Anjum48/rl-examples Examples of published reinforcement learning algorithms in recent literature...	45	Emerging	103	Python
80	google-deepmind/dm_env A Python interface for reinforcement learning environments	45	Emerging	394	Python
81	IBM/LOA Neuro-Symbolic Reinforcement Learning: Logical Optimal Action (LOA), a novel...	45	Emerging	56	Python
82	yihaosun1124/OfflineRL-Kit An elegant PyTorch offline reinforcement learning library for researchers.	45	Emerging	384	Python
83	gabrielhuang/reptile-pytorch A PyTorch implementation of OpenAI's REPTILE algorithm	45	Emerging	220	Jupyter Notebook
84	denisyarats/drq DrQ: Data regularized Q	44	Emerging	419	Jupyter Notebook
85	huangwl18/modular-rl [ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular...	44	Emerging	232	Jupyter Notebook
86	DeNA/HandyRL HandyRL is a handy and simple framework based on Python and PyTorch for...	44	Emerging	304	Python
87	mahyaret/kuka_rl Reinforcement Learning Experiments using PyBullet	44	Emerging	136	Jupyter Notebook
88	sudharsan13296/Deep-Reinforcement-Learning-With-Python Master classic RL, deep RL, distributional RL, inverse RL, and more using...	43	Emerging	464	Jupyter Notebook
89	rlgraph/rlgraph RLgraph: Modular computation graphs for deep reinforcement learning	43	Emerging	323	Python
90	simoninithomas/Deep_reinforcement_learning_Course Implementations from the free course Deep Reinforcement Learning with...	43	Emerging	3,904	Jupyter Notebook
91	mohmdelsayed/streaming-drl Deep reinforcement learning without experience replay, target networks, or...	43	Emerging	279	Python
92	omerbsezer/Reinforcement_learning_tutorial_with_demo Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration),...	43	Emerging	790	Jupyter Notebook
93	yrlu/irl-imitation Implementation of Inverse Reinforcement Learning (IRL) algorithms in...	43	Emerging	667	Python
94	UoA-CARES/cares_reinforcement_learning CARES Reinforcement Learning Package	42	Emerging	39	Python
95	Learning4Optimization-HUST/H-TSP Official implementation of H-TSP (AAAI2023)	42	Emerging	56	Python
96	Bellman-devs/bellman Model-based reinforcement learning in TensorFlow	42	Emerging	56	Python
97	sebastianbrzustowicz/Robot-Sumo-RL Python + PyTorch. Advanced Reinforcement Learning (SAC/PPO/A2C) for...	42	Emerging	14	Python
98	andrewliao11/Deep-Reinforcement-Learning-Survey My Exploration on Deep Reinforcement Learning Survey	42	Emerging	435	—
99	adrianwix/pybasin pyBasin is a Python library for estimating basin stability in dynamical...	42	Emerging	4	Python
100	dvalenciar/ReinforceUI-Studio ReinforceUI-Studio. A Python-based application designed to simplify the...	42	Emerging	76	Python
101	denisyarats/proto Proto-RL: Reinforcement Learning with Prototypical Representations	41	Emerging	86	Python
102	MaartenGr/ReinLife Creating Artificial Life with Reinforcement Learning	41	Emerging	84	Python
103	Kaixhin/imitation-learning Imitation learning algorithms	41	Emerging	562	Python
104	MarcoMeter/recurrent-ppo-truncated-bptt Baseline implementation of recurrent PPO using truncated BPTT	41	Emerging	160	Jupyter Notebook
105	khushhallchandra/pytorch-rl Pytorch Implementation of RL algorithms	41	Emerging	15	Python
106	tayalmanan28/Safe_Reinforcement_Learning Repository containing the code for safe reinforcement learning in two custom...	41	Emerging	46	Python
107	thanhkaist/CCFDM1 CCFDM reinforcement learning	41	Emerging	40	Python
108	XinJingHao/DRL-Pytorch Clean, Robust, and Unified PyTorch implementation of popular Deep...	41	Emerging	3,306	Python
109	airboxlab/rllib-energyplus Simple EnergyPlus environments for control optimization using reinforcement learning	41	Emerging	55	Python
110	NatLabRockies/graph-env Reinforcement learning for combinatorial optimization over directed graphs	41	Emerging	43	Python
111	rmst/rlrd PyTorch implementation of our paper Reinforcement Learning with Random...	41	Emerging	42	Python
112	mitre/ilpyt ilpyt: imitation learning library with modular, baseline implementations in Pytorch	40	Emerging	18	Python
113	ItoMasaki/PixyzRL A Bayesian RL Framework with Probabilistic Generative Models	40	Emerging	10	Python
114	whoiszyc/IntelliHealer IntelliHealer: An imitation and reinforcement learning platform for...	40	Emerging	32	Python
115	tirthajyoti/RL_basics Basic Reinforcement Learning algorithms	40	Emerging	19	Jupyter Notebook
116	rllab-snu/Deep-Reinforcement-Learning Introduction to Deep Reinforcement Learning	40	Emerging	88	Jupyter Notebook
117	TheoLvs/reinforcement-learning Personal experiments on Reinforcement Learning	40	Emerging	119	Jupyter Notebook
118	MishaLaskin/rad RAD: Reinforcement Learning with Augmented Data	40	Emerging	416	Jupyter Notebook
119	zuoxingdong/lagom lagom: A PyTorch infrastructure for rapid prototyping of reinforcement...	40	Emerging	378	Jupyter Notebook
120	antonpuz/DeROL Deep Reinforcement One-Shot Learning Framework for Artificially Intelligent...	40	Emerging	36	Python
121	araffin/rl-handson-rlvs21 Stable-Baselines3 (SB3) reinforcement learning tutorial for the...	40	Emerging	58	Jupyter Notebook
122	nsidn98/NICE Combining Reinforcement Learning with Integer Programming for Robust Scheduling	40	Emerging	30	Python
123	EsratMaria/Reinforcement-Learning_for_Energy_Minimization_Using_CLoudsim Implementation of RL in the cloud for energy minimization due to migration...	39	Emerging	30	HTML
124	RLE-Foundation/RLeXplore RLeXplore provides stable baselines of exploration methods in reinforcement...	39	Emerging	459	Jupyter Notebook
125	Zhenye-Na/advanced-deep-learning-and-reinforcement-learning-deepmind 🎮 Advanced Deep Learning and Reinforcement Learning at UCL & DeepMind \|...	39	Emerging	158	Jupyter Notebook
126	luisgarciar/3D-bin-packing Solving the 3D bin packing problem with reinforcement learning	39	Emerging	61	Jupyter Notebook
127	UlisseMini/procgen-tools Tools for running experiments on RL agents in procgen environments	39	Emerging	20	Jupyter Notebook
128	921kiyo/symbolic-rl Symbolic Reinforcement Learning using Inductive Logic Programming	39	Emerging	63	Lasso
129	YuvrajSingh-mist/NeatRL Repository of implementations of classic and sota rl algorithms from scratch...	38	Emerging	221	Python
130	appgym/appgym Mobile Apps (Android) as Environment for Reinforcement Learning Agents	37	Emerging	10	Jupyter Notebook
131	dalmia/udacity-deep-reinforcement-learning My solutions to the projects (and mini-projects) of the Deep Reinforcement...	37	Emerging	63	Jupyter Notebook
132	takuseno/minerva An out-of-the-box GUI tool for offline deep reinforcement learning	37	Emerging	102	JavaScript
133	astier/model-free-episodic-control Model-Free-Episodic-Control implementation.	37	Emerging	17	Python
134	chengxi600/RLStuff A collection of reinforcement learning algorithm implementations	37	Emerging	64	Jupyter Notebook
135	anassinator/pddp WIP implementation of Probabilistic Differential Dynamic Programming in PyTorch	37	Emerging	16	Jupyter Notebook
136	AdamStelmaszczyk/rl-tutorial Source code for "A deep dive into reinforcement learning"	37	Emerging	13	Python
137	VachanVY/Reinforcement-Learning PyTorch implementations of algorithms from "Reinforcement Learning: An...	37	Emerging	204	Python
138	LAMDA-RL/ODIS The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent...	37	Emerging	46	Python
139	saqib1707/RL-PPO-PyTorch Simple and Modular implementation of Proximal Policy Optimization (PPO) in PyTorch	36	Emerging	13	Python
140	shehio/rl Implementing RL agents, one algorithm at a time	36	Emerging	9	Python
141	BY571/CQL PyTorch implementation of the Offline Reinforcement Learning algorithm CQL....	36	Emerging	148	Python
142	kochlisGit/Shadow-Hand-Controller Construction of controllers for Shadow-Hand in Mujoco environment, using...	36	Emerging	22	Python
143	denisyarats/exorl ExORL: Exploratory Data for Offline Reinforcement Learning	36	Emerging	129	Python
144	navneet-nmk/pytorch-rl This repository contains model-free deep reinforcement learning algorithms...	36	Emerging	452	Python
145	BNN-UPC/ENERO Code used in the paper "ENERO: Efficient real-time WAN routing optimization...	36	Emerging	33	Python
146	jayLEE0301/dhrl_official Official code for "DHRL: A Graph-Based Approach for Long-Horizon and Sparse...	35	Emerging	34	Python
147	asystemoffields/disco-torch A PyTorch port of DeepMind's Disco103 — the meta-learned reinforcement...	35	Emerging	9	Python
148	sdpkjc/abcdrl Modular Single-file Reinfocement Learning Algorithms Library	35	Emerging	38	Python
149	schmidtdominik/Rainbow Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient...	34	Emerging	44	Python
150	VladGavra98/SERL Safety-informed Evolutionary Reinforcement Learning applied to...	34	Emerging	10	Python
151	affaan-m/Behavioral_RL Reinforcement Learning with human behavioral biases integration	34	Emerging	12	HTML
152	Shaswat2001/maple-robotics MAPLE (Model and Policy Learning Evaluation) - A unified CLI daemon for...	34	Emerging	7	Python
153	opium-sh/prl Open-source library for a reinforcement learning research.	34	Emerging	54	Python
154	reward-scope-ai/reward-scope Real-time reward debugging and hacking detection for reinforcement learning	34	Emerging	18	Python
155	navneet-nmk/Pytorch-RL-CPP A Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch)	34	Emerging	101	C++
156	jimimvp/torch_rl Reinforcement learning library for PyTorch.	33	Emerging	11	Python
157	NYU-MLDA/ABC-RL This is work-in-progress (WIP) refactored implementation of...	33	Emerging	8	Verilog
158	AdamStelmaszczyk/dqn TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)	33	Emerging	40	Python
159	nnaisense/pgpelib A mini library for Policy Gradients with Parameter-based Exploration, with...	33	Emerging	73	Python
160	Asap7772/PTR This repository contains the implementation of the PTR algorithm described...	32	Emerging	32	Python
161	lucaslingle/pytorch_rl2 Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'	32	Emerging	72	Python
162	cgel/DRL A collection of Deep Reinforcement Learning algorithms implemented in...	32	Emerging	29	Python
163	CLAIRE-Labo/no-representation-no-trust Codebase to fully reproduce the results of "No Representation, No Trust:...	32	Emerging	31	Python
164	ialexmp/DRL-Generalization Exploring Generalization in Deep Reinforcement Learning algorithms for...	32	Emerging	5	Python
165	matthieu637/ddrl Deep Developmental Reinforcement Learning	32	Emerging	29	C++
166	haron1100/Upside-Down-Reinforcement-Learning Implementation of Schmidhuber's Upside Down Reinforcement Learning paper in PyTorch	32	Emerging	27	Jupyter Notebook
167	FabioMiguel2000/LOA-feat.Reinforcement-Learning Assigment 2 for Course L.EIC029 Artificial Intelligence, FEUP LEIC 3rd Year...	32	Emerging	3	Python
168	linker81/Reinforcement-Learning-CheatSheet Cheatsheet of Reinforcement Learning (Based on Sutton-Barto Book - 2nd Edition)	31	Emerging	59	TeX
169	harshaljanjani/taskschedulingdqn Designing energy-aware scheduling and task allocation algorithms for online...	31	Emerging	11	Jupyter Notebook
170	mbchang/decentralized-rl Decentralized Reinforcment Learning: Global Decision-Making via Local...	31	Emerging	43	Python
171	goktug97/PEPG-ES Python Implementation of Parameter-exploring Policy Gradients Evolution Strategy	31	Emerging	17	Python
172	JeepWay/DeepPack Unofficial implementation of DeepPack in PyTorch. DeepPack is a deep...	30	Emerging	6	Python
173	Brownwang0426/Reversal-Generative-Reinforcement-Learning A simple model-free and value-function-free reinforcement learning model	30	Emerging	6	Python
174	WinDerek/reinforce-py Reinforcement learning agents in Python (dynamic programming,...	30	Emerging	2	Jupyter Notebook
175	lucadellalib/actorch Deep reinforcement learning framework for fast prototyping based on PyTorch	30	Emerging	14	Python
176	Daraan/ray_utilities ray & RLlib tools for unified code across different repositories....	30	Emerging	6	Python
177	teepanis/nonlinear-pendulum Data and Code Availability -- Universal spectral structure in pendulum-like systems	30	Emerging	1	Jupyter Notebook
178	voaneves/colab-rl Keras implementation of the latest Reinforcement Learning algorithms, ready...	30	Emerging	6	Jupyter Notebook
179	kyegomez/HindsightReplay My implementation of Hindsight replay in PyTorch: "Hindsight Experience Replay"	30	Emerging	6	Python
180	Skw3mdy/Reinforcement-Learning-Projects 🤖 Explore reinforcement learning techniques with projects including a taxi...	30	Emerging	2	Jupyter Notebook
181	ProfessorNova/PPO-Humanoid PPO implementation for controlling a humanoid in Gymnasium's Mujoco...	30	Emerging	31	Python
182	mindspore-courses/Rainbow-MindSpore About Rainbow-MindSpore! A step-by-step tutorial from DQN to Rainbow	30	Emerging	6	Jupyter Notebook
183	enjeeneer/zero-shot-rl VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low...	30	Emerging	26	Python
184	0xnu/deep-reinforcement-learning Deep Reinforcement Learning (DRL)	30	Emerging	1	Jupyter Notebook
185	rickstaa/stable-learning-control A framework for training theoretically stable (and robust) Reinforcement...	29	Experimental	7	Python
186	tartavull/alfredo Relentlessly learning, persistently failing, but never surrendering.	29	Experimental	9	Python
187	nunesma/reinforcement_learning Deep reinforcement learning techniques for artificial intelligence project	29	Experimental	1	Jupyter Notebook
188	ErickRosete/tacorl TACO-RL: Latent Plans for Task-Agnostic Offline Reinforcement Learning	29	Experimental	30	Python
189	rbosh/ml-adp Approximate dynamic programming for stochastic optimal control in Pytorch	29	Experimental	24	Python
190	Naighten/track-simulator Код магистрантской дипломной работы студента НГТУ им Р.Е. Алексеева Жукова...	29	Experimental	1	Python
191	Space-Robotics-Laboratory/rlstar RL STaR is a platform for creating AI for robotic applications. Researchers...	29	Experimental	32	Python
192	dlb-rl/pulse-rl Code for PulseRL: Enabling Offline Reinforcement Learning for Digital...	28	Experimental	8	Python
193	Yuxing-Wang-THU/Surrogate-assisted-ERL A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement Learning	28	Experimental	15	Python
194	motokiomura/Q-DOT [RLC 2025] Official code repository for "Offline Reinforcement Learning with...	28	Experimental	3	Python
195	NVlabs/RL-TNCO RL-TNCO: A reinforcement learning algorithm for solving the tensor network...	28	Experimental	10	Python
196	enginBozkurt/Deep-Reinforcement-Learning-for-Enterprise-Nanodegree Udacity Deep Reinforcement Learning for Enterprise Nanodegree Projects	27	Experimental	8	Jupyter Notebook
197	cubrink/mujoco-2.1-rl-project Implementing Deep Reinforcement Learning Algorithms in Python for use in the...	27	Experimental	17	TeX
198	natetsang/open-rl Implementations of a large collection of reinforcement learning algorithms.	27	Experimental	28	Python
199	MiscellaneousStuff/tlol-rl TLoL (Reinforcement Learning Python Module) - League of Legends RL Module...	27	Experimental	19	Python
200	shivakanthsujit/reducible-loss Codebase for Prioritizing samples in Reinforcement Learning with Reducible Loss	27	Experimental	12	Python
201	Jaehyun-Jeong/100LinesRL Clean RL algorithm implementations in under 100 lines each.	27	Experimental	1	Python
202	JaydenTeoh/MORL-Generalization Benchmark for evaluating the generalization capabilities of Multi-Objective...	27	Experimental	26	Python
203	amr-khaled164/GRLMSL 🚀 Optimize microservice instance selection and load balancing in edge...	27	Experimental	1	Python
204	HzcIrving/DLRL-PlayGround The code repo contains multiple code reproduction processes of various SOTA...	26	Experimental	37	Jupyter Notebook
205	declanoller/cat_mouse_continuous_RL Using DDPG and A2C reinforcement learning algorithms to solve a math puzzle	26	Experimental	10	Python
206	sapanz/Udacity-deep-reinforcement-learning-solution This repo will cover most of machine learning algorithms with coding examples.	26	Experimental	4	Jupyter Notebook
207	ugr-sail/paper-drl_building Supplementary material to the paper "An experimental evaluation of Deep...	26	Experimental	20	HTML
208	prototwin/RLExamples PotoTwin Reinforcement Learning Examples	25	Experimental	40	Python
209	Defenser1337/Reinforcement-learning-for-Gradient-descent Application of reinforcement learning to train hyperparameters of gradient...	25	Experimental	1	Jupyter Notebook
210	Now-Join-Us/V0 The code repository for "$V_0$: A Generalist Value Model for Any Policy at...	25	Experimental	5	Python
211	dalmia/P2_Continuous_Control My solution code for the second project of Udacity's Deep Reinforcement...	25	Experimental	5	ASP
212	trunghng/reinforcement_learning_an_introduction Python Implementation for problems in Reinforcement Learning - An Introduction book	24	Experimental	5	Python
213	rafelps/learning-recursive-goal-proposal Learning Recursive Goal Proposal: A hierarchical Reinforcement Learning Approach	24	Experimental	4	Python
214	bmazoure/ppo_jax Jax implementation of Proximal Policy Optimization (PPO) specifically tuned...	24	Experimental	59	Python
215	KeepALifeUS/ml-dqn Rainbow DQN: Double, Dueling, PER, Noisy Nets. Atari benchmarks. PyTorch.	24	Experimental	3	Python
216	dayyass/rllib Reinforcement Learning Library.	23	Experimental	29	Python
217	andranik-sahakyan/team-tron-rl Multi-Agent Reinforcement Learning project exploring the emergence and...	23	Experimental	3	Jupyter Notebook
218	adaptive-intelligent-robotics/HTE This is the repository for the paper Hierarchical Quality-Diversity for...	23	Experimental	4	C++
219	mindspore-courses/Deep-Reinforcement-Learning-Algorithms-with-MindSpore MindSpore implementations of deep reinforcement learning algorithms and environments	23	Experimental	16	Python
220	rorofaiz/awesome-RLVR-boundary 🔍 Explore curated resources on Reinforcement Learning with Verifiable...	23	Experimental	2	—
221	fareskhlifi/Intelligent-Scheduling-using-Reinforcement-learning-and-Deep-Q-Networks Implementing a new environment in Gymnasium for intelligent schduling	22	Experimental	6	Jupyter Notebook
222	xValentim/ReinforcementLearning_Zero_to_Hero_Course In this repository you will learn all the basic math about Reinforcement...	22	Experimental	6	Jupyter Notebook
223	Rudge0/DynaMO-RL Optimize policy learning by dynamically allocating rollouts and modulating...	22	Experimental	—	Python
224	32olaa/reward-scope 🔍 Detect reward hacking in RL training with RewardScope. Track reward...	22	Experimental	—	Python
225	HGVAbyte/rlhf-data-agent-full 🔍 Generate synthetic preference-ranked datasets for RLHF and DPO training,...	22	Experimental	—	Python
226	AlirezaShamsoshoara/RL-from-zero Comprehensive collection of reinforcement learning algorithms implemented...	22	Experimental	—	Python
227	hmomin/PPO-Winter-Run Trains an agent with Proximal Policy Optimization (PPO) to beat Winter Run	22	Experimental	23	TypeScript
228	icaros-usc/dqd-rl Official implementation of "Approximating Gradients for Differentiable...	22	Experimental	22	Python
229	bay3s/ppo-parallel Parallelized implementation of Proximal Policy Optimization (PPO).	22	Experimental	1	Python
230	ARgruny/Deep-Reinforcement-Learning Build and test DRL algorithms in different environments	22	Experimental	2	Jupyter Notebook
231	mohmdelsayed/TinyRL Real-Time Deep RL That Fits in Small Devices	22	Experimental	1	C++
232	aminkhani/Deep-RL You can see a reference for Books, Articles, Courses and Educational...	22	Experimental	20	Jupyter Notebook
233	victor369basu/MyosuiteDDQN In this repository, we try to solve musculoskeletal tasks with `Double DQN...	22	Experimental	17	Python
234	davirenner88-rgb/LR-S 🚀 Emulate Arknights: Endfield servers with LR-S for seamless game...	22	Experimental	1	Zig
235	ankitsharma-tech/Deep-Reinforcement-Learning-With-Pytorch PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3.	22	Experimental	12	Python
236	liyan2015/SUMO-RL-MobiCharger OpenAI-gym-like Reinforcement Learning environment for Dispatching of Mobile...	22	Experimental	15	Python
237	silviomori/udacity-deep-reinforcement-learning-p2-continuous-control Create and train a double-jointed arm agent that is able to maintain its...	22	Experimental	1	Jupyter Notebook
238	mercurycontaminated-sandarac557/KnapsackRL 🎯 Optimize exploration budgets in Reinforcement Learning with KnapsackRL for...	22	Experimental	—	Python
239	mlnjsh/Reinforcement_Learning_Projects 20 RL basics notebooks + 10 advanced projects with Streamlit apps covering...	22	Experimental	—	Jupyter Notebook
240	Axel-Bravo/19_udacity_drlnd Deep Reinforcement Learning Nanodregree from Udacity	21	Experimental	3	Jupyter Notebook
241	TroddenSpade/Exhaustive-Reinforcement-Learning Exhaustive Implementation of Algorithms, Key Papers, and Well-Known Problems...	21	Experimental	12	Jupyter Notebook
242	BackpropTools/BackpropTools A Fast, Portable Deep Reinforcement Learning Library for Continuous Control	21	Experimental	13	C++
243	GTR-GAMES/Deep-Hierarchical-Planning 🔍 Implement efficient long-horizon task planning with this PyTorch...	21	Experimental	1	Python
244	Aryia-Behroziuan/Robot-learning In developmental robotics, robot learning algorithms generate their own...	21	Experimental	9	—
245	PatrickSinger99/ReinforcementLearningInventoryManagement Repository for my bachelor thesis on inventory management in a logistics...	21	Experimental	9	Jupyter Notebook
246	motokiomura/annealed-q-learning [ICML 2025] Official code repository for "Gradual Transition from Bellman...	21	Experimental	8	Python
247	a7med3laa/DRL-Books-resources Deep Reinforcement Learning Books and links for studying	21	Experimental	1	—
248	julia-bel/MAPF_G2RL Implementation of the G2RL approach in the POGEMA environment	21	Experimental	13	Jupyter Notebook
249	iliasoroka1/GRU_Lyapunov_Spectrum Lyapunov Spectrum for Double Pendulum using GRU	21	Experimental	2	Jupyter Notebook
250	zhuzhipeng-123/reinforce-study-for-mmm Reinforcement Learning Research - Exploring RL algorithms in practical scenarios	21	Experimental	—	—
251	Jcorrieri/multiagent-gridworld Deep Reinforcement Learning for Multi-Robot Path Planning using PyTorch, Ray...	21	Experimental	4	Python
252	mbar0075/Advanced-Reinforcement-Learning Deliverables relating to the Advanced Reinforcement Learning University Unit	20	Experimental	6	Jupyter Notebook
253	AndersonPeng/ppo_tutorial PPO pytorch tutorial for continuous control (BipedalWalker-v3)	20	Experimental	11	Jupyter Notebook
254	manjavacas/rl-temario Temario sobre aprendizaje por refuerzo en español.	20	Experimental	5	Typst
255	soovittt/RL-Studio A full-stack platform for designing reinforcement learning environments,...	20	Experimental	1	TypeScript
256	rStar-RL/LoongRL LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts...	20	Experimental	13	Python
257	uzumstanley/DEEP-LEARNING UNIVERSITY OF ROEHAMPTON LONDON	20	Experimental	12	Jupyter Notebook
258	CarsonScott/Dual-Process-Reinforcement An intelligent agent that adaptively changes its thought processes to...	19	Experimental	12	—
259	TroddenSpade/Maximum-Entropy-Deep-IRL Implementations of Maximum Entropy Algorithms for solving Inverse...	19	Experimental	29	Jupyter Notebook
260	brianspiering/rl-course Applied Reinforcement Learning course	19	Experimental	12	Jupyter Notebook
261	ashworks1706/kaelum LATS-based inference with a reward model and online policy router across...	19	Experimental	6	Python
262	openpsi-projects/srl SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores	19	Experimental	15	Python
263	gabotechs/lazaro Reinforcement learning framework for implementing custom models on custom...	19	Experimental	4	Python
264	ArdavanKhalij/RL-Seminar-Project This project is the project of RL course at Vrije Universiteit Brussels and...	18	Experimental	2	Python
265	TroddenSpade/Meta-Reinforcement-Learning Code snippets of Meta Reinforcement Learning algorithms	18	Experimental	39	Jupyter Notebook
266	mhahsler/Introduction_to_Reinforcement_Learning Material for an introduction course to reinforcement learning for compute scientists	18	Experimental	1	Jupyter Notebook
267	zchuning/repo Resilient Model-Based RL by Regularizing Posterior Predictability	18	Experimental	22	Python
268	yamatokataoka/learning-from-human-preferences Replication of Deep Reinforcement Learning from Human Preferences...	18	Experimental	2	TypeScript
269	ArdavanKhalij/MDP machine-learning reinforcement-learning artificial-intelligence...	18	Experimental	2	Jupyter Notebook
270	Bonifatius94/rl-algos SOTA Reinforcement Learning Algorithms	18	Experimental	2	Python
271	mohamedrxo/ppo A comprehensive repository for training OpenAI Gym environments using...	17	Experimental	3	Jupyter Notebook
272	mlnjsh/rl-book-labs 🎮 Interactive browser-based labs for "Complete Reinforcement Learning...	17	Experimental	1	Jupyter Notebook
273	Uzi-gpu/reinforcement-learning Reinforcement Learning projects with Q-Learning, Actor-Critic, and REINFORCE...	17	Experimental	—	Jupyter Notebook
274	undextrois/reinforcement-learning RL Experiments and what not	17	Experimental	—	Python
275	micdestefano/micppo An implementation of Proximal Policy Optimization (PPO)	17	Experimental	1	Python
276	Saifahmadkhan/PlugNPlay This library is a PlugNPlay version of our novel pipeline VacSIM. We have...	17	Experimental	1	Python
277	Talendar/pyderl Simple Deep Reinforcement Learning framework for Python.	17	Experimental	1	Python
278	PieroMacaluso/collaboration-n-competition Implementation of Multi-Agent Deep Deterministic Policy Gradient (MADDPG)...	17	Experimental	1	TeX
279	snthomps/rlhf-ppo-pipeline RLHF/PPO Training Pipeline with Performance Profiling and Optimization Demonstrations	17	Experimental	—	JavaScript
280	kodok13/Label-Free-RLVR 📚 Explore a curated collection of research on Label-Free Reinforcement...	14	Experimental	—	—
281	yelurebajrang/HeteroRL_GEPO ⚡ Optimize heterogeneous reinforcement learning with GEPO for decentralized...	14	Experimental	—	Python
282	Lare1998/rl-for-robotics Reinforcement Learning applications for robotic control and task automation.	14	Experimental	—	Python
283	PathumDilhara/RL-agent-for-CNN-hyper-parameter-optimization A reinforcement learning (RL) based agent that automatically tunes...	14	Experimental	1	Jupyter Notebook
284	igeng/awesome-drl-cloud-scheduling A curated list of research papers, code, and tools applying deep...	14	Experimental	7	—
285	Madid1976/reinforcement-learning-agents Implementations of various reinforcement learning algorithms and agents for...	14	Experimental	—	Python
286	thevilledev/elements-of-ai-idea Project pitch on using reinforcement learning for resource scheduling	13	Experimental	—	—
287	Develop-Packt/Building-an-Artificial-Intelligence-Algorithm Learn how to build a machine learning mode and get started on the popular...	13	Experimental	—	Jupyter Notebook
288	Tahernezhad/Continuous-Control-Workbench A clean PyTorch implementation of PPO, SAC, and TD3 made from scratch. It is...	13	Experimental	—	Python
289	MatTheTab/GHOST_RL_materials Materials for Reinforcement Learning and Machine Learning in games for GHOST.	13	Experimental	—	Jupyter Notebook
290	creativequotient/reinforcement-learning reinforcement learning algorithms implemented in pytorch	12	Experimental	6	Python
291	s1ddh-rth/HumanoidClimb-RL This project explores the application of reinforcement learning (RL) to...	12	Experimental	8	Python
292	alexkalergis/Robot-writing-via-Reinforcement-Learning • Master Thesis • In this project we aim to create a virtual environment of...	11	Experimental	—	Python
293	eg424/DDPG-Pendulum Deep Deterministic Policy Gradient for Healthcare	11	Experimental	—	Python
294	Pablo-Caballero-Maria/Reinforcement-learning-in-robotics Jupyter notebook for reinforcement learning algorithm comparison	11	Experimental	—	Jupyter Notebook
295	fcrlab-unime/rl-examples Collection of examples for Reinforcement Learning	11	Experimental	—	Jupyter Notebook
296	sagarnildass/Udacity-Deep-Reinforcement-Learning-Nanodegree Projects and Exercises of Udacity Deep Reinforcement Learning Nanodegree	11	Experimental	—	Jupyter Notebook
297	PieroMacaluso/continuous-control Implementation of Deep Deterministic Policy Gradient (DDPG) for the Reacher...	11	Experimental	—	TeX
298	PieroMacaluso/navigation Implementation of Deep Q Network for the Banana environment in the context...	11	Experimental	—	TeX
299	juliusfrost/RL-library-activity-tracker A simple RL library activity tracker with github badges	11	Experimental	—	—
300	LearningFish/rlai2cpp C++ Implementation of Reinforcement Learning: An Introduction (2nd Edition)	11	Experimental	4	C++
301	paogam1997/Deep-Reinforcement-Learning This repository is still in progress, it contains the file created to the...	10	Experimental	2	MATLAB
302	niho/ReinforcementLearning Deep Reinforcement Learning in Swift	10	Experimental	2	Swift
303	adielashrov/Enhance-DL-with-SBM-Modelsward2023 Repository for "Enhancing Deep Learning with Scenario-Based Override Rules:...	10	Experimental	2	C++
304	mahfuztbt/Investigating-the-Use-of-Reinforcement-Learning-for-Autonomous-Decision-Making-in-Robotics Hi I am applying for my master's degree in China. This repo represent my...	10	Experimental	2	—

Comparisons in this category

stable-baselines3 and stable-baselines3-contrib (76 vs 64) rl_games and Practical_RL (71 vs 64)