Reinforcement Learning Frameworks

Complete RL algorithm implementations and educational resources for training agents using policy gradient, Q-learning, actor-critic, and other methods. Does NOT include game-playing agents, robotics simulators, or domain-specific RL applications—only the core algorithmic frameworks and tutorials.

There are 304 reinforcement learning frameworks tracked. 3 score above 70 (verified tier). The highest-rated is DLR-RM/stable-baselines3 at 76/100 with 12,878 stars. 6 of the top 10 are actively maintained.
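The tier labels in the listing below appear to follow fixed score bands. A minimal sketch of that mapping, assuming cutoffs inferred only from the scores shown on this page (the function name `tier_for_score` is hypothetical, and the cutoffs are not an official definition):

```python
def tier_for_score(score: int) -> str:
    """Map a 0-100 quality score to a tier label.

    Cutoffs are inferred from this listing, not taken from any official
    definition: the page says Verified entries score "above 70", the lowest
    Established score shown is 50, and the lowest Emerging score shown is 30.
    """
    if score > 70:   # assumption: a score of exactly 70 is not Verified
        return "Verified"
    if score >= 50:
        return "Established"
    if score >= 30:
        return "Emerging"
    return "Experimental"


# Scores taken from rows of the listing.
for score in (76, 68, 49, 29):
    print(score, tier_for_score(score))
```

No entry in the listing scores exactly 69 or 70, so the behaviour at that boundary is a guess.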

Get all 304 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=reinforcement-learning-frameworks&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
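The same request can be made from Python using only the standard library. This is a sketch under stated assumptions: the helper names are hypothetical, the response is assumed to be JSON (as the heading above suggests), and its schema is not documented here, so the parsed result is returned as-is. The example URL requests 20 entries; whether larger `limit` values or paging are accepted is not stated on this page.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_quality_url(limit: int = 20) -> str:
    """Build the dataset URL with the same query parameters as the curl call."""
    params = {
        "domain": "ml-frameworks",
        "subcategory": "reinforcement-learning-frameworks",
        "limit": limit,
    }
    return f"{BASE_URL}?{urlencode(params)}"


def fetch_quality_dataset(limit: int = 20, timeout: float = 30.0):
    """Fetch and parse the response (hypothetical helper; schema unknown).

    No API key is sent, so the keyless daily request limit applies.
    """
    with urlopen(build_quality_url(limit), timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality_dataset()` performs one network request; `build_quality_url()` alone is side-effect free and can be used to check the query string first.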

# Framework Score Tier
1 DLR-RM/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of...

76
Verified
2 google-deepmind/dm_control

Google DeepMind's software stack for physics-based simulation and...

73
Verified
3 Denys88/rl_games

RL implementations

71
Verified
4 pytorch/rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

68
Established
5 yandexdataschool/Practical_RL

A course in reinforcement learning in the wild

64
Established
6 Stable-Baselines-Team/stable-baselines3-contrib

Contrib package for Stable-Baselines3 - Experimental reinforcement learning...

64
Established
7 flatland-association/flatland-rl

The Flatland Framework is a multi-purpose environment to tackle problems...

61
Established
8 keras-rl/keras-rl

Deep Reinforcement Learning for Keras.

60
Established
9 MushroomRL/mushroom-rl

Python library for Reinforcement Learning.

60
Established
10 takuseno/d3rlpy

An offline deep reinforcement learning library

60
Established
11 lucidrains/streaming-deep-rl

Explorations into the proposed Streaming Deep Reinforcement Learning, from...

58
Established
12 MyoHub/myosuite

MyoSuite is a collection of environments/tasks to be solved by...

58
Established
13 google-research/batch-ppo

Efficient Batched Reinforcement Learning in TensorFlow

58
Established
14 qzed/irl-maxent

Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning...

57
Established
15 tensorlayer/RLzoo

A Comprehensive Reinforcement Learning Zoo for Simple Usage 🚀

57
Established
16 PKU-Alignment/omnisafe

JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.

57
Established
17 upb-lea/reinforcement_learning_course_materials

Lecture notes, tutorial tasks including solutions as well as online videos...

57
Established
18 inoryy/reaver

Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft...

56
Established
19 huggingface/deep-rl-class

This repo contains the Hugging Face Deep Reinforcement Learning Course.

55
Established
20 rl-tools/rl-tools

The Fastest Deep Reinforcement Learning Library

54
Established
21 TuragaLab/flybody

MuJoCo fruit fly body model and locomotion RL tasks

54
Established
22 SforAiDl/genrl

A PyTorch reinforcement learning library for generalizable and reproducible...

54
Established
23 danijar/mindpark

Testbed for deep reinforcement learning

54
Established
24 iffiX/machin

Reinforcement learning library(framework) designed for PyTorch, implements...

53
Established
25 fracapuano/robot-learning-tutorial

All the source code for "Robot Learning: A Tutorial". Get involved to be...

53
Established
26 danaugrs/huskarl

Deep Reinforcement Learning Framework + Algorithms

53
Established
27 AdamStelmaszczyk/learning2run

Our NIPS 2017: Learning to Run source code

52
Established
28 icoxfog417/baby-steps-of-rl-ja

Sample code for "Pythonで学ぶ強化学習" (Learning Reinforcement Learning with Python: From Introduction to Practice)

51
Established
29 keon/deep-q-learning

Minimal Deep Q Learning (DQN & DDQN) implementations in Keras

51
Established
30 danijar/embodied

Fast reinforcement learning research

51
Established
31 MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of...

51
Established
32 ikostrikov/pytorch-a3c

PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from...

51
Established
33 ikostrikov/pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy...

51
Established
34 ankonzoid/LearningX

Deep & Classical Reinforcement Learning + Machine Learning Examples in Python

51
Established
35 sweetice/Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO,...

51
Established
36 vmayoral/basic_reinforcement_learning

An introductory series to Reinforcement Learning (RL) with comprehensive...

51
Established
37 mimoralea/gdrl

Grokking Deep Reinforcement Learning

51
Established
38 stanfordnmbl/osim-rl

Reinforcement learning environments with musculoskeletal models

51
Established
39 heronsystems/adeptRL

Reinforcement learning framework to accelerate research

51
Established
40 rlcode/reinforcement-learning

Minimal and Clean Reinforcement Learning Examples

51
Established
41 ShangtongZhang/reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

51
Established
42 MorvanZhou/Reinforcement-learning-with-tensorflow

Simple Reinforcement learning tutorials, 莫烦Python (Morvan Python) Chinese-language AI tutorials

51
Established
43 dalmia/David-Silver-Reinforcement-learning

Notes for the Reinforcement Learning course by David Silver along with...

51
Established
44 miyosuda/async_deep_reinforce

Asynchronous Methods for Deep Reinforcement Learning

51
Established
45 RLE-Foundation/rllte

Long-Term Evolution Project of Reinforcement Learning

51
Established
46 pathak22/noreward-rl

[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep...

51
Established
47 archsyscall/DeepRL-TensorFlow2

🐋 Simple implementations of various popular Deep Reinforcement Learning...

51
Established
48 rail-berkeley/softlearning

Softlearning is a reinforcement learning framework for training maximum...

50
Established
49 Kaixhin/Rainbow

Rainbow: Combining Improvements in Deep Reinforcement Learning

50
Established
50 jingweiz/pytorch-rl

Deep Reinforcement Learning with pytorch & visdom

50
Established
51 nikhilbarhate99/PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization...

50
Established
52 mimoralea/applied-reinforcement-learning

Reinforcement Learning and Decision Making tutorials explained at an...

50
Established
53 Fraunhofer-IIS/fmugym

Interface to connect Reinforcement Learning libraries to Functional Mock-up...

50
Established
54 pat-coady/trpo

Trust Region Policy Optimization with TensorFlow and OpenAI Gym

50
Established
55 udacity/reinforcement-learning

Reinforcement learning material, code and exercises for Udacity Nanodegree programs.

50
Established
56 ikostrikov/pytorch-trpo

PyTorch implementation of Trust Region Policy Optimization

50
Established
57 opendilab/DI-engine-docs

DI-engine docs (Chinese and English)

49
Emerging
58 nrontsis/PILCO

Bayesian Reinforcement Learning in Tensorflow

49
Emerging
59 seungeunrho/minimalRL

Implementations of basic RL algorithms with minimal lines of code! (PyTorch-based)

49
Emerging
60 denisyarats/pytorch_sac

PyTorch implementation of Soft Actor-Critic (SAC)

49
Emerging
61 rmst/ddpg

TensorFlow implementation of the DDPG algorithm from the paper Continuous...

49
Emerging
62 Stable-Baselines-Team/stable-baselines

Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of...

48
Emerging
63 andri27-ts/Reinforcement-Learning

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python....

48
Emerging
64 lucidrains/metacontroller

Implementation of the MetaController proposed in "Emergent temporal...

48
Emerging
65 binary-husky/hmp2g

Multiagent Reinforcement Learning Research Project

48
Emerging
66 vwxyzjn/cleanrl

High-quality single file implementation of Deep Reinforcement Learning...

48
Emerging
67 alessiodm/drl-zh

Deep Reinforcement Learning: Zero to Hero!

48
Emerging
68 rl-language/rlc

Bringing reinforcement learning to every day programmers

48
Emerging
69 zyxue/sutton-barto-rl-exercises

📖Learning reinforcement learning by implementing the algorithms from...

47
Emerging
70 ericyangyu/PPO-for-Beginners

A simple and well styled PPO implementation. Based on my Medium series:...

47
Emerging
71 gordicaleksa/pytorch-learn-reinforcement-learning

A collection of various RL algorithms like policy gradients, DQN and PPO....

47
Emerging
72 HewlettPackard/dc-rl

SustainDC is a set of Python environments for Data Center simulation and...

47
Emerging
73 Cloudslab/DLSF

[TMC'20] Deep Learning based Scheduler for Stochastic Fog-Cloud computing...

47
Emerging
74 CarperAI/trlx

A repo for distributed training of language models with Reinforcement...

46
Emerging
75 TianhongDai/reinforcement-learning-algorithms

This repository contains most of pytorch implementation based classic deep...

46
Emerging
76 SuhailSama/MR_RL

Gym Simulator for Magnetic Micro Robots

46
Emerging
77 godka/Pensieve-PPO

The simplest implementation of Pensieve (SIGCOMM' 17) via state-of-the-art...

46
Emerging
78 medipixel/rl_algorithms

Structural implementation of RL key algorithms

45
Emerging
79 Anjum48/rl-examples

Examples of published reinforcement learning algorithms in recent literature...

45
Emerging
80 google-deepmind/dm_env

A Python interface for reinforcement learning environments

45
Emerging
81 IBM/LOA

Neuro-Symbolic Reinforcement Learning: Logical Optimal Action (LOA), a novel...

45
Emerging
82 yihaosun1124/OfflineRL-Kit

An elegant PyTorch offline reinforcement learning library for researchers.

45
Emerging
83 gabrielhuang/reptile-pytorch

A PyTorch implementation of OpenAI's REPTILE algorithm

45
Emerging
84 denisyarats/drq

DrQ: Data regularized Q

44
Emerging
85 huangwl18/modular-rl

[ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular...

44
Emerging
86 DeNA/HandyRL

HandyRL is a handy and simple framework based on Python and PyTorch for...

44
Emerging
87 mahyaret/kuka_rl

Reinforcement Learning Experiments using PyBullet

44
Emerging
88 sudharsan13296/Deep-Reinforcement-Learning-With-Python

Master classic RL, deep RL, distributional RL, inverse RL, and more using...

43
Emerging
89 rlgraph/rlgraph

RLgraph: Modular computation graphs for deep reinforcement learning

43
Emerging
90 simoninithomas/Deep_reinforcement_learning_Course

Implementations from the free course Deep Reinforcement Learning with...

43
Emerging
91 mohmdelsayed/streaming-drl

Deep reinforcement learning without experience replay, target networks, or...

43
Emerging
92 omerbsezer/Reinforcement_learning_tutorial_with_demo

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration),...

43
Emerging
93 yrlu/irl-imitation

Implementation of Inverse Reinforcement Learning (IRL) algorithms in...

43
Emerging
94 UoA-CARES/cares_reinforcement_learning

CARES Reinforcement Learning Package

42
Emerging
95 Learning4Optimization-HUST/H-TSP

Official implementation of H-TSP (AAAI2023)

42
Emerging
96 Bellman-devs/bellman

Model-based reinforcement learning in TensorFlow

42
Emerging
97 sebastianbrzustowicz/Robot-Sumo-RL

Python + PyTorch. Advanced Reinforcement Learning (SAC/PPO/A2C) for...

42
Emerging
98 andrewliao11/Deep-Reinforcement-Learning-Survey

My Exploration on Deep Reinforcement Learning Survey

42
Emerging
99 adrianwix/pybasin

pyBasin is a Python library for estimating basin stability in dynamical...

42
Emerging
100 dvalenciar/ReinforceUI-Studio

ReinforceUI-Studio. A Python-based application designed to simplify the...

42
Emerging
101 denisyarats/proto

Proto-RL: Reinforcement Learning with Prototypical Representations

41
Emerging
102 MaartenGr/ReinLife

Creating Artificial Life with Reinforcement Learning

41
Emerging
103 Kaixhin/imitation-learning

Imitation learning algorithms

41
Emerging
104 MarcoMeter/recurrent-ppo-truncated-bptt

Baseline implementation of recurrent PPO using truncated BPTT

41
Emerging
105 khushhallchandra/pytorch-rl

Pytorch Implementation of RL algorithms

41
Emerging
106 tayalmanan28/Safe_Reinforcement_Learning

Repository containing the code for safe reinforcement learning in two custom...

41
Emerging
107 thanhkaist/CCFDM1

CCFDM reinforcement learning

41
Emerging
108 XinJingHao/DRL-Pytorch

Clean, Robust, and Unified PyTorch implementation of popular Deep...

41
Emerging
109 airboxlab/rllib-energyplus

Simple EnergyPlus environments for control optimization using reinforcement learning

41
Emerging
110 NatLabRockies/graph-env

Reinforcement learning for combinatorial optimization over directed graphs

41
Emerging
111 rmst/rlrd

PyTorch implementation of our paper Reinforcement Learning with Random...

41
Emerging
112 mitre/ilpyt

ilpyt: imitation learning library with modular, baseline implementations in Pytorch

40
Emerging
113 ItoMasaki/PixyzRL

A Bayesian RL Framework with Probabilistic Generative Models

40
Emerging
114 whoiszyc/IntelliHealer

IntelliHealer: An imitation and reinforcement learning platform for...

40
Emerging
115 tirthajyoti/RL_basics

Basic Reinforcement Learning algorithms

40
Emerging
116 rllab-snu/Deep-Reinforcement-Learning

Introduction to Deep Reinforcement Learning

40
Emerging
117 TheoLvs/reinforcement-learning

Personal experiments on Reinforcement Learning

40
Emerging
118 MishaLaskin/rad

RAD: Reinforcement Learning with Augmented Data

40
Emerging
119 zuoxingdong/lagom

lagom: A PyTorch infrastructure for rapid prototyping of reinforcement...

40
Emerging
120 antonpuz/DeROL

Deep Reinforcement One-Shot Learning Framework for Artificially Intelligent...

40
Emerging
121 araffin/rl-handson-rlvs21

Stable-Baselines3 (SB3) reinforcement learning tutorial for the...

40
Emerging
122 nsidn98/NICE

Combining Reinforcement Learning with Integer Programming for Robust Scheduling

40
Emerging
123 EsratMaria/Reinforcement-Learning_for_Energy_Minimization_Using_CLoudsim

Implementation of RL in the cloud for energy minimization due to migration...

39
Emerging
124 RLE-Foundation/RLeXplore

RLeXplore provides stable baselines of exploration methods in reinforcement...

39
Emerging
125 Zhenye-Na/advanced-deep-learning-and-reinforcement-learning-deepmind

🎮 Advanced Deep Learning and Reinforcement Learning at UCL & DeepMind |...

39
Emerging
126 luisgarciar/3D-bin-packing

Solving the 3D bin packing problem with reinforcement learning

39
Emerging
127 UlisseMini/procgen-tools

Tools for running experiments on RL agents in procgen environments

39
Emerging
128 921kiyo/symbolic-rl

Symbolic Reinforcement Learning using Inductive Logic Programming

39
Emerging
129 YuvrajSingh-mist/NeatRL

Repository of implementations of classic and sota rl algorithms from scratch...

38
Emerging
130 appgym/appgym

Mobile Apps (Android) as Environment for Reinforcement Learning Agents

37
Emerging
131 dalmia/udacity-deep-reinforcement-learning

My solutions to the projects (and mini-projects) of the Deep Reinforcement...

37
Emerging
132 takuseno/minerva

An out-of-the-box GUI tool for offline deep reinforcement learning

37
Emerging
133 astier/model-free-episodic-control

Model-Free-Episodic-Control implementation.

37
Emerging
134 chengxi600/RLStuff

A collection of reinforcement learning algorithm implementations

37
Emerging
135 anassinator/pddp

WIP implementation of Probabilistic Differential Dynamic Programming in PyTorch

37
Emerging
136 AdamStelmaszczyk/rl-tutorial

Source code for "A deep dive into reinforcement learning"

37
Emerging
137 VachanVY/Reinforcement-Learning

PyTorch implementations of algorithms from "Reinforcement Learning: An...

37
Emerging
138 LAMDA-RL/ODIS

The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent...

37
Emerging
139 saqib1707/RL-PPO-PyTorch

Simple and Modular implementation of Proximal Policy Optimization (PPO) in PyTorch

36
Emerging
140 shehio/rl

Implementing RL agents, one algorithm at a time

36
Emerging
141 BY571/CQL

PyTorch implementation of the Offline Reinforcement Learning algorithm CQL....

36
Emerging
142 kochlisGit/Shadow-Hand-Controller

Construction of controllers for Shadow-Hand in Mujoco environment, using...

36
Emerging
143 denisyarats/exorl

ExORL: Exploratory Data for Offline Reinforcement Learning

36
Emerging
144 navneet-nmk/pytorch-rl

This repository contains model-free deep reinforcement learning algorithms...

36
Emerging
145 BNN-UPC/ENERO

Code used in the paper "ENERO: Efficient real-time WAN routing optimization...

36
Emerging
146 jayLEE0301/dhrl_official

Official code for "DHRL: A Graph-Based Approach for Long-Horizon and Sparse...

35
Emerging
147 asystemoffields/disco-torch

A PyTorch port of DeepMind's Disco103 — the meta-learned reinforcement...

35
Emerging
148 sdpkjc/abcdrl

Modular Single-file Reinforcement Learning Algorithms Library

35
Emerging
149 schmidtdominik/Rainbow

Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient...

34
Emerging
150 VladGavra98/SERL

Safety-informed Evolutionary Reinforcement Learning applied to...

34
Emerging
151 affaan-m/Behavioral_RL

Reinforcement Learning with human behavioral biases integration

34
Emerging
152 Shaswat2001/maple-robotics

MAPLE (Model and Policy Learning Evaluation) - A unified CLI daemon for...

34
Emerging
153 opium-sh/prl

Open-source library for reinforcement learning research.

34
Emerging
154 reward-scope-ai/reward-scope

Real-time reward debugging and hacking detection for reinforcement learning

34
Emerging
155 navneet-nmk/Pytorch-RL-CPP

A Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch)

34
Emerging
156 jimimvp/torch_rl

Reinforcement learning library for PyTorch.

33
Emerging
157 NYU-MLDA/ABC-RL

This is work-in-progress (WIP) refactored implementation of...

33
Emerging
158 AdamStelmaszczyk/dqn

TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)

33
Emerging
159 nnaisense/pgpelib

A mini library for Policy Gradients with Parameter-based Exploration, with...

33
Emerging
160 Asap7772/PTR

This repository contains the implementation of the PTR algorithm described...

32
Emerging
161 lucaslingle/pytorch_rl2

Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'

32
Emerging
162 cgel/DRL

A collection of Deep Reinforcement Learning algorithms implemented in...

32
Emerging
163 CLAIRE-Labo/no-representation-no-trust

Codebase to fully reproduce the results of "No Representation, No Trust:...

32
Emerging
164 ialexmp/DRL-Generalization

Exploring Generalization in Deep Reinforcement Learning algorithms for...

32
Emerging
165 matthieu637/ddrl

Deep Developmental Reinforcement Learning

32
Emerging
166 haron1100/Upside-Down-Reinforcement-Learning

Implementation of Schmidhuber's Upside Down Reinforcement Learning paper in PyTorch

32
Emerging
167 FabioMiguel2000/LOA-feat.Reinforcement-Learning

Assignment 2 for Course L.EIC029 Artificial Intelligence, FEUP LEIC 3rd Year...

32
Emerging
168 linker81/Reinforcement-Learning-CheatSheet

Cheatsheet of Reinforcement Learning (Based on Sutton-Barto Book - 2nd Edition)

31
Emerging
169 harshaljanjani/taskschedulingdqn

Designing energy-aware scheduling and task allocation algorithms for online...

31
Emerging
170 mbchang/decentralized-rl

Decentralized Reinforcement Learning: Global Decision-Making via Local...

31
Emerging
171 goktug97/PEPG-ES

Python Implementation of Parameter-exploring Policy Gradients Evolution Strategy

31
Emerging
172 JeepWay/DeepPack

Unofficial implementation of DeepPack in PyTorch. DeepPack is a deep...

30
Emerging
173 Brownwang0426/Reversal-Generative-Reinforcement-Learning

A simple model-free and value-function-free reinforcement learning model

30
Emerging
174 WinDerek/reinforce-py

Reinforcement learning agents in Python (dynamic programming,...

30
Emerging
175 lucadellalib/actorch

Deep reinforcement learning framework for fast prototyping based on PyTorch

30
Emerging
176 Daraan/ray_utilities

ray & RLlib tools for unified code across different repositories....

30
Emerging
177 teepanis/nonlinear-pendulum

Data and Code Availability -- Universal spectral structure in pendulum-like systems

30
Emerging
178 voaneves/colab-rl

Keras implementation of the latest Reinforcement Learning algorithms, ready...

30
Emerging
179 kyegomez/HindsightReplay

My implementation of Hindsight replay in PyTorch: "Hindsight Experience Replay"

30
Emerging
180 Skw3mdy/Reinforcement-Learning-Projects

🤖 Explore reinforcement learning techniques with projects including a taxi...

30
Emerging
181 ProfessorNova/PPO-Humanoid

PPO implementation for controlling a humanoid in Gymnasium's Mujoco...

30
Emerging
182 mindspore-courses/Rainbow-MindSpore

About Rainbow-MindSpore! A step-by-step tutorial from DQN to Rainbow

30
Emerging
183 enjeeneer/zero-shot-rl

VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low...

30
Emerging
184 0xnu/deep-reinforcement-learning

Deep Reinforcement Learning (DRL)

30
Emerging
185 rickstaa/stable-learning-control

A framework for training theoretically stable (and robust) Reinforcement...

29
Experimental
186 tartavull/alfredo

Relentlessly learning, persistently failing, but never surrendering.

29
Experimental
187 nunesma/reinforcement_learning

Deep reinforcement learning techniques for artificial intelligence project

29
Experimental
188 ErickRosete/tacorl

TACO-RL: Latent Plans for Task-Agnostic Offline Reinforcement Learning

29
Experimental
189 rbosh/ml-adp

Approximate dynamic programming for stochastic optimal control in Pytorch

29
Experimental
190 Naighten/track-simulator

Code for the master's thesis of NSTU n.a. R.E. Alekseev (НГТУ) student Zhukov...

29
Experimental
191 Space-Robotics-Laboratory/rlstar

RL STaR is a platform for creating AI for robotic applications. Researchers...

29
Experimental
192 dlb-rl/pulse-rl

Code for PulseRL: Enabling Offline Reinforcement Learning for Digital...

28
Experimental
193 Yuxing-Wang-THU/Surrogate-assisted-ERL

A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement Learning

28
Experimental
194 motokiomura/Q-DOT

[RLC 2025] Official code repository for "Offline Reinforcement Learning with...

28
Experimental
195 NVlabs/RL-TNCO

RL-TNCO: A reinforcement learning algorithm for solving the tensor network...

28
Experimental
196 enginBozkurt/Deep-Reinforcement-Learning-for-Enterprise-Nanodegree

Udacity Deep Reinforcement Learning for Enterprise Nanodegree Projects

27
Experimental
197 cubrink/mujoco-2.1-rl-project

Implementing Deep Reinforcement Learning Algorithms in Python for use in the...

27
Experimental
198 natetsang/open-rl

Implementations of a large collection of reinforcement learning algorithms.

27
Experimental
199 MiscellaneousStuff/tlol-rl

TLoL (Reinforcement Learning Python Module) - League of Legends RL Module...

27
Experimental
200 shivakanthsujit/reducible-loss

Codebase for Prioritizing samples in Reinforcement Learning with Reducible Loss

27
Experimental
201 Jaehyun-Jeong/100LinesRL

Clean RL algorithm implementations in under 100 lines each.

27
Experimental
202 JaydenTeoh/MORL-Generalization

Benchmark for evaluating the generalization capabilities of Multi-Objective...

27
Experimental
203 amr-khaled164/GRLMSL

🚀 Optimize microservice instance selection and load balancing in edge...

27
Experimental
204 HzcIrving/DLRL-PlayGround

The code repo contains multiple code reproduction processes of various SOTA...

26
Experimental
205 declanoller/cat_mouse_continuous_RL

Using DDPG and A2C reinforcement learning algorithms to solve a math puzzle

26
Experimental
206 sapanz/Udacity-deep-reinforcement-learning-solution

This repo will cover most machine learning algorithms with coding examples.

26
Experimental
207 ugr-sail/paper-drl_building

Supplementary material to the paper "An experimental evaluation of Deep...

26
Experimental
208 prototwin/RLExamples

ProtoTwin Reinforcement Learning Examples

25
Experimental
209 Defenser1337/Reinforcement-learning-for-Gradient-descent

Application of reinforcement learning to train hyperparameters of gradient...

25
Experimental
210 Now-Join-Us/V0

The code repository for "$V_0$: A Generalist Value Model for Any Policy at...

25
Experimental
211 dalmia/P2_Continuous_Control

My solution code for the second project of Udacity's Deep Reinforcement...

25
Experimental
212 trunghng/reinforcement_learning_an_introduction

Python Implementation for problems in Reinforcement Learning - An Introduction book

24
Experimental
213 rafelps/learning-recursive-goal-proposal

Learning Recursive Goal Proposal: A hierarchical Reinforcement Learning Approach

24
Experimental
214 bmazoure/ppo_jax

Jax implementation of Proximal Policy Optimization (PPO) specifically tuned...

24
Experimental
215 KeepALifeUS/ml-dqn

Rainbow DQN: Double, Dueling, PER, Noisy Nets. Atari benchmarks. PyTorch.

24
Experimental
216 dayyass/rllib

Reinforcement Learning Library.

23
Experimental
217 andranik-sahakyan/team-tron-rl

Multi-Agent Reinforcement Learning project exploring the emergence and...

23
Experimental
218 adaptive-intelligent-robotics/HTE

This is the repository for the paper Hierarchical Quality-Diversity for...

23
Experimental
219 mindspore-courses/Deep-Reinforcement-Learning-Algorithms-with-MindSpore

MindSpore implementations of deep reinforcement learning algorithms and environments

23
Experimental
220 rorofaiz/awesome-RLVR-boundary

🔍 Explore curated resources on Reinforcement Learning with Verifiable...

23
Experimental
221 fareskhlifi/Intelligent-Scheduling-using-Reinforcement-learning-and-Deep-Q-Networks

Implementing a new environment in Gymnasium for intelligent scheduling

22
Experimental
222 xValentim/ReinforcementLearning_Zero_to_Hero_Course

In this repository you will learn all the basic math about Reinforcement...

22
Experimental
223 Rudge0/DynaMO-RL

Optimize policy learning by dynamically allocating rollouts and modulating...

22
Experimental
224 32olaa/reward-scope

🔍 Detect reward hacking in RL training with RewardScope. Track reward...

22
Experimental
225 HGVAbyte/rlhf-data-agent-full

🔍 Generate synthetic preference-ranked datasets for RLHF and DPO training,...

22
Experimental
226 AlirezaShamsoshoara/RL-from-zero

Comprehensive collection of reinforcement learning algorithms implemented...

22
Experimental
227 hmomin/PPO-Winter-Run

Trains an agent with Proximal Policy Optimization (PPO) to beat Winter Run

22
Experimental
228 icaros-usc/dqd-rl

Official implementation of "Approximating Gradients for Differentiable...

22
Experimental
229 bay3s/ppo-parallel

Parallelized implementation of Proximal Policy Optimization (PPO).

22
Experimental
230 ARgruny/Deep-Reinforcement-Learning

Build and test DRL algorithms in different environments

22
Experimental
231 mohmdelsayed/TinyRL

Real-Time Deep RL That Fits in Small Devices

22
Experimental
232 aminkhani/Deep-RL

You can see a reference for Books, Articles, Courses and Educational...

22
Experimental
233 victor369basu/MyosuiteDDQN

In this repository, we try to solve musculoskeletal tasks with `Double DQN...

22
Experimental
234 davirenner88-rgb/LR-S

🚀 Emulate Arknights: Endfield servers with LR-S for seamless game...

22
Experimental
235 ankitsharma-tech/Deep-Reinforcement-Learning-With-Pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3.

22
Experimental
236 liyan2015/SUMO-RL-MobiCharger

OpenAI-gym-like Reinforcement Learning environment for Dispatching of Mobile...

22
Experimental
237 silviomori/udacity-deep-reinforcement-learning-p2-continuous-control

Create and train a double-jointed arm agent that is able to maintain its...

22
Experimental
238 mercurycontaminated-sandarac557/KnapsackRL

🎯 Optimize exploration budgets in Reinforcement Learning with KnapsackRL for...

22
Experimental
239 mlnjsh/Reinforcement_Learning_Projects

20 RL basics notebooks + 10 advanced projects with Streamlit apps covering...

22
Experimental
240 Axel-Bravo/19_udacity_drlnd

Deep Reinforcement Learning Nanodegree from Udacity

21
Experimental
241 TroddenSpade/Exhaustive-Reinforcement-Learning

Exhaustive Implementation of Algorithms, Key Papers, and Well-Known Problems...

21
Experimental
242 BackpropTools/BackpropTools

A Fast, Portable Deep Reinforcement Learning Library for Continuous Control

21
Experimental
243 GTR-GAMES/Deep-Hierarchical-Planning

🔍 Implement efficient long-horizon task planning with this PyTorch...

21
Experimental
244 Aryia-Behroziuan/Robot-learning

In developmental robotics, robot learning algorithms generate their own...

21
Experimental
245 PatrickSinger99/ReinforcementLearningInventoryManagement

Repository for my bachelor thesis on inventory management in a logistics...

21
Experimental
246 motokiomura/annealed-q-learning

[ICML 2025] Official code repository for "Gradual Transition from Bellman...

21
Experimental
247 a7med3laa/DRL-Books-resources

Deep Reinforcement Learning Books and links for studying

21
Experimental
248 julia-bel/MAPF_G2RL

Implementation of the G2RL approach in the POGEMA environment

21
Experimental
249 iliasoroka1/GRU_Lyapunov_Spectrum

Lyapunov Spectrum for Double Pendulum using GRU

21
Experimental
250 zhuzhipeng-123/reinforce-study-for-mmm

Reinforcement Learning Research - Exploring RL algorithms in practical scenarios

21
Experimental
251 Jcorrieri/multiagent-gridworld

Deep Reinforcement Learning for Multi-Robot Path Planning using PyTorch, Ray...

21
Experimental
252 mbar0075/Advanced-Reinforcement-Learning

Deliverables relating to the Advanced Reinforcement Learning University Unit

20
Experimental
253 AndersonPeng/ppo_tutorial

PPO pytorch tutorial for continuous control (BipedalWalker-v3)

20
Experimental
254 manjavacas/rl-temario

Syllabus on reinforcement learning, in Spanish.

20
Experimental
255 soovittt/RL-Studio

A full-stack platform for designing reinforcement learning environments,...

20
Experimental
256 rStar-RL/LoongRL

LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts...

20
Experimental
257 uzumstanley/DEEP-LEARNING

UNIVERSITY OF ROEHAMPTON LONDON

20
Experimental
258 CarsonScott/Dual-Process-Reinforcement

An intelligent agent that adaptively changes its thought processes to...

19
Experimental
259 TroddenSpade/Maximum-Entropy-Deep-IRL

Implementations of Maximum Entropy Algorithms for solving Inverse...

19
Experimental
260 brianspiering/rl-course

Applied Reinforcement Learning course

19
Experimental
261 ashworks1706/kaelum

LATS-based inference with a reward model and online policy router across...

19
Experimental
262 openpsi-projects/srl

SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores

19
Experimental
263 gabotechs/lazaro

Reinforcement learning framework for implementing custom models on custom...

19
Experimental
264 ArdavanKhalij/RL-Seminar-Project

This project is the project of RL course at Vrije Universiteit Brussels and...

18
Experimental
265 TroddenSpade/Meta-Reinforcement-Learning

Code snippets of Meta Reinforcement Learning algorithms

18
Experimental
266 mhahsler/Introduction_to_Reinforcement_Learning

Material for an introductory course on reinforcement learning for computer scientists

18
Experimental
267 zchuning/repo

Resilient Model-Based RL by Regularizing Posterior Predictability

18
Experimental
268 yamatokataoka/learning-from-human-preferences

Replication of Deep Reinforcement Learning from Human Preferences...

18
Experimental
269 ArdavanKhalij/MDP

machine-learning reinforcement-learning artificial-intelligence...

18
Experimental
270 Bonifatius94/rl-algos

SOTA Reinforcement Learning Algorithms

18
Experimental
271 mohamedrxo/ppo

A comprehensive repository for training OpenAI Gym environments using...

17
Experimental
272 mlnjsh/rl-book-labs

🎮 Interactive browser-based labs for "Complete Reinforcement Learning...

17
Experimental
273 Uzi-gpu/reinforcement-learning

Reinforcement Learning projects with Q-Learning, Actor-Critic, and REINFORCE...

17
Experimental
274 undextrois/reinforcement-learning

RL Experiments and what not

17
Experimental
275 micdestefano/micppo

An implementation of Proximal Policy Optimization (PPO)

17
Experimental
276 Saifahmadkhan/PlugNPlay

This library is a PlugNPlay version of our novel pipeline VacSIM. We have...

17
Experimental
277 Talendar/pyderl

Simple Deep Reinforcement Learning framework for Python.

17
Experimental
278 PieroMacaluso/collaboration-n-competition

Implementation of Multi-Agent Deep Deterministic Policy Gradient (MADDPG)...

17
Experimental
279 snthomps/rlhf-ppo-pipeline

RLHF/PPO Training Pipeline with Performance Profiling and Optimization Demonstrations

17
Experimental
280 kodok13/Label-Free-RLVR

📚 Explore a curated collection of research on Label-Free Reinforcement...

14
Experimental
281 yelurebajrang/HeteroRL_GEPO

⚡ Optimize heterogeneous reinforcement learning with GEPO for decentralized...

14
Experimental
282 Lare1998/rl-for-robotics

Reinforcement Learning applications for robotic control and task automation.

14
Experimental
283 PathumDilhara/RL-agent-for-CNN-hyper-parameter-optimization

A reinforcement learning (RL) based agent that automatically tunes...

14
Experimental
284 igeng/awesome-drl-cloud-scheduling

A curated list of research papers, code, and tools applying deep...

14
Experimental
285 Madid1976/reinforcement-learning-agents

Implementations of various reinforcement learning algorithms and agents for...

14
Experimental
286 thevilledev/elements-of-ai-idea

Project pitch on using reinforcement learning for resource scheduling

13
Experimental
287 Develop-Packt/Building-an-Artificial-Intelligence-Algorithm

Learn how to build a machine learning model and get started on the popular...

13
Experimental
288 Tahernezhad/Continuous-Control-Workbench

A clean PyTorch implementation of PPO, SAC, and TD3 made from scratch. It is...

13
Experimental
289 MatTheTab/GHOST_RL_materials

Materials for Reinforcement Learning and Machine Learning in games for GHOST.

13
Experimental
290 creativequotient/reinforcement-learning

Reinforcement learning algorithms implemented in PyTorch

12
Experimental
291 s1ddh-rth/HumanoidClimb-RL

This project explores the application of reinforcement learning (RL) to...

12
Experimental
292 alexkalergis/Robot-writing-via-Reinforcement-Learning

• Master Thesis • In this project we aim to create a virtual environment of...

11
Experimental
293 eg424/DDPG-Pendulum

Deep Deterministic Policy Gradient for Healthcare

11
Experimental
294 Pablo-Caballero-Maria/Reinforcement-learning-in-robotics

Jupyter notebook for reinforcement learning algorithm comparison

11
Experimental
295 fcrlab-unime/rl-examples

Collection of examples for Reinforcement Learning

11
Experimental
296 sagarnildass/Udacity-Deep-Reinforcement-Learning-Nanodegree

Projects and Exercises of Udacity Deep Reinforcement Learning Nanodegree

11
Experimental
297 PieroMacaluso/continuous-control

Implementation of Deep Deterministic Policy Gradient (DDPG) for the Reacher...

11
Experimental
298 PieroMacaluso/navigation

Implementation of Deep Q Network for the Banana environment in the context...

11
Experimental
299 juliusfrost/RL-library-activity-tracker

A simple RL library activity tracker with github badges

11
Experimental
300 LearningFish/rlai2cpp

C++ Implementation of Reinforcement Learning: An Introduction (2nd Edition)

11
Experimental
301 paogam1997/Deep-Reinforcement-Learning

This repository is still in progress, it contains the file created to the...

10
Experimental
302 niho/ReinforcementLearning

Deep Reinforcement Learning in Swift

10
Experimental
303 adielashrov/Enhance-DL-with-SBM-Modelsward2023

Repository for "Enhancing Deep Learning with Scenario-Based Override Rules:...

10
Experimental
304 mahfuztbt/Investigating-the-Use-of-Reinforcement-Learning-for-Autonomous-Decision-Making-in-Robotics

Hi, I am applying for my master's degree in China. This repo represents my...

10
Experimental