CartPole Reinforcement Learning ML Frameworks

Educational implementations of reinforcement learning algorithms (DQN, SARSA, Q-Learning, A2C, DDPG) applied specifically to the CartPole control problem. Does NOT include general RL frameworks, other environments/benchmarks, or non-RL control methods.

36 CartPole reinforcement learning frameworks are tracked. One scores above 50 (Established tier). The highest-rated is WilliamLwj/PyXAB at 55/100 with 127 stars.

Get all 36 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=cartpole-reinforcement-learning&limit=20"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
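The same request can be sketched in Python using only the standard library. The helper names `build_url` and `fetch_projects` are hypothetical, and the shape of the returned JSON is not documented on this page, so the sketch returns the parsed payload without interpreting it. The sample URL uses `limit=20` while 36 projects are tracked; whether a larger `limit` is accepted, or whether results are paginated, is an assumption to verify against the API.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="ml-frameworks",
              subcategory="cartpole-reinforcement-learning",
              limit=20):
    """Build the query URL; defaults reproduce the curl example exactly."""
    query = urlencode({"domain": domain,
                       "subcategory": subcategory,
                       "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=10):
    """Perform the GET request and return the parsed JSON payload as-is.

    The response schema is not described on this page, so no fields
    are assumed here.
    """
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects(build_url())` issues one request, which counts against the daily quota described above.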

# | Framework | Description | Score | Tier
1 | WilliamLwj/PyXAB | PyXAB - A Python Library for X-Armed Bandit and Online Blackbox Optimization... | 55 | Established
2 | jekyllstein/Reinforcement-Learning-Sutton-Barto-Exercise-Solutions | Chapter notes and exercise solutions for Reinforcement Learning: An... | 47 | Emerging
3 | cfoh/Multi-Armed-Bandit-Example | Learning Multi-Armed Bandits by Examples. Currently covering MAB, UCB,... | 44 | Emerging
4 | matteocasolari/reinforcement-learning-an-introduction-solutions | Implementations for solutions to programming exercises of Reinforcement... | 41 | Emerging
5 | BY571/Upside-Down-Reinforcement-Learning | Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on... | 41 | Emerging
6 | iamhectorotero/rlai-exercises | Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition] | 39 | Emerging
7 | hypnosapos/cartpole-rl-remote | CartPole game by Reinforcement Learning, a journey from training to inference | 39 | Emerging
8 | ocraft/rl-sandbox | Selected algorithms and exercises from the book Sutton, R. S. & Barto, A.:... | 34 | Emerging
9 | dynamicslab/MultiArm-Pendulum | This repository is for our paper: "The Experimental Multi-Arm Pendulum on a... | 34 | Emerging
10 | gerdm/reinforcement-learning | Repository of notes, code and notebooks in Python for the book... | 33 | Emerging
11 | nicklashansen/reinforcement-learning-sutton-barto | Personal repository for course on reinforcement learning. Includes... | 30 | Emerging
12 | thetawom/mabby | A multi-armed bandit (MAB) simulation library in Python | 29 | Experimental
13 | bprabhakar/upside-down-reinforcement-learning | PyTorch-based implementation of Upside Down Reinforcement Learning (UDRL) by... | 28 | Experimental
14 | singhsidhukuldeep/contextual-bandits | A comprehensive Python library implementing a variety of contextual and... | 27 | Experimental
15 | marlesson/meta-bandit-selector | The Contextual Meta-Bandit (CMB) can be used to select models using the... | 27 | Experimental
16 | pacalab/rl_sutton_barto | Reinforcement Learning (Sutton, Barto) - solved exercises | 22 | Experimental
17 | victor-iyi/multi-armed-bandit-with-policy-gradient | A multi armed bandit Reinforcement learning problem using Policy Gradient. | 21 | Experimental
18 | Maskiabdo97/cartpole-ts | 🤖 Control a cart-pole system using TypeScript with a focus on implementing... | 21 | Experimental
19 | kambhampati-vijaya-sri-vyshnavi-devi89/dqn-rl-agent | DQN agent solving CartPole-v1 and LunarLander-v2 with Experience Replay,... | 21 | Experimental
20 | navidadkhah/CartPole-V1 | CartPole problem solved using two Reinforcement learning algorithms (DQN and... | 20 | Experimental
21 | cezarbulancea/CartPole-RL | Implementation of several RL algorithms on the CartPole-v1 environment. | 19 | Experimental
22 | SanketAgrawal/ReinforcementLearning | Chapter-wise implementation & analysis of all the algorithms in RL: An... | 19 | Experimental
23 | rmitsuboshi/bandit | A small collection of Bandit algorithms, written in Rust 🦀. | 19 | Experimental
24 | cezarbulancea/Multi-Armed-Bandits | Implementation of several multi-armed bandit problems. | 18 | Experimental
25 | gunh0/reinforcement-learning-cartpole-balancing | 📢 2019 Microsoft Student Partners (MSP) Evangelism Seminar - 2019.03.31 | 18 | Experimental
26 | iiShreya/cartPoleEnv_hillClimbingAlgo | Hill Climbing Algorithm implemented for the Cart Pole Environment. | 17 | Experimental
27 | bcorfman/sb3-trial | Stable Baselines 3 Cartpole example configured with Rye as dependency manager. | 17 | Experimental
28 | mtichikawa/bandit-ab-testing | Multi-armed bandit framework for adaptive A/B testing (Thompson Sampling,... | 14 | Experimental
29 | oalvarobraz/pytorch-cartpole-rl | A from-scratch Deep Reinforcement Learning (DQN) agent built with PyTorch to... | 14 | Experimental
30 | MikiTwenty/cart-pole-agent | Personal Project | 13 | Experimental
31 | zy31415/jackscarrental | Jack's Car Rental - A Reinforcement Learning Example Using Python (See... | 12 | Experimental
32 | formidablae/Batched_Multi-armed_Bandits | Batched Multi-armed Bandits Problem - Critical analysis. Artificial... | 12 | Experimental
33 | tentone/cart-pendulum | Small cart w/ inverted pendulum game for basic machine learning concept experiments. | 11 | Experimental
34 | renan-siqueira/reinforcement-learning-cart-pole | This repository provides implementations of a Q-learning agent to balance a... | 11 | Experimental
35 | LazyTurtle/Pendulum_Q_Learning | This repository contains the code necessary to run a Q-Learning algorithm... | 11 | Experimental
36 | DomenSoberl/swing-up | A DDPG solution to the cart-pole swing-up problem. | 11 | Experimental
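
The page does not state the exact score cut-offs between tiers. As a rough check, the sketch below tallies the scores and tiers transcribed from the listing above and reports the observed score range per tier. The name `summarize` is hypothetical, and the ranges are observations from these 36 rows only, not official thresholds.

```python
# (score, tier) pairs transcribed from the listing above, in rank order.
ROWS = (
    [(55, "Established")]
    + [(s, "Emerging") for s in (47, 44, 41, 41, 39, 39, 34, 34, 33, 30)]
    + [(s, "Experimental") for s in (29, 28, 27, 27, 22, 21, 21, 21, 20,
                                     19, 19, 19, 18, 18, 17, 17, 14, 14,
                                     13, 12, 12, 11, 11, 11, 11)]
)


def summarize(rows):
    """Return {tier: (count, lowest score, highest score)} for the rows."""
    scores_by_tier = {}
    for score, tier in rows:
        scores_by_tier.setdefault(tier, []).append(score)
    return {tier: (len(scores), min(scores), max(scores))
            for tier, scores in scores_by_tier.items()}


for tier, (count, low, high) in summarize(ROWS).items():
    print(f"{tier}: {count} projects, scores {low}-{high}")
```

In this data the only Established project scores 55, Emerging projects fall between 30 and 47, and Experimental projects between 11 and 29, which is consistent with the statement that one project scores above 50.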