CartPole Reinforcement Learning ML Frameworks

Educational implementations of reinforcement learning algorithms (DQN, SARSA, Q-Learning, A2C, DDPG) applied specifically to the CartPole control problem. Does NOT include general RL frameworks, other environments/benchmarks, or non-RL control methods.

There are 36 cartpole reinforcement learning frameworks tracked. 1 score above 50 (established tier). The highest-rated is WilliamLwj/PyXAB at 55/100 with 127 stars.

Get all 36 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=cartpole-reinforcement-learning&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Framework	Score	Tier	Stars	Language
1	WilliamLwj/PyXAB PyXAB - A Python Library for X-Armed Bandit and Online Blackbox Optimization...	55	Established	127	Python
2	jekyllstein/Reinforcement-Learning-Sutton-Barto-Exercise-Solutions Chapter notes and exercise solutions for Reinforcement Learning: An...	47	Emerging	49	Julia
3	cfoh/Multi-Armed-Bandit-Example Learning Multi-Armed Bandits by Examples. Currently covering MAB, UCB,...	44	Emerging	45	Python
4	matteocasolari/reinforcement-learning-an-introduction-solutions Implementations for solutions to programming exercises of Reinforcement...	41	Emerging	34	Python
5	BY571/Upside-Down-Reinforcement-Learning Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on...	41	Emerging	78	Jupyter Notebook
6	iamhectorotero/rlai-exercises Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition]	39	Emerging	155	Jupyter Notebook
7	hypnosapos/cartpole-rl-remote CartPole game by Reinforcement Learning, a journey from training to inference	39	Emerging	25	Python
8	ocraft/rl-sandbox Selected algorithms and exercises from the book Sutton, R. S. & Barton, A.:...	34	Emerging	6	Python
9	dynamicslab/MultiArm-Pendulum This repository is for our paper: "The Experimental Multi-Arm Pendulum on a...	34	Emerging	19	MATLAB
10	gerdm/reinforcement-learning Repository of notes, code and notebooks in Python for the book...	33	Emerging	37	Jupyter Notebook
11	nicklashansen/reinforcement-learning-sutton-barto Personal repository for course on reinforcement learning. Includes...	30	Emerging	2	Jupyter Notebook
12	thetawom/mabby A multi-armed bandit (MAB) simulation library in Python	29	Experimental	9	Python
13	bprabhakar/upside-down-reinforcement-learning Pytorch based implementation of Upside Down Reinforcement Learning (UDRL) by...	28	Experimental	11	Jupyter Notebook
14	singhsidhukuldeep/contextual-bandits A comprehensive Python library implementing a variety of contextual and...	27	Experimental	13	Python
15	marlesson/meta-bandit-selector The Contextual Meta-Bandit (CMB) can be used to select models using the...	27	Experimental	9	Jupyter Notebook
16	pacalab/rl_sutton_barto Reinforcement Learning (Sutton, Barto) - solved exercises	22	Experimental	2	Jupyter Notebook
17	victor-iyi/multi-armed-bandit-with-policy-gradient A multi armed bandit Reinforcement learning problem using Policy Gradient.	21	Experimental	9	Jupyter Notebook
18	Maskiabdo97/cartpole-ts 🤖 Control a cart-pole system using TypeScript with a focus on implementing...	21	Experimental	—	TypeScript
19	kambhampati-vijaya-sri-vyshnavi-devi89/dqn-rl-agent DQN agent solving CartPole-v1 and LunarLander-v2 with Experience Replay,...	21	Experimental	—	HTML
20	navidadkhah/CartPole-V1 CartPole problem solved using two Reinforcement learning algorithms (DQN and...	20	Experimental	6	Python
21	cezarbulancea/CartPole-RL Implementation of several RL algorithms on the CartPole-v1 environment.	19	Experimental	4	Python
22	SanketAgrawal/ReinforcementLearning Chapter wise implementation & analysis of all the algorithms in RL : An...	19	Experimental	3	Jupyter Notebook
23	rmitsuboshi/bandit A small collection of Bandit algorithms, written in Rust 🦀.	19	Experimental	3	Rust
24	cezarbulancea/Multi-Armed-Bandits Implementation of several multi-armed bandit problems.	18	Experimental	2	Python
25	gunh0/reinforcement-learning-cartpole-balancing 📢 2019 Microsoft Student Partners (MSP) Evangelism Seminar - 2019.03.31	18	Experimental	2	Jupyter Notebook
26	iiShreya/cartPoleEnv_hillClimbingAlgo Hill Climbing Algorithm implemented for the Cart Pole Environment.	17	Experimental	1	Jupyter Notebook
27	bcorfman/sb3-trial Stable Baselines 3 Cartpole example configured with Rye as dependency manager.	17	Experimental	1	Makefile
28	mtichikawa/bandit-ab-testing Multi-armed bandit framework for adaptive A/B testing (Thompson Sampling,...	14	Experimental	—	Jupyter Notebook
29	oalvarobraz/pytorch-cartpole-rl A from-scratch Deep Reinforcement Learning (DQN) agent built with PyTorch to...	14	Experimental	—	Python
30	MikiTwenty/cart-pole-agent Personal Project	13	Experimental	—	Jupyter Notebook
31	zy31415/jackscarrental Jack's Car Rental - A Reinforcement Learning Example Using Python (See...	12	Experimental	8	Python
32	formidablae/Batched_Multi-armed_Bandits Batched Multi-armed Bandits Problem - Analisi critica. Artificial...	12	Experimental	8	Python
33	tentone/cart-pendulum Small cart w/ inverted pendulum game for basic machine learning concept experiments.	11	Experimental	—	CoffeeScript
34	renan-siqueira/reinforcement-learning-cart-pole This repository provides implementations of a Q-learning agent to balance a...	11	Experimental	—	Python
35	LazyTurtle/Pendulum_Q_Learning This repository contains the code necessary to run a Q-Learning algorithm...	11	Experimental	—	Python
36	DomenSoberl/swing-up A DDPG solution to the cart-pole swing-up problem.	11	Experimental	—	C++

Comparisons in this category

reinforcement-learning-an-introduction-solutions and rlai-exercises (41 vs 39) rlai-exercises and rl-sandbox (39 vs 34) rlai-exercises and rl_sutton_barto (39 vs 22) Upside-Down-Reinforcement-Learning and upside-down-reinforcement-learning (41 vs 28) Reinforcement-Learning-Sutton-Barto-Exercise-Solutions and rl_sutton_barto (47 vs 22) reinforcement-learning-an-introduction-solutions and rl-sandbox (41 vs 34) reinforcement-learning and reinforcement-learning-sutton-barto (33 vs 30) reinforcement-learning and rl_sutton_barto (33 vs 22) reinforcement-learning-an-introduction-solutions and rl_sutton_barto (41 vs 22) rl-sandbox and rl_sutton_barto (34 vs 22)