Grid World Q-Learning ML Frameworks
Educational implementations of Q-learning and reinforcement learning algorithms applied to grid-based navigation environments. Includes projects demonstrating value iteration, SARSA, and related RL techniques in controlled simulations. Does NOT include general reinforcement learning frameworks, complex game AI, or non-grid-based environments.
43 grid world Q-learning frameworks are tracked. Two score above 50 (Established tier). The highest-rated is Farama-Foundation/stable-retro at 58/100 with 354 stars.
Get all 43 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=grid-world-q-learning&limit=20"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
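The same request can be made from a script. The following is a minimal sketch using only the Python standard library. The helper names `build_url` and `fetch` are hypothetical, introduced here for illustration. The query parameters are copied from the curl command above. The shape of the JSON response is not documented on this page, so the sketch returns the decoded payload without assuming any fields.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken verbatim from the curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="ml-frameworks", subcategory="grid-world-q-learning", limit=20):
    """Build the request URL with the same query parameters as the curl command.

    build_url is a hypothetical helper, not part of any published client.
    """
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch(url, timeout=10):
    """Fetch the URL and decode the body as JSON.

    The response schema is an unknown here, so the decoded object is
    returned as-is (it may be a list or a dict).
    """
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Example (performs a network request and counts against the daily quota):
#     data = fetch(build_url())
```

The curl command passes `limit=20`. Whether the API accepts a larger value, or how an API key would be supplied, is not stated here, so both are left out of the sketch.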
| # | Framework | Description | Score | Tier |
|---|---|---|---|---|
| 1 | Farama-Foundation/stable-retro | Retro games for Reinforcement Learning Research | 58 | Established |
| 2 | MatPoliquin/stable-retro-scripts | Train models on retro games. AI vs AI contest. PyTorch C++ plugin for... | | Established |
| 3 | nadavbh12/Retro-Learning-Environment | The Retro Learning Environment (RLE) -- a learning framework for AI | | Emerging |
| 4 | svpino/cs7641-assignment4 | CS7641 - Machine Learning - Assignment 4 - Markov Decision Processes | | Emerging |
| 5 | Avalon-Benchmark/avalon | A 3D video game environment and benchmark designed from scratch for... | | Emerging |
| 6 | abhinavcreed13/ai-reinforcement-learning | This project will implement value iteration and Q-learning. It will first... | | Emerging |
| 7 | F-a-b-r-i-z-i-o/maze-game | Q-learning project where an agent learns by itself to find the exit inside... | | Emerging |
| 8 | PhilippOesch/QuickRLGym.js | A small TypeScript/JavaScript Framework for implementing, testing, and... | | Emerging |
| 9 | LukaszMalucha/Pathfinder-AI | Flask RESTful App with Q-learning AI algorithm | | Experimental |
| 10 | tristansokol/Bobcats | Code used for the OpenAI Retro Contest for team Bobcats, spring 2018 | | Experimental |
| 11 | pcluteijn/MazeLearning | Q-Learning algorithm solves simple mazes. | | Experimental |
| 12 | vlfom/StepLearn | Applying Markov Decision Processes and Q-Learning to a robot movement model | | Experimental |
| 13 | RodneyShag/GridWorldMDP | Uses Markov decision processes (MDPs) and Temporal Difference (TD)... | | Experimental |
| 14 | htglvl/Final_prj | Auto CS2 bot using reinforcement learning | | Experimental |
| 15 | gaut2172/TreasureHuntGame | AI pathfinding project using deep reinforcement learning. Deep Q-learning... | | Experimental |
| 16 | EricChen0104/ppo-icm-maze-exploration | A curiosity-driven PPO + ICM reinforcement learning agent for autonomous... | | Experimental |
| 17 | thanhld94/Reinforcement-Learning | Reinforcement Learning and Adaptive Resonance Theory | | Experimental |
| 18 | mbeps/custom-q-learning-agent | Custom Reinforcement Learning agent built using Q-Learning. This is a... | | Experimental |
| 19 | LeoMartinezTAMUK/Markov_Decision_Process | This project implements a Markov Decision Process (MDP) using Reinforcement... | | Experimental |
| 20 | ms-daniel/Reinforcement-Learning | Program created in Java with Swing interface. | | Experimental |
| 21 | gtoubassi/SimpleReinforcementLearning | A demonstration of table based, SARSA reinforcement learning for a simple... | | Experimental |
| 22 | tctibbs/matchbox-rl | Matchbox-RL: A tangible reinforcement learning library for Python. Based on... | | Experimental |
| 23 | HarunaHaju/RL-STG | Play STG with Reinforcement Learning | | Experimental |
| 24 | bzznrc/rl-toybox | Tiny collection of minimal RL arcade-style games with shared training code | | Experimental |
| 25 | cabe9/NLBProject | Reproducible NLB'21 mc_maze benchmark with lagged PCA latent regression and... | | Experimental |
| 26 | Styleflo/Gridword-DQN | Gridworld for training a DQN, in order to understand how it works | | Experimental |
| 27 | MariaEduardaSampaio/TP3_Fundamentos-IA | Reinforcement Learning project implementing Q-Learning and optimal policy... | | Experimental |
| 28 | KriChau95/rl-cnn-maze-localization | This project implements and compares strategies for self-localization within... | | Experimental |
| 29 | yaacov/qlearner | Q learning algorithm example in Python | | Experimental |
| 30 | masoudslipknot/Reinforcment_Learning_ValueIteration | Reinforcement Learning project: Value Iteration Implementation. | | Experimental |
| 31 | Abdalkarim1979/Deep-Q-Learning-Maze-Explorer | Educational Q-Learning maze with TensorFlow neural network and Pygame visualization. | | Experimental |
| 32 | PhuongLe/deep-q-learning-robot | An implementation of Reinforcement Learning using the Q-Learning algorithm... | | Experimental |
| 33 | AlanDoesCS/Easy-Java-RL-Library | A simple RL library, with a focus on DQNs | | Experimental |
| 34 | Ojaswy/Reinforcement-Learning | Implementation of Reinforcement Learning | | Experimental |
| 35 | KaranAnchan/MDP_Optimal_Grid_Sim | Explore the Gridworld Simulation 🌍🚀! An agent navigates a 5x5 grid to... | | Experimental |
| 36 | sababot/q-learning | A Q-Learning implementation in C++, ready to be tweaked and modified to new projects | | Experimental |
| 37 | asaMavhungu/Maze-runner | PyTorch-based FourRooms Environment with Epsilon-Greedy Policy | | Experimental |
| 38 | solar-san/AIDA-Project | Final assignment for a university project, name of the course: Artificial... | | Experimental |
| 39 | dizys/nyu-ai-lab-3 | NYU Artificial Intelligence Course Lab 3: A generic Markov process solver. | | Experimental |
| 40 | Aatamte/gridworld-economy | A gym-style API for training/evaluating agents in an economy | | Experimental |
| 41 | KaranAnchan/Windy_GridWorld_Sim | 🌍💨 Explore the Windy Gridworld! An agent navigates a 10x7 grid facing wind... | | Experimental |
| 42 | itsamirrezah/MazeQLearning | Implementation of Q-Learning Algorithm | | Experimental |
| 43 | manassarpatwar/QLearningJS | An implementation of Q-Learning algorithm in JavaScript | | Experimental |