motokiomura/annealed-q-learning
[ICML 2025] Official code repository for "Gradual Transition from Bellman Optimality Operator to Bellman Operator in Online Reinforcement Learning"
This project offers a new way to train reinforcement learning (RL) agents for tasks with continuous actions, like robotics. It takes your existing actor-critic RL setup and modifies its learning process to achieve faster and more reliable training. This is for machine learning researchers and practitioners who develop and deploy RL agents for complex control problems.
No commits in the last 6 months.
Use this if you are working with continuous action reinforcement learning and want to accelerate training while improving the robustness and performance of your agents, especially in robotics or simulated control environments.
Not ideal if your primary focus is on discrete action spaces or if you are not already familiar with core reinforcement learning concepts and actor-critic methods.
Stars
8
Forks
—
Language
Python
License
MIT
Category
Last pushed
Jun 17, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/motokiomura/annealed-q-learning"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning...
Denys88/rl_games
RL implementations
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
yandexdataschool/Practical_RL
A course in reinforcement learning in the wild