motokiomura/annealed-q-learning

[ICML 2025] Official code repository for "Gradual Transition from Bellman Optimality Operator to Bellman Operator in Online Reinforcement Learning"


Experimental

This project implements a training method for reinforcement learning (RL) agents on continuous-action tasks such as robotic control. It modifies the learning process of an existing actor-critic setup, gradually shifting from the Bellman optimality operator to the Bellman operator during training, with the aim of faster and more reliable learning. It is intended for machine learning researchers and practitioners who develop and deploy RL agents for complex control problems.
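As a rough illustration of what a gradual transition from the Bellman optimality operator to the Bellman operator can look like, the sketch below uses an expectile loss whose parameter `tau` is annealed over training. With `tau` close to 1 the critic is pulled toward the larger targets (closer to a max, as in the optimality operator), and with `tau = 0.5` the loss reduces to a scaled mean squared error (the ordinary expectation). This is an assumption made for illustration: the listing does not say how the repository implements the transition, and the names `annealed_tau` and `expectile_loss`, the linear schedule, and the default values are all hypothetical.

```python
def annealed_tau(step, total_steps, tau_start=0.9, tau_end=0.5):
    """Linearly move tau from tau_start to tau_end (hypothetical schedule).

    Assumes total_steps is a positive number.
    """
    frac = min(max(step / total_steps, 0.0), 1.0)
    return tau_start + frac * (tau_end - tau_start)


def expectile_loss(q_pred, td_target, tau):
    """Asymmetric squared error between critic predictions and TD targets.

    Errors where the target exceeds the prediction are weighted by tau,
    the others by (1 - tau). tau = 0.5 gives half the mean squared error.
    """
    total = 0.0
    for q, y in zip(q_pred, td_target):
        diff = y - q
        weight = tau if diff > 0 else 1.0 - tau
        total += weight * diff * diff
    return total / len(q_pred)


if __name__ == "__main__":
    q_pred = [0.0, 0.0]
    td_target = [2.0, -1.0]
    for step in (0, 50, 100):
        tau = annealed_tau(step, total_steps=100)
        print(step, round(tau, 3), round(expectile_loss(q_pred, td_target, tau), 3))
```

In an actual actor-critic agent the same weighting would be applied to the critic's batch loss inside the training loop, with `step` taken from the environment step counter.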

No commits in the last 6 months.

Use this if you work on continuous-action reinforcement learning and want to accelerate training while improving the robustness and performance of your agents, especially in robotics or simulated control environments.

Not ideal if your primary focus is on discrete action spaces or if you are not already familiar with core reinforcement learning concepts and actor-critic methods.

reinforcement-learning robotics continuous-control machine-learning-research agent-training

Stale 6m, No Package, No Dependents

Maintenance 2 / 25

Adoption 4 / 25

Maturity 15 / 25

Community 0 / 25


Language: Python

License: MIT

Higher-rated alternatives

DLR-RM/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

google-deepmind/dm_control

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning...

Denys88/rl_games

RL implementations

pytorch/rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

yandexdataschool/Practical_RL

A course in reinforcement learning in the wild
