DHDev0/Muzero-unplugged

PyTorch implementation of MuZero Unplugged for Gym environments. The algorithm supports a wide range of action and observation spaces, both discrete and continuous.

Emerging

This project helps machine learning researchers and reinforcement learning practitioners train AI agents for complex environments, even when full simulators are unavailable or too slow. It takes expert demonstrations or previously generated agent experiences as input and produces a trained AI model capable of making decisions and achieving goals within a given environment. It's designed for those developing advanced AI for games, simulations, or control tasks.

No commits in the last 6 months.

Use this if you need to train a reinforcement learning agent for environments where you can provide expert play data or leverage past agent experiences, reducing reliance on real-time simulation.

Not ideal if you are a beginner in reinforcement learning, as this is an advanced implementation of a specific algorithm rather than a general-purpose introductory tool.

reinforcement-learning AI-training game-AI offline-RL decision-making

Stale 6m, No Package, No Dependents

Maintenance 2 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 11 / 25

Language Python

License GPL-3.0

Higher-rated alternatives

jonathan-laurent/AlphaZero.jl

A generic, simple and fast implementation of DeepMind's AlphaZero algorithm.

NeymarL/ChineseChess-AlphaZero

Implement AlphaZero/AlphaGo Zero methods on Chinese chess.

suragnair/alpha-zero-general

A clean implementation based on AlphaZero for any game in any framework + tutorial +...

werner-duvaud/muzero-general

MuZero

mokemokechicken/reversi-alpha-zero

Reversi reinforcement learning by AlphaGo Zero methods.
