Div-Infinity/XQL

Extreme Q-Learning: Max Entropy RL without Entropy


Emerging

This project implements Extreme Q-Learning (XQL), an approach to reinforcement learning (RL) aimed at problems with continuous action spaces. Its algorithms learn from live environment interaction or from previously collected data and produce action-selection policies. It is intended for machine learning researchers and practitioners who develop or apply RL agents.

No commits in the last 6 months.

Use this if you are working on reinforcement learning tasks with continuous action spaces and need a more efficient and robust way to estimate optimal 'Q-values' or 'soft-values' for policy improvement.

Not ideal if you are new to reinforcement learning or primarily work with discrete action spaces where traditional Q-learning methods suffice.
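The 'soft-values' mentioned above can be illustrated with a small, dependency-free sketch. It assumes the Gumbel-regression objective described in the Extreme Q-Learning paper, whose minimizer is a log-mean-exp of the sampled Q-values. The function names (`gumbel_loss`, `soft_value`, `fit_by_gradient_descent`) are hypothetical and are not taken from this repository's code.

```python
import math

def gumbel_loss(h, samples, beta):
    """Mean Gumbel-regression loss of a scalar estimate h (illustrative form)."""
    total = 0.0
    for x in samples:
        z = (x - h) / beta
        total += math.exp(z) - z - 1.0
    return total / len(samples)

def soft_value(samples, beta):
    """Closed-form minimizer: beta * log(mean(exp(x / beta))), computed stably."""
    m = max(samples)  # subtract the max before exponentiating to avoid overflow
    mean_exp = sum(math.exp((x - m) / beta) for x in samples) / len(samples)
    return m + beta * math.log(mean_exp)

def fit_by_gradient_descent(samples, beta, lr=0.1, steps=2000):
    """Minimize gumbel_loss over h with plain gradient descent."""
    h = sum(samples) / len(samples)  # start from the sample mean
    for _ in range(steps):
        # d/dh [exp(z) - z - 1] with z = (x - h) / beta is (1 - exp(z)) / beta
        grad = sum(1.0 - math.exp((x - h) / beta) for x in samples) / (beta * len(samples))
        h -= lr * grad
    return h

if __name__ == "__main__":
    q_samples = [0.0, 1.0, 2.0, 3.0]  # made-up Q-values for sampled actions
    print("closed form:", soft_value(q_samples, beta=1.0))
    print("gradient fit:", fit_by_gradient_descent(q_samples, beta=1.0))
```

The estimate lies between the mean and the maximum of the samples, and a smaller `beta` pushes it toward the maximum. This is why such a regression can stand in for an explicit max over a continuous action space.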

reinforcement-learning machine-learning-research robotics-control optimal-control AI-algorithms

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 8 / 25

Community 13 / 25


Language: Python

License: none listed

Higher-rated alternatives

qgallouedec/panda-gym

Set of robotic environments based on PyBullet physics engine and gymnasium.

nicrusso7/rex-gym

OpenAI Gym environments for an open-source quadruped robot (SpotMicro)

amazon-science/auction-gym

AuctionGym is a simulation environment that enables reproducible evaluation of bandit and...

upb-lea/openmodelica-microgrid-gym

OpenModelica Microgrid Gym (OMG): An OpenAI Gym Environment for Microgrids

vietnh1009/Super-mario-bros-A3C-pytorch

Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros
