qzed/irl-maxent

Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning Implementation in Python

/ 100

Established

This project helps machine learning researchers or roboticists understand the underlying reward function that explains observed expert behavior. It takes demonstrations of an optimal agent's actions in an environment and outputs a reward function that can then be used to train new agents. This is ideal for those working on reinforcement learning tasks where direct reward function design is difficult.

312 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to infer the reward function driving expert demonstrations in a simulated or real-world environment.

Not ideal if you already have a well-defined reward function or are not working with sequential decision-making problems.

reinforcement-learning robotics imitation-learning optimal-control behavior-modeling

Stale 6m

Maintenance 0 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 22 / 25

How are scores calculated?

Stars

312

Forks

Language

Jupyter Notebook

License

MIT

Related frameworks

DLR-RM/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

google-deepmind/dm_control

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning...

Denys88/rl_games

RL implementations

pytorch/rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

yandexdataschool/Practical_RL

A course in reinforcement learning in the wild

Explore ML Frameworks

All categories Trending ML Framework directory Insights