ikostrikov/pytorch-trpo

PyTorch implementation of Trust Region Policy Optimization

/ 100

Established

This project helps machine learning researchers implement the Trust Region Policy Optimization (TRPO) algorithm for training AI agents. You provide a simulation environment, and it outputs a learned policy that enables an agent to perform tasks within that environment. This is for researchers and practitioners in reinforcement learning who need to experiment with or apply this specific policy optimization method.

450 stars. No commits in the last 6 months.

Use this if you are a reinforcement learning researcher specifically interested in implementing or evaluating the Trust Region Policy Optimization (TRPO) algorithm using PyTorch.

Not ideal if you are looking for the latest, state-of-the-art policy optimization method, as a newer variant (PPO) is generally recommended instead.

reinforcement-learning AI-agent-training policy-optimization robotics-simulation machine-learning-research

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 24 / 25

How are scores calculated?

Stars

450

Forks

Language

Python

License

MIT

Related frameworks

DLR-RM/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

google-deepmind/dm_control

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning...

Denys88/rl_games

RL implementations

pytorch/rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

yandexdataschool/Practical_RL

A course in reinforcement learning in the wild

Explore ML Frameworks

All categories Trending ML Framework directory Insights