ikostrikov/pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
This project helps machine learning researchers implement the Trust Region Policy Optimization (TRPO) algorithm for training AI agents. You provide a simulation environment, and it outputs a learned policy that enables an agent to perform tasks within that environment. This is for researchers and practitioners in reinforcement learning who need to experiment with or apply this specific policy optimization method.
450 stars. No commits in the last 6 months.
Use this if you are a reinforcement learning researcher specifically interested in implementing or evaluating the Trust Region Policy Optimization (TRPO) algorithm using PyTorch.
Not ideal if you are looking for the latest, state-of-the-art policy optimization method, as a newer variant (PPO) is generally recommended instead.
Stars
450
Forks
91
Language
Python
License
MIT
Category
Last pushed
Sep 13, 2018
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/ikostrikov/pytorch-trpo"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning...
Denys88/rl_games
RL implementations
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
yandexdataschool/Practical_RL
A course in reinforcement learning in the wild