asystemoffields/disco-torch
A PyTorch port of DeepMind's Disco103 — the meta-learned reinforcement learning update rule from Discovering State-of-the-art Reinforcement Learning Algorithms (Nature, 2025).
This project offers a pre-trained neural network, Disco103, that replaces traditional reinforcement learning loss functions. You feed it your agent's experiences, and it outputs specific loss targets, allowing your agent to learn by minimizing the difference. This tool is for AI researchers and practitioners who develop and train autonomous agents and want to simplify or improve their reinforcement learning pipelines.
Available on PyPI.
Use this if you are training a reinforcement learning agent and want to use a state-of-the-art, meta-learned update rule instead of manually designing complex loss functions like PPO or GRPO.
Not ideal if you need full transparency into the exact mathematical formulation of your loss function or if your project specifically requires fine-grained control over every aspect of the learning update rule.
Stars
9
Forks
—
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 08, 2026
Commits (30d)
0
Dependencies
2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/asystemoffields/disco-torch"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning...
Denys88/rl_games
RL implementations
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
yandexdataschool/Practical_RL
A course in reinforcement learning in the wild