CSKrishna/Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting

We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting

/ 100

Experimental

This project helps energy traders or market participants in a day-ahead electricity market learn how to optimally bid for products. By analyzing market conditions like oil prices and weather, it helps determine the best price and quantity to bid to maximize profit. It takes market state information as input and provides an optimal bidding strategy as output for multiple competing agents.

No commits in the last 6 months.

Use this if you are a market participant in a competitive market and need to develop an automated, data-driven strategy to optimize your bidding policy against other hidden competitors.

Not ideal if your market is not competitive or if you have full visibility into your competitors' actions.

energy-trading market-bidding price-optimization competitive-strategy day-ahead-market

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 8 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Higher-rated alternatives

Toni-SM/skrl

Modular Reinforcement Learning (RL) library (implemented in PyTorch, JAX, and NVIDIA Warp) with...

facebookresearch/BenchMARL

BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL...

utiasDSL/gym-pybullet-drones

PyBullet Gymnasium environments for single and multi-agent reinforcement learning of quadcopter control

datamllab/rlcard

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

gtri/scrimmage

Multi-Agent Robotics Simulator

Explore AI Agents

All categories Trending AI Agent directory Insights