phy-q/benchmark

Phy-Q: A Testbed for Physical Reasoning

37 / 100 (Emerging)

This project provides a benchmark to test how well AI agents understand and react to real-world physics, much as humans or robots must. It takes an AI agent as input and evaluates its ability to solve tasks in a simulated environment based on 15 physical scenarios (such as rolling, falling, or structural stability). The output is a "Phy-Q score" that measures the agent's physical reasoning intelligence. This is for AI researchers and developers working on intelligent agents for robotics or other physical interaction systems.
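
To make the evaluation flow concrete, here is a minimal sketch of scoring an agent across scenarios. The PhysicsTask and Agent interfaces and the aggregation rule below are illustrative assumptions for this page, not the repository's actual API; the real benchmark defines its own tasks and Phy-Q aggregation.

from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class PhysicsTask:
    scenario: str                   # e.g. "rolling", "falling", "stability" (hypothetical labels)
    run: Callable[["Agent"], bool]  # returns True if the agent solved the task


class Agent:
    """Hypothetical agent interface: observe the scene, pick an action."""

    def act(self, observation):
        raise NotImplementedError


def evaluate(agent: Agent, tasks: List[PhysicsTask]) -> Dict[str, float]:
    """Return per-scenario pass rates for the given agent."""
    solved: Dict[str, List[bool]] = {}
    for task in tasks:
        solved.setdefault(task.scenario, []).append(task.run(agent))
    return {s: sum(results) / len(results) for s, results in solved.items()}


def phy_q_score(pass_rates: Dict[str, float]) -> float:
    """Toy aggregate: mean pass rate over scenarios, scaled to 0-100.

    The actual Phy-Q score uses the benchmark's own aggregation; this
    placeholder only shows where such a reduction would happen.
    """
    return 100.0 * sum(pass_rates.values()) / len(pass_rates)


# Example: a trivial agent evaluated on two toy tasks.
tasks = [
    PhysicsTask("rolling", lambda agent: True),
    PhysicsTask("falling", lambda agent: False),
]
print(phy_q_score(evaluate(Agent(), tasks)))  # 50.0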

No commits in the last 6 months.

Use this if you are developing or evaluating AI agents that need to reason about physical interactions and make decisions in dynamic environments.

Not ideal if you are looking for a general-purpose AI benchmark; this one focuses specifically on physical reasoning in simulated environments.

robotics · AI agent development · physical simulation · intelligent systems · reasoning evaluation
Stale (6m) · No Package · No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 16 / 25
Community 13 / 25


Stars: 45
Forks: 6
Language: Python
License: MIT
Last pushed: Jul 29, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/phy-q/benchmark"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
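
If you prefer to query the endpoint from Python instead of curl, a minimal sketch is below. It assumes the endpoint returns JSON; no field names are assumed beyond what the payload itself reveals.

import requests

# Same endpoint as the curl example above.
url = "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/phy-q/benchmark"
response = requests.get(url, timeout=10)
response.raise_for_status()

data = response.json()
print(data)  # inspect the payload to see the actual fields returned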