kohjingyu/search-agents

Code for the paper 🌳 Tree Search for Language Model Agents

/ 100

Emerging

This project helps AI researchers evaluate how well language model agents can navigate and perform multi-step tasks in complex, interactive online environments like e-commerce sites or social media platforms. You provide access to various simulated web environments and API keys for large language models, and the system outputs performance metrics and detailed agent trajectories. This is ideal for researchers studying advanced AI planning and exploration with language models.

220 stars. No commits in the last 6 months.

Use this if you are an AI researcher developing and testing language model agents for complex, multi-step tasks in web environments and need a robust framework for evaluation.

Not ideal if you are looking for a ready-to-use tool to automate web tasks for business or personal use, as this is a research framework for agent development.

AI-research language-model-evaluation agent-planning web-automation-research interactive-environments

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 15 / 25


Stars

220

Forks

Language

Python

License

MIT

Higher-rated alternatives

FutureAGI/Xenoverse

Benchmarking general decision-making with open & random worlds

helios-base/helios-base

HELIOS base is a sample implementation of the simulated soccer team for the RoboCup Soccer Simulation.

google-deepmind/lab

A customisable 3D platform for agent-based AI research

helios-base/librcsc

A base library to develop a simulated soccer team for the RoboCup Soccer Simulation

google-deepmind/lab2d

A customisable 2D platform for agent-based AI research
