bigai-nlco/langsuite
Official Repo of LangSuitE
This project helps evaluate how well large language models (LLMs) can control and interact within text-based environments for tasks like navigation and object manipulation. It takes in configurations for agents and environments, and a chosen LLM, to simulate and assess its performance in these 'embodied' scenarios. Researchers and AI developers working on improving LLM interaction with virtual worlds would find this useful.
No commits in the last 6 months.
Use this if you need a systematic way to test and benchmark large language models on their ability to perform actions and communicate in simulated text-based environments without a visual simulator.
Not ideal if you are looking for a visual simulation environment or a tool to directly deploy LLMs into real-world physical robots.
Stars
84
Forks
3
Language
Python
License
MIT
Category
Last pushed
Aug 15, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/bigai-nlco/langsuite"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
inclusionAI/AReaL
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
melih-unsal/DemoGPT
🤖 Everything you need to create an LLM Agent—tools, prompts, frameworks, and models—all in one place.
AOSSIE-Org/Perspective
Perspective analyzes your news or social feed and presents credible counter-narratives from...
expectedparrot/edsl
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social...
kaushikb11/awesome-llm-agents
A curated list of awesome LLM agents frameworks.