bin123apple/InfantAgent

[NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner.

36
/ 100
Emerging

InfantAgent is a tool for AI researchers to train and evaluate AI agents that can interact with a computer like a human. It takes in an AI model and provides a virtual desktop environment where the agent can control the mouse and keyboard to perform tasks. The output is a trained AI agent capable of autonomous computer operation.

Use this if you are an AI researcher developing and testing advanced multimodal agents that need to interact directly with a computer's desktop interface.

Not ideal if you are looking for a ready-to-use application to automate personal computer tasks or a simple browser automation tool.

AI-agent-development reinforcement-learning multimodal-AI AI-model-training human-computer-interaction-automation
No Package No Dependents
Maintenance 10 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 3 / 25

How are scores calculated?

Stars

35

Forks

1

Language

Python

License

Apache-2.0

Last pushed

Feb 25, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/bin123apple/InfantAgent"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.