mees/hulc2
[ICRA2023] Grounding Language with Visual Affordances over Unstructured Data
This project helps robotics engineers train robots to understand and perform complex tasks from natural language instructions. It takes in large datasets of robot interactions, consisting of visual observations with only sparse language annotations, and outputs a trained robot policy capable of executing multi-step commands in real-world scenarios. Robotics researchers and developers building intelligent robotic systems would find this useful.
No commits in the last 6 months.
Use this if you need to efficiently train a robotic arm to follow abstract, multi-step natural language commands, especially with limited language-annotated data.
Not ideal if your robot tasks are simple, repetitive, or don't require high-level language understanding and generalization.
Stars
45
Forks
4
Language
Python
License
MIT
Category
Computer Vision
Last pushed
Oct 29, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/mees/hulc2"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
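The same record can be fetched programmatically. Below is a minimal Python sketch, assuming only that the endpoint from the curl example above returns JSON; the response schema is not documented here, so the snippet simply prints whatever comes back rather than assuming specific field names.

import json
import urllib.request

# Endpoint copied from the curl example above.
URL = "https://pt-edge.onrender.com/api/v1/quality/computer-vision/mees/hulc2"

# Fetch the record and decode it as JSON.
with urllib.request.urlopen(URL, timeout=10) as resp:
    data = json.loads(resp.read().decode("utf-8"))

# Inspect the returned fields; adjust to the real schema as needed.
print(json.dumps(data, indent=2))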
Higher-rated alternatives
andyzeng/apc-vision-toolbox
MIT-Princeton Vision Toolbox for the Amazon Picking Challenge 2016 - RGB-D ConvNet-based object...
OSU-NLP-Group/UGround
[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
Ewenwan/MVision
Robot vision for mobile robots: VS-SLAM, ORB-SLAM2, deep-learning object detection (YOLOv3), action detection, OpenCV, PCL, machine learning, autonomous driving
leggedrobotics/wild_visual_navigation
Wild Visual Navigation: A system for fast traversability learning via pre-trained models and...
microsoft/event-vae-rl
Visuomotor policies from event-based cameras through representation learning and reinforcement...