← Back to Benchmarks
simmediummobile-manipulationmetric · varies
OVAL-Grasp: Open-Vocabulary Affordance Localization for Task Oriented Grasping
Description
To manipulate objects in novel, unstructured environments, robots need task-oriented grasps that target object parts based on the given task. Geometry-based methods often struggle with visually defined parts, occlusions, and unseen objects. We introduce OVAL-Grasp, a zero-shot open-vocabulary approach to task-oriented, affordance based grasping that uses large-language models and vision-language models to allow a robot to grasp objects at the correct part according to a given task. Given an RGB