simmediumgraspingmetric · varies

GraspMolmo: Generalizable Task-Oriented Grasping via Large-Scale Synthetic Data Generation

Description

We present GrasMolmo, a generalizable open-vocabulary task-oriented grasping (TOG) model. GraspMolmo predicts semantically appropriate, stable grasps conditioned on a natural language instruction and a single RGB-D frame. For instance, given "pour me some tea", GraspMolmo selects a grasp on a teapot handle rather than its body. Unlike prior TOG methods, which are limited by small datasets, simplistic language, and uncluttered scenes, GraspMolmo learns from PRISM, a novel large-scale synthetic da

Source

http://arxiv.org/abs/2505.13441v3