← Back to Benchmarks
simmediummobile-manipulationmetric · varies
Viewpoint-Agnostic Grasp Pipeline using VLM and Partial Observations
Description
Robust grasping in cluttered, unstructured environments remains challenging for mobile legged manipulators due to occlusions that lead to partial observations, unreliable depth estimates, and the need for collision-free, execution-feasible approaches. In this paper we present an end-to-end pipeline for language-guided grasping that bridges open-vocabulary target selection to safe grasp execution on a real robot. Given a natural-language command, the system grounds the target in RGB using open-vo