← Back to Benchmarks
simmediummanipulation-datametric · varies
Language-Guided Grasp Detection with Coarse-to-Fine Learning for Robotic Manipulation
Description
Grasping is one of the most fundamental challenging capabilities in robotic manipulation, especially in unstructured, cluttered, and semantically diverse environments. Recent researches have increasingly explored language-guided manipulation, where robots not only perceive the scene but also interpret task-relevant natural language instructions. However, existing language-conditioned grasping methods typically rely on shallow fusion strategies, leading to limited semantic grounding and weak alig