← Back to Benchmarks
simmediumsim-to-realmetric · varies

Direct Contact-Tolerant Motion Planning With Vision Language Models

Description

Navigation in cluttered environments often requires robots to tolerate contact with movable or deformable objects to maintain efficiency. Existing contact-tolerant motion planning (CTMP) methods rely on indirect spatial representations (e.g., prebuilt map, obstacle set), resulting in inaccuracies and a lack of adaptiveness to environmental uncertainties. To address this issue, we propose a direct contact-tolerant (DCT) planner, which integrates vision-language models (VLMs) into direct point per

Source

http://arxiv.org/abs/2603.05017v1