sim · medium · robotics · metric · varies
Event-Driven Proactive Assistive Manipulation with Grounded Vision-Language Planning
Description
Assistance in collaborative manipulation is often initiated by user instructions, making high-level reasoning request-driven. In fluent human teamwork, however, partners often infer the next helpful step from the observed outcome of an action rather than waiting for instructions. Motivated by this, we introduce a shift from request-driven assistance to event-driven proactive assistance, where robot actions are initiated by workspace state transitions induced by human-object interactions rather