← Back to Benchmarks
simmediumroboticsmetric · varies

Ego to World: Collaborative Spatial Reasoning in Embodied Systems via Reinforcement Learning

Description

Understanding the world from distributed, partial viewpoints is a fundamental challenge for embodied multi-agent systems. Each agent perceives the environment through an ego-centric view that is often limited by occlusion and ambiguity. To study this problem, we introduce the Ego-to-World (E2W) benchmark, which evaluates a vision-language model's ability to fuse heterogeneous viewpoints across three tasks: (i) global counting, (ii) relational location reasoning, and (iii) action-oriented graspin

Source

http://arxiv.org/abs/2603.14811v1