← Back to Benchmarks
simmediummanipulationmetric · varies

RoboMME: Benchmarking and Understanding Memory for Robotic Generalist Policies

Description

Memory is critical for long-horizon and history-dependent robotic manipulation. Such tasks often involve counting repeated actions or manipulating objects that become temporarily occluded. Recent vision-language-action (VLA) models have begun to incorporate memory mechanisms; however, their evaluations remain confined to narrow, non-standardized settings. This limits their systematic understanding, comparison, and progress measurement. To address these challenges, we introduce RoboMME: a large-s

Source

http://arxiv.org/abs/2603.04639v1