sim · medium · manipulation · metric: varies

STORM: Slot-based Task-aware Object-centric Representation for robotic Manipulation

Description

Visual foundation models provide strong perceptual features for robotics, but their dense representations lack explicit object-level structure, which limits robustness and controllability in manipulation tasks. We propose STORM (Slot-based Task-aware Object-centric Representation for robotic Manipulation), a lightweight object-centric adaptation module that augments frozen visual foundation models with a small set of semantic-aware slots for robotic manipulation. Rather than retraining large backbones
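To make the idea of "a small set of slots on top of frozen features" concrete, here is a minimal sketch of generic slot-style attention pooling in NumPy. It is not STORM's implementation: the description above does not specify the architecture, so the function name `slot_attention`, the random projection matrices, the number of slots, and the plain weighted-mean update (used here in place of a learned recurrent update) are all assumptions made for illustration. The `features` array stands in for patch tokens from a frozen backbone.

```python
import numpy as np


def softmax(x, axis):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def slot_attention(features, num_slots=4, iters=3, seed=0):
    """Group (n, d) patch features into num_slots slot vectors.

    Hypothetical sketch: projections are random rather than learned,
    and slots are updated by a weighted mean instead of a learned update.
    """
    rng = np.random.default_rng(seed)
    n, d = features.shape
    slots = rng.normal(size=(num_slots, d))
    w_q, w_k, w_v = (rng.normal(scale=d ** -0.5, size=(d, d)) for _ in range(3))

    # The backbone features are treated as fixed inputs (frozen backbone).
    k = features @ w_k
    v = features @ w_v

    for _ in range(iters):
        q = slots @ w_q
        logits = q @ k.T / np.sqrt(d)            # shape (num_slots, n)
        # Softmax over the slot axis: slots compete for each patch.
        attn = softmax(logits, axis=0)
        # Normalize per slot so each update is a weighted mean of values.
        weights = attn / (attn.sum(axis=1, keepdims=True) + 1e-8)
        slots = weights @ v
    return slots, attn


# Stand-in for frozen backbone output: 196 patch tokens of width 64.
features = np.random.default_rng(1).normal(size=(196, 64))
slots, attn = slot_attention(features)
print(slots.shape, attn.shape)
```

The softmax over the slot axis is the design choice that matters here: each patch distributes its attention across slots, which pushes different slots to cover different regions of the image. A downstream manipulation policy would then consume the handful of slot vectors instead of the dense feature map.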

Source

http://arxiv.org/abs/2601.20381v1