← Back to Benchmarks
simmediumroboticsmetric · varies

Cortical Policy: A Dual-Stream View Transformer for Robotic Manipulation

Description

View transformers process multi-view observations to predict actions and have shown impressive performance in robotic manipulation. Existing methods typically extract static visual representations in a view-specific manner, leading to inadequate 3D spatial reasoning ability and a lack of dynamic adaptation. Taking inspiration from how the human brain integrates static and dynamic views to address these challenges, we propose Cortical Policy, a novel dual-stream view transformer for robotic manip

Source

http://arxiv.org/abs/2603.21051v1