← Back to Benchmarks
simmediumroboticsmetric · varies

OMNI-PoseX: A Fast Vision Model for 6D Object Pose Estimation in Embodied Tasks

Description

Accurate 6D object pose estimation is a fundamental capability for embodied agents, yet remains highly challenging in open-world environments. Many existing methods often rely on closed-set assumptions or geometry-agnostic regression schemes, limiting their generalization, stability, and real-time applicability in robotic systems. We present OMNI-PoseX, a vision foundation model that introduces a novel network architecture unifying open-vocabulary perception with an SO(3)-aware reflected flow ma

Source

http://arxiv.org/abs/2604.02759v1