← Back to Benchmarks
simmediumvision-robotmetric · varies

Yolo-Key-6D: Single Stage Monocular 6D Pose Estimation with Keypoint Enhancements

Description

Estimating the 6D pose of objects from a single RGB image is a critical task for robotics and extended reality applications. However, state-of-the-art multi stage methods often suffer from high latency, making them unsuitable for real time use. In this paper, we present Yolo-Key-6D, a novel single stage, end-to-end framework for monocular 6D pose estimation designed for both speed and accuracy. Our approach enhances a YOLO based architecture by integrating an auxiliary head that regresses the 2D

Source

http://arxiv.org/abs/2603.03879v1