← Back to Benchmarks
simmediumroboticsmetric · varies

Deep Learning Aided Vision System for Planetary Rovers

Description

This study presents a vision system for planetary rovers, combining real-time perception with offline terrain reconstruction. The real-time module integrates CLAHE enhanced stereo imagery, YOLOv11n based object detection, and a neural network to estimate object distances. The offline module uses the Depth Anything V2 metric monocular depth estimation model to generate depth maps from captured images, which are fused into dense point clouds using Open3D. Real world distance estimates from the rea

Source

http://arxiv.org/abs/2603.26802v1