← Back to Benchmarks
simmediumpolicy-learningmetric · varies

Efficient Camera Pose Augmentation for View Generalization in Robotic Policy Learning

Description

Prevailing 2D-centric visuomotor policies exhibit a pronounced deficiency in novel view generalization, as their reliance on static observations hinders consistent action mapping across unseen views. In response, we introduce GenSplat, a feed-forward 3D Gaussian Splatting framework that facilitates view-generalized policy learning through novel view rendering. GenSplat employs a permutation-equivariant architecture to reconstruct high-fidelity 3D scenes from sparse, uncalibrated inputs in a sing

Source

http://arxiv.org/abs/2603.29192v1