← Back to Benchmarks
simmediumpolicy-learningmetric · varies

Policy-Guided World Model Planning for Language-Conditioned Visual Navigation

Description

Navigating to a visually specified goal given natural language instructions remains a fundamental challenge in embodied AI. Existing approaches either rely on reactive policies that struggle with long-horizon planning, or employ world models that suffer from poor action initialization in high-dimensional spaces. We present PiJEPA, a two-stage framework that combines the strengths of learned navigation policies with latent world model planning for instruction-conditioned visual navigation. In the

Source

http://arxiv.org/abs/2603.25981v1