simmediumoffline-rlmetric · varies

Offline Goal-conditioned Reinforcement Learning with Quasimetric Representations

Description

Approaches for goal-conditioned reinforcement learning (GCRL) often use learned state representations to extract goal-reaching policies. Two frameworks for representation structure have yielded particularly effective GCRL algorithms: (1) *contrastive representations*, in which methods learn "successor features" with a contrastive objective that performs inference over future outcomes, and (2) *temporal distances*, which link the (quasimetric) distance in representation space to the transit time

Source

http://arxiv.org/abs/2509.20478v2