simmediumimitationmetric · varies

Extremum Flow Matching for Offline Goal Conditioned Reinforcement Learning

Description

Imitation learning is a promising approach for enabling generalist capabilities in humanoid robots, but its scaling is fundamentally constrained by the scarcity of high-quality expert demonstrations. This limitation can be mitigated by leveraging suboptimal, open-ended play data, often easier to collect and offering greater diversity. This work builds upon recent advances in generative modeling, specifically Flow Matching, an alternative to Diffusion models. We introduce a method for estimating

Source

http://arxiv.org/abs/2505.19717v2