sim · medium · rl · metric: varies

ICPRL: Acquiring Physical Intuition from Interactive Control

Description

Vision-language models (VLMs) excel at static perception but falter at interactive reasoning in dynamic physical environments, which demands planning and adaptation to changing outcomes. Existing physical reasoning methods often depend on abstract symbolic inputs, or cannot learn and adapt from direct, pixel-based visual interaction in novel scenarios. We introduce ICPRL (In-Context Physical Reinforcement Learning), a framework inspired by In-Context Reinforcement Learning (ICRL) that empowers VLMs to acquire

Source

http://arxiv.org/abs/2603.13295v1