← Back to Benchmarks
simmediumrlmetric · varies

Generalization in Online Reinforcement Learning for Mobile Agents

Description

Graphical user interface (GUI)-based mobile agents automate digital tasks on mobile devices by interpreting natural-language instructions and interacting with the screen. While recent methods apply reinforcement learning (RL) to train vision-language-model(VLM) agents in interactive environments with a primary focus on performance, generalization remains underexplored due to the lack of standardized benchmarks and open-source RL systems. In this work, we formalize the problem as a Contextual Mar

Source

http://arxiv.org/abs/2603.07432v1