simmediumoffline-rlmetric · varies

UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning

Description

Graphical User Interface (GUI) agents have demonstrated remarkable progress in automating complex user interface interactions through reinforcement learning. However, current approaches face a fundamental dilemma: offline RL enables stable training on pre-collected trajectories, but struggles with multi-step task execution for lack of trajectory-level reward signals; online RL captures these signals through environment interaction, but suffers from sparse rewards and prohibitive deployment costs

Source

http://arxiv.org/abs/2509.11543v2