← Back to Benchmarks
simmediumnavigationmetric · varies

WebATLAS: An LLM Agent with Experience-Driven Memory and Action Simulation

Description

Large Language Model (LLM) web agents often struggle with long-horizon web navigation and web task completion in new websites, producing inefficient action sequences unless fine-tuned on environment-specific data. We show that experience-driven memory, combined with look-ahead action simulation, is sufficient for LLM agents to adapt to unseen web environments by remembering past failures and predicting the consequences of future actions. We introduce WebATLAS (Actor-Critic Task-completion with L

Source

http://arxiv.org/abs/2510.22732v2