← Back to Benchmarks
simmediumroboticsmetric · varies
SABER: A Stealthy Agentic Black-Box Attack Framework for Vision-Language-Action Models
Description
Vision-language-action (VLA) models enable robots to follow natural-language instructions grounded in visual observations, but the instruction channel also introduces a critical vulnerability: small textual perturbations can alter downstream robot behavior. Systematic robustness evaluation therefore requires a black-box attacker that can generate minimal yet effective instruction edits across diverse VLA models. To this end, we present SABER, an agent-centric approach for automatically generatin