← Back to Benchmarks
simmediumroboticsmetric · varies

Uncovering Linguistic Fragility in Vision-Language-Action Models via Diversity-Aware Red Teaming

Description

Vision-Language-Action (VLA) models have achieved remarkable success in robotic manipulation. However, their robustness to linguistic nuances remains a critical, under-explored safety concern, posing a significant safety risk to real-world deployment. Red teaming, or identifying environmental scenarios that elicit catastrophic behaviors, is an important step in ensuring the safe deployment of embodied AI agents. Reinforcement learning (RL) has emerged as a promising approach in automated red tea

Source

http://arxiv.org/abs/2604.05595v1