sim · medium · robotics · metric · varies
Uncovering Linguistic Fragility in Vision-Language-Action Models via Diversity-Aware Red Teaming
Description
Vision-Language-Action (VLA) models have achieved remarkable success in robotic manipulation. However, their robustness to linguistic nuances remains a critical, under-explored concern that poses a significant safety risk to real-world deployment. Red teaming, that is, identifying environmental scenarios that elicit catastrophic behaviors, is an important step in ensuring the safe deployment of embodied AI agents. Reinforcement learning (RL) has emerged as a promising approach in automated red tea