simmediumatarimetric · varies

On Minimizing Adversarial Counterfactual Error in Adversarial RL

Description

Deep Reinforcement Learning (DRL) policies are highly susceptible to adversarial noise in observations, which poses significant risks in safety-critical scenarios. The challenge inherent to adversarial perturbations is that by altering the information observed by the agent, the state becomes only partially observable. Existing approaches address this by either enforcing consistent actions across nearby states or maximizing the worst-case value within adversarially perturbed observations. However

Source

http://arxiv.org/abs/2406.04724v4