simmediumrlmetric · varies

Hierarchical Lead Critic based Multi-Agent Reinforcement Learning

Description

Cooperative Multi-Agent Reinforcement Learning (MARL) solves complex tasks that require coordination from multiple agents, but is often limited to either local (independent learning) or global (centralized learning) perspectives. In this paper, we introduce a novel sequential training scheme and MARL architecture, which learns from multiple perspectives on different hierarchy levels. We propose the Hierarchical Lead Critic (HLC) - inspired by natural emerging distributions in team structures, wh

Source

http://arxiv.org/abs/2602.21680v1