← Back to Benchmarks
simmediumatarimetric · varies

Planning and Learning Using Adaptive Entropy Tree Search

Description

Recent breakthroughs in Artificial Intelligence have shown that the combination of tree-based planning with deep learning can lead to superior performance. We present Adaptive Entropy Tree Search (ANTS) - a novel algorithm combining planning and learning in the maximum entropy paradigm. Through a comprehensive suite of experiments on the Atari benchmark we show that ANTS significantly outperforms PUCT, the planning component of the state-of-the-art AlphaZero system. ANTS builds upon recent work

Source

http://arxiv.org/abs/2102.06808v3