simmediumrlmetric · varies

Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

Description

Large language models (LLMs) are becoming the foundation for autonomous agents that can use tools to solve complex tasks. Reinforcement learning (RL) has emerged as a common approach for injecting such agentic capabilities, but typically under tightly controlled training setups. It often depends on carefully constructed task-solution pairs and substantial human supervision, which creates a fundamental obstacle to open-ended self-evolution toward superintelligent systems. In this paper, we propos

Source

http://arxiv.org/abs/2602.21320v1