Efficient Soft Actor-Critic with LLM-Based Action-Level Guidance for Continuous Control

Description

We present GuidedSAC, a novel reinforcement learning (RL) algorithm that facilitates efficient exploration in vast state-action spaces. GuidedSAC leverages large language models (LLMs) as intelligent supervisors that provide action-level guidance for the Soft Actor-Critic (SAC) algorithm. The LLM-based supervisor analyzes the most recent trajectory using state information and visual replays, offering action-level interventions that enable targeted exploration. Furthermore, we provide a theoretic
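
To make the idea of action-level guidance concrete, here is a minimal sketch of one possible data-collection loop. It is not the authors' implementation. The names `stub_supervisor`, `guided_action`, and `collect_episode` are hypothetical, the supervisor is a hand-written rule standing in for an LLM call, the environment is a toy 1-D point that should move toward the origin, and the SAC update itself is omitted. The sketch only assumes what the description states: the supervisor inspects the most recent trajectory and may intervene on individual actions.

```python
import random
from typing import Callable, List, Optional, Tuple

State = Tuple[float, ...]
Action = Tuple[float, ...]
# (state, action, reward, was_guided); the layout is an assumption of this sketch
Transition = Tuple[State, Action, float, bool]


def clip_action(action: Action, low: float = -1.0, high: float = 1.0) -> Action:
    """Clamp each action dimension to the valid range."""
    return tuple(min(high, max(low, a)) for a in action)


def guided_action(policy_action: Action, suggestion: Optional[Action]) -> Tuple[Action, bool]:
    """Use the supervisor's suggestion when there is one, else the policy's action."""
    if suggestion is not None:
        return clip_action(suggestion), True
    return clip_action(policy_action), False


def stub_supervisor(last_trajectory: List[Transition], state: State) -> Optional[Action]:
    """Hypothetical stand-in for the LLM supervisor (a fixed rule, not a model call)."""
    if not last_trajectory:
        return None  # nothing to analyze yet
    last_state = last_trajectory[-1][0]  # last recorded state of the previous episode
    if abs(last_state[0]) < 0.5:
        return None  # previous episode ended near the goal; leave the policy alone
    return (-state[0],)  # otherwise push the current state toward the origin


def collect_episode(
    policy: Callable[[State], Action],
    supervisor: Callable[[List[Transition], State], Optional[Action]],
    last_trajectory: List[Transition],
    start: State,
    horizon: int,
) -> List[Transition]:
    """Roll out one episode in the toy environment, applying interventions per step."""
    state, trajectory = start, []
    for _ in range(horizon):
        suggestion = supervisor(last_trajectory, state)
        action, was_guided = guided_action(policy(state), suggestion)
        next_state = (state[0] + action[0],)  # toy dynamics: x' = x + a
        reward = -abs(next_state[0])          # toy reward: stay close to 0
        trajectory.append((state, action, reward, was_guided))
        state = next_state
    return trajectory


# Usage: an unguided first episode, then a second one guided by its replay.
rng = random.Random(0)
random_policy = lambda s: (rng.uniform(-1.0, 1.0),)
first = collect_episode(random_policy, stub_supervisor, [], (3.0,), horizon=5)
second = collect_episode(random_policy, stub_supervisor, first, (3.0,), horizon=5)
```

In a full agent, the collected transitions would go into the replay buffer used by the SAC update. How guided and unguided transitions are weighted there is not specified in the description above, so the sketch leaves it out.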

Source

http://arxiv.org/abs/2603.17468v1