simmediumoffline-rlmetric · varies

LLM4Cov: Execution-Aware Agentic Learning for High-coverage Testbench Generation

Description

Execution-aware LLM agents offer a promising paradigm for learning from tool feedback, but such feedback is often expensive and slow to obtain, making online reinforcement learning (RL) impractical. High-coverage hardware verification exemplifies this challenge due to its reliance on industrial simulators and non-differentiable execution signals. We propose LLM4Cov, an offline agent-learning framework that models verification as memoryless state transitions guided by deterministic evaluators. Bu

Source

http://arxiv.org/abs/2602.16953v2