← Back to Benchmarks
simmediumroboticsmetric · varies

LEAP: LLM-Generation of Egocentric Action Programs

Description

We introduce LEAP (illustrated in Figure 1), a novel method for generating video-grounded action programs through use of a Large Language Model (LLM). These action programs represent the motoric, perceptual, and structural aspects of action, and consist of sub-actions, pre- and post-conditions, and control flows. LEAP's action programs are centered on egocentric video and employ recent developments in LLMs both as a source for program knowledge and as an aggregator and assessor of multimodal vid

Source

http://arxiv.org/abs/2312.00055v1