sim · medium · locomotion · metric · varies

Grounded Gesture Generation: Language, Motion, and Space

Description

Human motion generation has advanced rapidly in recent years, yet the critical problem of creating spatially grounded, context-aware gestures has been largely overlooked. Existing models typically specialize either in descriptive motion generation, such as locomotion and object interaction, or in isolated co-speech gesture synthesis aligned with utterance semantics. However, both lines of work often treat motion and environmental grounding separately, limiting advances toward embodied, communica

Source

http://arxiv.org/abs/2507.04522v1