← Back to Benchmarks
simmediumvision-robotmetric · varies

SignSparK: Efficient Multilingual Sign Language Production via Sparse Keyframe Learning

Description

Generating natural and linguistically accurate sign language avatars remains a formidable challenge. Current Sign Language Production (SLP) frameworks face a stark trade-off: direct text-to-pose models suffer from regression-to-the-mean effects, while dictionary-retrieval methods produce robotic, disjointed transitions. To resolve this, we propose a novel training paradigm that leverages sparse keyframes to capture the true underlying kinematic distribution of human signing. By predicting dense

Source

http://arxiv.org/abs/2603.10446v3