← Back to Benchmarks
simmediummanipulation-datametric · varies
Learning Semantic-Geometric Task Graph-Representations from Human Demonstrations
Description
Learning structured task representations from human demonstrations is essential for understanding long-horizon manipulation behaviors, particularly in bimanual settings where action ordering, object involvement, and interaction geometry can vary significantly. A key challenge lies in jointly capturing the discrete semantic structure of tasks and the temporal evolution of object-centric geometric relations in a form that supports reasoning over task progression. In this work, we introduce a seman