← Back to Benchmarks
simmediumroboticsmetric · varies
HCLSM: Hierarchical Causal Latent State Machines for Object-Centric World Modeling
Description
World models that predict future states from video remain limited by flat latent representations that entangle objects, ignore causal structure, and collapse temporal dynamics into a single scale. We present HCLSM, a world model architecture that operates on three interconnected principles: object-centric decomposition via slot attention with spatial broadcast decoding, hierarchical temporal dynamics through a three-level engine combining selective state space models for continuous physics, spar