sim · medium · vision-robot · metric · varies

3DSPA: A 3D Semantic Point Autoencoder for Evaluating Video Realism

Description

AI video generation is evolving rapidly. For video generators to be useful in applications ranging from robotics to film-making, they must consistently produce realistic videos. However, evaluating the realism of generated videos remains a largely manual process, requiring either human annotation or bespoke evaluation datasets of restricted scope. Here we develop an automated evaluation framework for video realism that captures both semantics and coherent 3D structure and that does not req

Source

http://arxiv.org/abs/2602.20354v1