simmediumimitationmetric · varies

Prediction with Action: Visual Policy Learning via Joint Denoising Process

Description

Diffusion models have demonstrated remarkable capabilities in image generation tasks, including image editing and video creation, representing a good understanding of the physical world. On the other line, diffusion models have also shown promise in robotic control tasks by denoising actions, known as diffusion policy. Although the diffusion generative model and diffusion policy exhibit distinct capabilities--image prediction and robotic action, respectively--they technically follow a similar de

Source

http://arxiv.org/abs/2411.18179v1