← Back to Benchmarks
simmediumimitationmetric · varies
Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning
Description
Diffusion Policies have become widely used in Imitation Learning, offering several appealing properties, such as generating multimodal and discontinuous behavior. As models are becoming larger to capture more complex capabilities, their computational demands increase, as shown by recent scaling laws. Therefore, continuing with the current architectures will present a computational roadblock. To address this gap, we propose Mixture-of-Denoising Experts (MoDE) as a novel policy for Imitation Learn