simmediumrlmetric · varies

TED: Training-Free Experience Distillation for Multimodal Reasoning

Description

Knowledge distillation is typically realized by transferring a teacher model's knowledge into a student's parameters through supervised or reinforcement-based optimization. While effective, such approaches require repeated parameter updates and large-scale training data, limiting their applicability in resource-constrained environments. In this work, we propose TED, a training-free, context-based distillation framework that shifts the update target of distillation from model parameters to an in-

Source

http://arxiv.org/abs/2603.26778v1