simmediumoffline-rlmetric · varies

Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model

Description

Uncertainty quantification is a critical aspect of reinforcement learning and deep learning, with numerous applications ranging from efficient exploration and stable offline reinforcement learning to outlier detection in medical diagnostics. The scale of modern neural networks, however, complicates the use of many theoretically well-motivated approaches such as full Bayesian inference. Approximate methods like deep ensembles can provide reliable uncertainty estimates but still remain computation

Source

http://arxiv.org/abs/2503.11339v2