← Back to Benchmarks
simmediumrlmetric · varies

Evidence-based Distributional Alignment for Large Language Models

Description

Distributional alignment enables large language models (LLMs) to predict how a target population distributes its responses across answer options, rather than collapsing disagreement into a single consensus answer. However, existing LLM-based distribution prediction is often unstable and degrades under cultural and domain shift. Token score-based estimates can change with minor option wording or formatting, response sampling-based estimates are expensive and sensitive to prompts and decoding sett

Source

http://arxiv.org/abs/2603.13305v1