Generalizing while preserving monotonicity in comparison-based preference learning models

Fageot, Julien; Blanchard, Peva; Bareilles, Gilles; Hoang, Lê-Nguyên

数学 > 统计理论

arXiv:2506.08616v1 (math)

[提交于 2025年6月10日 ]

标题：在基于比较的偏好学习模型中推广并保持单调性

标题： Generalizing while preserving monotonicity in comparison-based preference learning models

Authors:Julien Fageot, Peva Blanchard, Gilles Bareilles, Lê-Nguyên Hoang

摘要：如果你告诉一个学习模型你更喜欢备选项$a$而不是另一个备选项$b$，那么你可能期望该模型是单调的，也就是说，$a$的估值增加，而$b$的估值减少。然而，也许令人惊讶的是，许多广泛部署的基于比较的偏好学习模型，包括大型语言模型，都无法保证这一点。直到现在，唯一被证明具有单调性的基于比较的偏好学习算法是广义布拉德利-特雷模型（Generalized Bradley-Terry 模型）。然而，这些模型无法推广到未比较的数据。在本文中，我们深入研究了具备泛化能力且单调的模型集合。具体来说，我们提出了一类新的带有扩散先验的线性广义布拉德利-特雷模型，并确定了关于备选项嵌入的充分条件，以确保单调性。我们的实验表明，这种单调性远非普遍保证，并且我们提出的这一类能够推广的新模型提高了准确性，尤其是在数据集有限的情况下。

摘要： If you tell a learning model that you prefer an alternative $a$ over another alternative $b$, then you probably expect the model to be monotone, that is, the valuation of $a$ increases, and that of $b$ decreases. Yet, perhaps surprisingly, many widely deployed comparison-based preference learning models, including large language models, fail to have this guarantee. Until now, the only comparison-based preference learning algorithms that were proved to be monotone are the Generalized Bradley-Terry models. Yet, these models are unable to generalize to uncompared data. In this paper, we advance the understanding of the set of models with generalization ability that are monotone. Namely, we propose a new class of Linear Generalized Bradley-Terry models with Diffusion Priors, and identify sufficient conditions on alternatives' embeddings that guarantee monotonicity. Our experiments show that this monotonicity is far from being a general guarantee, and that our new class of generalizing models improves accuracy, especially when the dataset is limited.

主题：	统计理论 (math.ST) ; 机器学习 (cs.LG)
引用方式：	arXiv:2506.08616 [math.ST]
	(或者 arXiv:2506.08616v1 [math.ST] 对于此版本)
	https://doi.org/10.48550/arXiv.2506.08616

提交历史

来自： Gilles Bareilles [查看电子邮件]
[v1] 星期二， 2025 年 6 月 10 日 09:24:08 UTC (122 KB)

数学 > 统计理论

标题：在基于比较的偏好学习模型中推广并保持单调性

标题： Generalizing while preserving monotonicity in comparison-based preference learning models

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

数学 > 统计理论

标题： 在基于比较的偏好学习模型中推广并保持单调性 显示英文标题

标题： Generalizing while preserving monotonicity in comparison-based preference learning models

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题：在基于比较的偏好学习模型中推广并保持单调性