Private Counterfactual Retrieval

Nomeir, Mohamed; Dissanayake, Pasan; Meel, Shreya; Dutta, Sanghamitra; Ulukus, Sennur

计算机科学 > 信息论

arXiv:2410.13812 (cs)

[提交于 2024年10月17日 (v1) ，最后修订 2025年7月24日 (此版本， v2)]

标题：私有反事实检索

标题： Private Counterfactual Retrieval

Authors:Mohamed Nomeir, Pasan Dissanayake, Shreya Meel, Sanghamitra Dutta, Sennur Ulukus

摘要：透明性和可解释性是在高风险应用中使用黑盒机器学习模型时需要考虑的两个极其重要的方面。提供反事实解释是满足这一要求的一种方式。然而，这也对提供解释的机构以及请求解释的用户的隐私构成威胁。在本工作中，我们提出了多种受私有信息检索（PIR）技术启发的方案，以确保在检索反事实解释时的\emph{用户隐私}。我们提出了一种方案，可以从接受点的数据库中检索出\emph{精确的}最近邻的反事实解释，同时为用户实现完美的（信息论意义上的）隐私。虽然该方案为用户实现了完美的隐私，但数据库上的某些泄露是不可避免的，我们使用基于互信息的度量来量化这种泄露。此外，我们提出了减少这种泄露的策略，以实现更高程度的数据库隐私。我们将这些方案扩展以纳入用户对其属性转换的偏好，从而获得更具操作性的解释。由于我们的方案依赖于有限域算术，我们在真实数据集上对方案进行了实证验证，以了解准确性和有限域大小之间的权衡。最后，我们展示了数值结果以支持我们的理论发现，并比较了所提方案的数据库泄露情况。

摘要： Transparency and explainability are two extremely important aspects to be considered when employing black-box machine learning models in high-stake applications. Providing counterfactual explanations is one way of fulfilling this requirement. However, this also poses a threat to the privacy of both the institution that is providing the explanation as well as the user who is requesting it. In this work, we propose multiple schemes inspired by private information retrieval (PIR) techniques which ensure the \emph{user's privacy} when retrieving counterfactual explanations. We present a scheme which retrieves the \emph{exact} nearest neighbor counterfactual explanation from a database of accepted points while achieving perfect (information-theoretic) privacy for the user. While the scheme achieves perfect privacy for the user, some leakage on the database is inevitable which we quantify using a mutual information based metric. Furthermore, we propose strategies to reduce this leakage to achieve an advanced degree of database privacy. We extend these schemes to incorporate user's preference on transforming their attributes, so that a more actionable explanation can be received. Since our schemes rely on finite field arithmetic, we empirically validate our schemes on real datasets to understand the trade-off between the accuracy and the finite field sizes. Finally, we present numerical results to support our theoretical findings, and compare the database leakage of the proposed schemes.

主题：	信息论 (cs.IT) ; 密码学与安全 (cs.CR); 机器学习 (cs.LG); 信号处理 (eess.SP)
引用方式：	arXiv:2410.13812 [cs.IT]
	(或者 arXiv:2410.13812v2 [cs.IT] 对于此版本)
	https://doi.org/10.48550/arXiv.2410.13812

提交历史

来自： Mohamed Nomeir [查看电子邮件]
[v1] 星期四， 2024 年 10 月 17 日 17:45:07 UTC (223 KB)
[v2] 星期四， 2025 年 7 月 24 日 17:25:40 UTC (150 KB)

计算机科学 > 信息论

标题：私有反事实检索

标题： Private Counterfactual Retrieval

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

计算机科学 > 信息论

标题： 私有反事实检索 显示英文标题

标题： Private Counterfactual Retrieval

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题：私有反事实检索