Skip to main content
CenXiv.org
This website is in trial operation, support us!
We gratefully acknowledge support from all contributors.
Contribute
Donate
cenxiv logo > stat > arXiv:1911.03405v1

Help | Advanced Search

Statistics > Machine Learning

arXiv:1911.03405v1 (stat)
[Submitted on 8 Nov 2019 ]

Title: Theoretical Guarantees for Model Auditing with Finite Adversaries

Title: 模型审计的理论保证与有限对手

Authors:Mario Diaz, Peter Kairouz, Jiachun Liao, Lalitha Sankar
Abstract: Privacy concerns have led to the development of privacy-preserving approaches for learning models from sensitive data. Yet, in practice, even models learned with privacy guarantees can inadvertently memorize unique training examples or leak sensitive features. To identify such privacy violations, existing model auditing techniques use finite adversaries defined as machine learning models with (a) access to some finite side information (e.g., a small auditing dataset), and (b) finite capacity (e.g., a fixed neural network architecture). Our work investigates the requirements under which an unsuccessful attempt to identify privacy violations by a finite adversary implies that no stronger adversary can succeed at such a task. We do so via parameters that quantify the capabilities of the finite adversary, including the size of the neural network employed by such an adversary and the amount of side information it has access to as well as the regularity of the (perhaps privacy-guaranteeing) audited model.
Abstract: 隐私问题导致了从敏感数据中学习模型的隐私保护方法的发展。 然而,在实际中,即使具有隐私保证的模型也可能无意中记住独特的训练示例或泄露敏感特征。 为了识别此类隐私违规行为,现有的模型审计技术使用定义为具有(a)对某些有限辅助信息(例如,一个小的审计数据集)的访问权限,以及(b)有限能力(例如,固定的神经网络架构)的机器学习模型的有限对手。 我们的工作研究了在何种条件下,有限对手未能识别隐私违规尝试意味着没有更强的对手能够成功完成此类任务。 我们通过量化有限对手能力的参数来实现这一点,包括有限对手使用的神经网络的大小、它能访问的辅助信息量以及被审计模型(可能是具有隐私保证的模型)的规则性。
Comments: 18 pages, 1 figure
Subjects: Machine Learning (stat.ML) ; Machine Learning (cs.LG)
Cite as: arXiv:1911.03405 [stat.ML]
  (or arXiv:1911.03405v1 [stat.ML] for this version)
  https://doi.org/10.48550/arXiv.1911.03405
arXiv-issued DOI via DataCite

Submission history

From: Mario Diaz [view email]
[v1] Fri, 8 Nov 2019 17:39:00 UTC (43 KB)
Full-text links:

Access Paper:

    View a PDF of the paper titled
  • View Chinese PDF
  • View PDF
  • TeX Source
view license
Current browse context:
stat.ML
< prev   |   next >
new | recent | 2019-11
Change to browse by:
cs
cs.LG
stat

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar
a export BibTeX citation Loading...

BibTeX formatted citation

×
Data provided by:

Bookmark

BibSonomy logo Reddit logo

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)

Code, Data and Media Associated with this Article

alphaXiv (What is alphaXiv?)
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Hugging Face (What is Huggingface?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)

Demos

Replicate (What is Replicate?)
Hugging Face Spaces (What is Spaces?)
TXYZ.AI (What is TXYZ.AI?)

Recommenders and Search Tools

Influence Flower (What are Influence Flowers?)
CORE Recommender (What is CORE?)
IArxiv Recommender (What is IArxiv?)
  • Author
  • Venue
  • Institution
  • Topic

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack

京ICP备2025123034号