Skip to main content
CenXiv.org
This website is in trial operation, support us!
We gratefully acknowledge support from all contributors.
Contribute
Donate
cenxiv logo > cs > arXiv:2308.16298

Help | Advanced Search

Computer Science > Cryptography and Security

arXiv:2308.16298 (cs)
[Submitted on 30 Aug 2023 (v1) , last revised 28 Jul 2025 (this version, v3)]

Title: Publishing Wikipedia usage data with strong privacy guarantees

Title: 以强大的隐私保证发布维基百科使用数据

Authors:Temilola Adeleye, Skye Berghel, Damien Desfontaines, Michael Hay, Isaac Johnson, Cléo Lemoisson, Ashwin Machanavajjhala, Tom Magerlein, Gabriele Modena, David Pujol, Daniel Simmons-Marengo, Hal Triedman
Abstract: For almost 20 years, the Wikimedia Foundation has been publishing statistics about how many people visited each Wikipedia page on each day. This data helps Wikipedia editors determine where to focus their efforts to improve the online encyclopedia, and enables academic research. In June 2023, the Wikimedia Foundation, helped by Tumult Labs, addressed a long-standing request from Wikipedia editors and academic researchers: it started publishing these statistics with finer granularity, including the country of origin in the daily counts of page views. This new data publication uses differential privacy to provide robust guarantees to people browsing or editing Wikipedia. This paper describes this data publication: its goals, the process followed from its inception to its deployment, the algorithms used to produce the data, and the outcomes of the data release.
Abstract: 近20年来,维基媒体基金会一直在发布关于每天每个维基百科页面有多少人访问的统计数据。 这些数据帮助 维基百科编辑确定他们应该将精力集中在哪些地方以改进在线百科全书,并促进了学术研究。 2023年6月,维基媒体 基金会,在Tumult Labs的帮助下,回应了维基百科编辑和学术研究人员长期以来的请求:它开始以更细粒度的方式发布这些统计数据,包括在每日页面浏览次数中加入来源国家。 这项新的数据发布使用差分隐私来为浏览或编辑维基百科的人提供强大的保障。 本文描述了这一数据发布:其目标,从开始到部署的过程,用于生成数据的算法,以及数据发布的成果。
Comments: 11 pages, 10 figures, Theory and Practice of Differential Privacy (TPDP) 2023
Subjects: Cryptography and Security (cs.CR)
Cite as: arXiv:2308.16298 [cs.CR]
  (or arXiv:2308.16298v3 [cs.CR] for this version)
  https://doi.org/10.48550/arXiv.2308.16298
arXiv-issued DOI via DataCite

Submission history

From: Hal Triedman [view email]
[v1] Wed, 30 Aug 2023 19:58:56 UTC (135 KB)
[v2] Fri, 1 Sep 2023 18:07:46 UTC (135 KB)
[v3] Mon, 28 Jul 2025 16:40:24 UTC (114 KB)
Full-text links:

Access Paper:

    View a PDF of the paper titled
  • View Chinese PDF
  • View PDF
  • HTML (experimental)
  • TeX Source
license icon view license
Current browse context:
cs.CR
< prev   |   next >
new | recent | 2023-08
Change to browse by:
cs

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar
a export BibTeX citation Loading...

BibTeX formatted citation

×
Data provided by:

Bookmark

BibSonomy logo Reddit logo

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)

Code, Data and Media Associated with this Article

alphaXiv (What is alphaXiv?)
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Hugging Face (What is Huggingface?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)

Demos

Replicate (What is Replicate?)
Hugging Face Spaces (What is Spaces?)
TXYZ.AI (What is TXYZ.AI?)

Recommenders and Search Tools

Influence Flower (What are Influence Flowers?)
CORE Recommender (What is CORE?)
IArxiv Recommender (What is IArxiv?)
  • Author
  • Venue
  • Institution
  • Topic

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack

京ICP备2025123034号