Skip to main content
CenXiv.org
This website is in trial operation, support us!
We gratefully acknowledge support from all contributors.
Contribute
Donate
cenxiv logo > eess > arXiv:2403.19508v2

Help | Advanced Search

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2403.19508v2 (eess)
[Submitted on 28 Mar 2024 (v1) , last revised 8 Sep 2025 (this version, v2)]

Title: Fairness-Aware Data Augmentation for Cardiac MRI using Text-Conditioned Diffusion Models

Title: 基于文本条件扩散模型的心脏MRI公平感知数据增强

Authors:Grzegorz Skorupko, Richard Osuala, Zuzanna Szafranowska, Kaisar Kushibar, Vien Ngoc Dang, Nay Aung, Steffen E Petersen, Karim Lekadir, Polyxeni Gkontra
Abstract: While deep learning holds great promise for disease diagnosis and prognosis in cardiac magnetic resonance imaging, its progress is often constrained by highly imbalanced and biased training datasets. To address this issue, we propose a method to alleviate imbalances inherent in datasets through the generation of synthetic data based on sensitive attributes such as sex, age, body mass index (BMI), and health condition. We adopt ControlNet based on a denoising diffusion probabilistic model to condition on text assembled from patient metadata and cardiac geometry derived from segmentation masks. We assess our method using a large-cohort study from the UK Biobank by evaluating the realism of the generated images using established quantitative metrics. Furthermore, we conduct a downstream classification task aimed at debiasing a classifier by rectifying imbalances within underrepresented groups through synthetically generated samples. Our experiments demonstrate the effectiveness of the proposed approach in mitigating dataset imbalances, such as the scarcity of diagnosed female patients or individuals with normal BMI level suffering from heart failure. This work represents a major step towards the adoption of synthetic data for the development of fair and generalizable models for medical classification tasks. Notably, we conduct all our experiments using a single, consumer-level GPU to highlight the feasibility of our approach within resource-constrained environments. Our code is available at https://github.com/faildeny/debiasing-cardiac-mri.
Abstract: 虽然深度学习在心脏磁共振成像中的疾病诊断和预后方面具有巨大的潜力,但其进展通常受到高度不平衡和有偏的训练数据集的限制。 为了解决这个问题,我们提出了一种方法,通过基于敏感属性(如性别、年龄、体重指数(BMI)和健康状况)生成合成数据来缓解数据集中的不平衡。 我们采用基于去噪扩散概率模型的ControlNet,对从患者元数据和分割掩码中提取的心脏几何结构组成的文本进行条件设置。 我们通过评估生成图像的真实性,使用已建立的定量指标,在英国生物银行的大队列研究中评估我们的方法。 此外,我们进行了一项下游分类任务,旨在通过合成生成的样本校正代表性不足群体内的不平衡来消除分类器的偏差。 我们的实验表明,所提出的方法在减轻数据集不平衡方面是有效的,例如诊断出的女性患者稀缺或患有正常BMI水平却患有心力衰竭的个体。 这项工作代表了在医疗分类任务中采用合成数据以开发公平且可推广模型的重要一步。 值得注意的是,我们使用单一的消费级GPU进行了所有实验,以突出我们在资源受限环境中的方法可行性。 我们的代码可在 https://github.com/faildeny/debiasing-cardiac-mri 获取。
Subjects: Image and Video Processing (eess.IV) ; Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as: arXiv:2403.19508 [eess.IV]
  (or arXiv:2403.19508v2 [eess.IV] for this version)
  https://doi.org/10.48550/arXiv.2403.19508
arXiv-issued DOI via DataCite

Submission history

From: Grzegorz Skorupko [view email]
[v1] Thu, 28 Mar 2024 15:41:43 UTC (2,948 KB)
[v2] Mon, 8 Sep 2025 09:37:31 UTC (2,592 KB)
Full-text links:

Access Paper:

    View a PDF of the paper titled
  • View Chinese PDF
  • View PDF
  • HTML (experimental)
  • TeX Source
view license
Current browse context:
eess.IV
< prev   |   next >
new | recent | 2024-03
Change to browse by:
cs
cs.CV
cs.LG
eess

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar
a export BibTeX citation Loading...

BibTeX formatted citation

×
Data provided by:

Bookmark

BibSonomy logo Reddit logo

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)

Code, Data and Media Associated with this Article

alphaXiv (What is alphaXiv?)
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Hugging Face (What is Huggingface?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)

Demos

Replicate (What is Replicate?)
Hugging Face Spaces (What is Spaces?)
TXYZ.AI (What is TXYZ.AI?)

Recommenders and Search Tools

Influence Flower (What are Influence Flowers?)
CORE Recommender (What is CORE?)
IArxiv Recommender (What is IArxiv?)
  • Author
  • Venue
  • Institution
  • Topic

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack

京ICP备2025123034号