ScSAM: Debiasing Morphology and Distributional Variability in Subcellular Semantic Segmentation

Fang, Bo; Fan, Jianan; Liu, Dongnan; Chang, Hang; Shami, Gerald J.; Braet, Filip; Cai, Weidong

计算机科学 > 计算机视觉与模式识别

arXiv:2507.17149 (cs)

[提交于 2025年7月23日 ]

标题： ScSAM：消除亚细胞语义分割中的形态偏差和分布变异性

标题： ScSAM: Debiasing Morphology and Distributional Variability in Subcellular Semantic Segmentation

Authors:Bo Fang, Jianan Fan, Dongnan Liu, Hang Chang, Gerald J.Shami, Filip Braet, Weidong Cai

摘要：亚细胞组分之间显著的形态和分布可变性给基于学习的细胞器分割模型带来了长期挑战，显著增加了偏差特征学习的风险。现有方法通常依赖单一映射关系，忽视了特征多样性，从而导致偏差训练。尽管Segment Anything Model（SAM）提供了丰富的特征表示，但其在亚细胞场景中的应用受到两个关键挑战的阻碍：(1) 亚细胞形态和分布的可变性在标签空间中造成了差距，导致模型学习到虚假或有偏差的特征。 (2) SAM专注于全局上下文理解，常常忽略细粒度的空间细节，使得捕捉细微结构变化和处理偏态数据分布变得困难。为了解决这些挑战，我们引入了ScSAM，这是一种通过融合预训练SAM与掩码自编码器（MAE）引导的细胞先验知识来增强特征鲁棒性的方法，以缓解数据不平衡带来的训练偏差。具体而言，我们设计了一个特征对齐和融合模块，将预训练嵌入对齐到相同的特征空间，并高效地结合不同的表示。此外，我们提出了一种基于余弦相似度矩阵的类别提示编码器，以激活特定类别的特征来识别亚细胞类别。在多种亚细胞图像数据集上的大量实验表明，ScSAM优于最先进的方法。

摘要： The significant morphological and distributional variability among subcellular components poses a long-standing challenge for learning-based organelle segmentation models, significantly increasing the risk of biased feature learning. Existing methods often rely on single mapping relationships, overlooking feature diversity and thereby inducing biased training. Although the Segment Anything Model (SAM) provides rich feature representations, its application to subcellular scenarios is hindered by two key challenges: (1) The variability in subcellular morphology and distribution creates gaps in the label space, leading the model to learn spurious or biased features. (2) SAM focuses on global contextual understanding and often ignores fine-grained spatial details, making it challenging to capture subtle structural alterations and cope with skewed data distributions. To address these challenges, we introduce ScSAM, a method that enhances feature robustness by fusing pre-trained SAM with Masked Autoencoder (MAE)-guided cellular prior knowledge to alleviate training bias from data imbalance. Specifically, we design a feature alignment and fusion module to align pre-trained embeddings to the same feature space and efficiently combine different representations. Moreover, we present a cosine similarity matrix-based class prompt encoder to activate class-specific features to recognize subcellular categories. Extensive experiments on diverse subcellular image datasets demonstrate that ScSAM outperforms state-of-the-art methods.

评论：	被第28届欧洲人工智能大会（ECAI）接受
主题：	计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI); 机器学习 (cs.LG)
ACM 类：	I.4.6
引用方式：	arXiv:2507.17149 [cs.CV]
	(或者 arXiv:2507.17149v1 [cs.CV] 对于此版本)
	https://doi.org/10.48550/arXiv.2507.17149

提交历史

来自： Bo Fang [查看电子邮件]
[v1] 星期三， 2025 年 7 月 23 日 02:28:43 UTC (7,122 KB)

计算机科学 > 计算机视觉与模式识别

标题： ScSAM：消除亚细胞语义分割中的形态偏差和分布变异性

标题： ScSAM: Debiasing Morphology and Distributional Variability in Subcellular Semantic Segmentation

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

计算机科学 > 计算机视觉与模式识别

标题： ScSAM：消除亚细胞语义分割中的形态偏差和分布变异性 显示英文标题

标题： ScSAM: Debiasing Morphology and Distributional Variability in Subcellular Semantic Segmentation

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题： ScSAM：消除亚细胞语义分割中的形态偏差和分布变异性