Image captioning in different languages

van Miltenburg, Emiel

Computer Science > Computation and Language

arXiv:2407.09495 (cs)

[Submitted on 31 May 2024 (v1) , last revised 2 Apr 2025 (this version, v3)]

Title: Image captioning in different languages

Title: 不同语言的图像描述

Authors:Emiel van Miltenburg

Abstract: This short position paper provides a manually curated list of non-English image captioning datasets (as of May 2024). Through this list, we can observe the dearth of datasets in different languages: only 23 different languages are represented. With the addition of the Crossmodal-3600 dataset (Thapliyal et al., 2022, 36 languages) this number increases somewhat, but still this number is small compared to the +/-500 institutional languages that are out there. This paper closes with some open questions for the field of Vision & Language.

Abstract: 这篇简短的立场文件提供了一份手动整理的非英语图像字幕数据集列表（截至 2024 年 5 月）。通过这份列表，我们可以看出不同语言的数据集非常匮乏：仅有 23 种不同的语言。加上 Crossmodal-3600 数据集（Thapliyal 等人，2022 年，包含 36 种语言），这个数字略有增加，但与现有的 +/-500 种机构语言相比，这个数字仍然很小。本文最后提出了一些关于视觉与语言领域的开放性问题。

Subjects:	Computation and Language (cs.CL) ; Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.09495 [cs.CL]
	(or arXiv:2407.09495v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2407.09495

Submission history

From: Emiel van Miltenburg [view email]
[v1] Fri, 31 May 2024 09:37:54 UTC (56 KB)
[v2] Wed, 30 Oct 2024 11:57:22 UTC (50 KB)
[v3] Wed, 2 Apr 2025 19:27:35 UTC (53 KB)

Computer Science > Computation and Language

Title: Image captioning in different languages

Title: 不同语言的图像描述

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title: Image captioning in different languages Show Chinese title

Title: 不同语言的图像描述

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Title: Image captioning in different languages