Skip to main content
CenXiv.org
This website is in trial operation, support us!
We gratefully acknowledge support from all contributors.
Contribute
Donate
cenxiv logo > cs > arXiv:2501.05220

Help | Advanced Search

Computer Science > Computers and Society

arXiv:2501.05220 (cs)
[Submitted on 9 Jan 2025 ]

Title: A Novel Approach to Scalable and Automatic Topic-Controlled Question Generation in Education

Title: 一种可扩展且自动的主题控制问答生成的新方法

Authors:Ziqing Li, Mutlu Cukurova, Sahan Bulathwela
Abstract: The development of Automatic Question Generation (QG) models has the potential to significantly improve educational practices by reducing the teacher workload associated with creating educational content. This paper introduces a novel approach to educational question generation that controls the topical focus of questions. The proposed Topic-Controlled Question Generation (T-CQG) method enhances the relevance and effectiveness of the generated content for educational purposes. Our approach uses fine-tuning on a pre-trained T5-small model, employing specially created datasets tailored to educational needs. The research further explores the impacts of pre-training strategies, quantisation, and data augmentation on the model's performance. We specifically address the challenge of generating semantically aligned questions with paragraph-level contexts, thereby improving the topic specificity of the generated questions. In addition, we introduce and explore novel evaluation methods to assess the topical relatedness of the generated questions. Our results, validated through rigorous offline and human-backed evaluations, demonstrate that the proposed models effectively generate high-quality, topic-focused questions. These models have the potential to reduce teacher workload and support personalised tutoring systems by serving as bespoke question generators. With its relatively small number of parameters, the proposals not only advance the capabilities of question generation models for handling specific educational topics but also offer a scalable solution that reduces infrastructure costs. This scalability makes them feasible for widespread use in education without reliance on proprietary large language models like ChatGPT.
Abstract: 自动问题生成(QG)模型的发展有望通过减少与创建教育内容相关的教师工作量,显著改善教育实践。 本文介绍了一种新的教育问题生成方法,该方法控制问题的主题焦点。 所提出的主题控制问题生成(T-CQG)方法提高了生成内容的相关性和教育目的的有效性。 我们的方法在预训练的T5-small模型上进行微调,使用专门为教育需求定制的数据集。 研究进一步探讨了预训练策略、量化和数据增强对模型性能的影响。 我们特别解决了生成与段落级上下文语义对齐的问题的挑战,从而提高了生成问题的主题特异性。 此外,我们引入并探索了新颖的评估方法,以评估生成问题的主题相关性。 我们的结果通过严格的离线和人工支持的评估得到验证,表明所提出的模型能够有效生成高质量、主题聚焦的问题。 这些模型有潜力通过作为定制的问题生成器来减少教师的工作量,并支持个性化辅导系统。 由于参数数量相对较少,这些提案不仅增强了问题生成模型处理特定教育主题的能力,还提供了一个可扩展的解决方案,降低了基础设施成本。 这种可扩展性使它们在教育中广泛使用成为可能,而无需依赖像ChatGPT这样的专有大型语言模型。
Comments: To be published at ACM Conf. on Learning Analytics and Knowledge (LAK'25)
Subjects: Computers and Society (cs.CY) ; Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR)
ACM classes: H.3.3; J.1; I.2.0
Cite as: arXiv:2501.05220 [cs.CY]
  (or arXiv:2501.05220v1 [cs.CY] for this version)
  https://doi.org/10.48550/arXiv.2501.05220
arXiv-issued DOI via DataCite

Submission history

From: Sahan Bulathwela [view email]
[v1] Thu, 9 Jan 2025 13:13:24 UTC (3,090 KB)
Full-text links:

Access Paper:

    View a PDF of the paper titled
  • View Chinese PDF
  • View PDF
  • HTML (experimental)
  • TeX Source
license icon view license
Current browse context:
cs.CY
< prev   |   next >
new | recent | 2025-01
Change to browse by:
cs
cs.AI
cs.CL
cs.IR

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar
a export BibTeX citation Loading...

BibTeX formatted citation

×
Data provided by:

Bookmark

BibSonomy logo Reddit logo

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)

Code, Data and Media Associated with this Article

alphaXiv (What is alphaXiv?)
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Hugging Face (What is Huggingface?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)

Demos

Replicate (What is Replicate?)
Hugging Face Spaces (What is Spaces?)
TXYZ.AI (What is TXYZ.AI?)

Recommenders and Search Tools

Influence Flower (What are Influence Flowers?)
CORE Recommender (What is CORE?)
IArxiv Recommender (What is IArxiv?)
  • Author
  • Venue
  • Institution
  • Topic

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack

京ICP备2025123034号