Navigating the fog: the effectiveness of personalised conversational GenAI models for supporting ancient language learning

Ross, E. A. S. ORCID: https://orcid.org/0000-0003-4174-835X and Baines, J. (2025) Navigating the fog: the effectiveness of personalised conversational GenAI models for supporting ancient language learning. AI & Antiquity, 1 (1). pp. 35-52. ISSN 3081-4553

Text - Published Version
· Restricted to Repository staff only
· The Copyright of this document has not been checked yet. This may affect its availability.
· Available under License Creative Commons Attribution Non-commercial.

1MB

It is advisable to refer to the publisher's version if you intend to cite from this work. See Guidance on citing.

To link to this item DOI: 10.64946/aiantiquity.v1i1.002

Abstract/Summary

Hallucinations (misleading, inaccurate predicted text presented as fact) are a critical problem when using generative artificial intelligence (GenAI) tools to support ancient language teaching and learning. Teachers must spend significant editing time correcting inaccuracies or misrepresentations before they can use AI-generated content in their teaching practice. Students may not recognise these convincing errors, which may lead to misconceptions in their knowledge formation. OpenAI and Google have released public-facing, customisable conversational AI models that allow users to upload their own datasets to create personalised AI chat agents, known as GPTs (2023) and Gems (2024) respectively. This presents an opportunity for teachers to personalise their own models to streamline their students’ experiences. However, can personalised conversational AI tools provide a fine-tuned experience that reduces the major, problematic ancient history and ancient language hallucinations seen in standard ChatGPT and Gemini outputs? This paper discusses the creation of a personalised Latin Tutor GPT and Gem through the development of a series of exhaustive Latin vocabulary spreadsheets. We tested these personalised tools against their standard GenAI counterparts to determine whether personalisation improved their efficacy and efficiency for supporting ancient language learning. Both the development of the spreadsheets and the testing process closely addressed current GenAI ethical issues, including copyright, environmental impact, and content restrictions. The tests found that personalised GPTs and Gems made small efficacy and efficiency improvements, but that the time and energy required greatly outweighed these gains.

Item Type: Article
Refereed: Yes
Divisions: Arts, Humanities and Social Science > School of Humanities > Classics
ID Code: 124613
Uncontrolled Keywords: Ancient Language Learning, Generative Artificial Intelligence, Latin, OpenAI, Gemini, AI Ethics
Publisher: Center for Innovation in Ancient Worlds
