Evaluating the scoring system of an AI-integrated app to assess foreign language phonological decoding

[thumbnail of Open Access]
Preview
Text (Open Access)
- Published Version
· Available under License Creative Commons Attribution.
[thumbnail of Accepted Manuscript (with figures).pdf]
Text
- Accepted Version
· Restricted to Repository staff only
· The Copyright of this document has not been checked yet. This may affect its availability.

Please see our End User Agreement.

It is advisable to refer to the publisher's version if you intend to cite from this work. See Guidance on citing.

Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email

Turner, J., Porter, A., Graham, S. ORCID: https://orcid.org/0000-0002-7743-3977, Ralph-Donaldson, T., Kruesemann, H., Zhang, P. ORCID: https://orcid.org/0000-0002-2136-4984 and Borthwick, K. (2025) Evaluating the scoring system of an AI-integrated app to assess foreign language phonological decoding. Research Methods in Applied Linguistics, 4 (3). 100257. ISSN 2772-7661 doi: 10.1016/j.rmal.2025.100257

Abstract/Summary

Phonological decoding in a foreign language (FL)—a two-part process involving first the ability to map written symbols to their corresponding sounds and second to pronounce them intelligibly—is foundational for reading and vocabulary acquisition. Yet assessing this skill efficiently and at scale in young learners remains a persistent challenge. Here, we introduce and evaluate the accuracy and effectiveness of a novel method for assessing FL phonological decoding using an AI-driven app that automatically scores children's pronunciation of symbol-sound correspondences. In a study involving 254 learners of French and Spanish (aged 10–11) across five UK primary schools, pupils completed a read-aloud task (14 symbol-sound correspondences) that was scored by the app’s automatic speech recognition (ASR) technology. The validity of these automated scores was tested by fitting them as independent variables in regression models predicting human auditory coding. The multiple significant relationships between automated and human scores that were established indicate that there is great potential for ASR-based tools to reliably assess phonological decoding in this population. These findings provide the first large-scale empirical validation of an AI-based assessment of FL decoding in children, opening new possibilities, applicable to a range of languages being learnt, for scalable and efficient assessment.

Altmetric Badge

Item Type Article
URI https://centaur.reading.ac.uk/id/eprint/124064
Identification Number/DOI 10.1016/j.rmal.2025.100257
Refereed Yes
Divisions Arts, Humanities and Social Science > Institute of Education > Language and Literacy in Education
Publisher Elsevier
Download/View statistics View download statistics for this item

Downloads

Downloads per month over past year

University Staff: Request a correction | Centaur Editors: Update this record