SPEECH-TO-TEXT TECHNOLOGY AS A TOOL FOR PRONUNCIATION ASSESSMENT AND SELF-CORRECTION IN EFL CONTEXTS: A PILOT STUDY

James Henry

Authors

James Henry Doshisha University

Keywords:

Speech recognition technology, English pronunciation assessment, Computer-Assisted Language Learning, pronunciation feedback, EFL instruction

Abstract

This pilot study investigates the efficacy of speech-to-text (STT) technology as an objective feedback mechanism for pronunciation assessment and self-correction in English as a Foreign Language (EFL) classroom. The research involved 75 Japanese university students across three classes engaging with STT applications for two 25-minute sessions in a single semester. Data collected included successful transcription rates, specific pronunciation feature improvements, and student affective responses. Results indicate that STT software provides immediate, objective feedback that complements traditional pronunciation instruction methods, with students demonstrating measurable improvement in transcription success rates (from 38% to 62%) and reporting increased confidence in their speaking abilities. This pilot study contributes to the growing field of Computer-Assisted Language Learning (CALL) by offering a practical, accessible approach to pronunciation feedback that can be implemented with minimal technological requirements and time investment

Downloads

Download data is not yet available.

References

Aoyama, K., Flege, J. E., Guion, S. G., Akahane-Yamada, R., & Yamada, T. (2004). Perceived phonetic dissimilarity and L2 speech learning: The case of Japanese /r/ and English /l/ and /r/. Journal of Phonetics, 32(2), 233-250.

Baker, A., & Burri, M. (2016). Feedback on second language pronunciation: A case study of EAP teachers' beliefs and practices. Australian Journal of Teacher Education, 41(6), 1-19.

Bradlow, A. R., Akahane-Yamada, R., Pisoni, D. B., & Tohkura, Y. (2001). Training Japanese listeners to identify English /r/ and /l/: Long-term retention of learning in perception and production. Perception & Psychophysics, 61(5), 977-985.

Celce-Murcia, M., Brinton, D. M., & Goodwin, J. M. (2010). Teaching pronunciation: A course book and reference guide (2nd ed.). Cambridge University Press.

Dörnyei, Z. (2007). Research methods in applied linguistics. Oxford University Press.

Dörnyei, Z. (2009). The L2 motivational self system. In Z. Dörnyei & E. Ushioda (Eds.), Motivation, language identity and the L2 self (pp. 9-42). Multilingual Matters.

Evers, K., & Chen, S. (2022). Effects of an automatic speech recognition system with peer feedback on pronunciation instruction for adults. Computer Assisted Language Learning, 35(10), 1869-1889.

Inceoglu, S., Chen, W. H., & Lim, H. (2023). Assessment of L2 intelligibility: Comparing L1 listeners and automatic speech recognition. ReCALL, 35(1), 89-104.

John, P., Johnson, C., & Cardoso, W. (2025). Exploring automatic speech recognition for corrective and confirmative pronunciation feedback. Journal of Second Language Pronunciation.

Kartchava, E., & Ammar, A. (2014). The noticeability and effectiveness of corrective feedback in relation to target type. Language Teaching Research, 18(4), 428-452.

Lee, J., Jang, J., & Plonsky, L. (2015). The effectiveness of second language pronunciation instruction: A meta-analysis. Applied Linguistics, 36(3), 345-366.

Levis, J. M. (2007). Computer technology in teaching and researching pronunciation. Annual Review of Applied Linguistics, 27, 184-202.

Levis, J. M. (2018). Intelligibility, oral communication, and the teaching of pronunciation. Cambridge University Press.

Liu, Y., Rahman, F. A., & Zain, F. M. (2025). A systematic literature review of research on automatic speech recognition in EFL pronunciation. Cogent Education, 12(1).

Lyster, R., & Saito, K. (2010). Oral feedback in classroom SLA: A meta-analysis. Studies in Second Language Acquisition, 32(2), 265-302.

McCrocklin, S. M. (2016). Pronunciation learner autonomy: The potential of automatic speech recognition. System, 57, 25-42.

McCrocklin, S. M. (2019). ASR-based dictation practice for second language pronunciation improvement. Journal of Second Language Pronunciation, 5(1), 98-118.

Mroz, A. (2018). Seeing how people hear you: French learners experiencing intelligibility through automatic speech recognition. Foreign Language Annals, 51(3), 617-637.

Munro, M. J., & Derwing, T. M. (2015). A prospectus for pronunciation research in the 21st century: A point of view. Journal of Second Language Pronunciation, 1(1), 11-42.

Neri, A., Cucchiarini, C., & Strik, H. (2008). The effectiveness of computer-based speech corrective feedback for improving segmental quality in L2 Dutch. ReCALL, 20(2), 225-243.

O'Brien, M. G., Derwing, T. M., Cucchiarini, C., Hardison, D. M., Mixdorff, H., Thomson, R. I., Strik, H., Levis, J. M., Munro, M. J., Foote, J. A., & Levis, G. M. (2018). Directions for the future of technology in pronunciation research and teaching. Journal of Second Language Pronunciation, 4(2), 182-207.

Rogti, M. (2025). Automatic pronunciation evaluation feedback and peer correction for shaping English pronunciation accuracy and interpersonal communication. Innovation in Language Learning and Teaching, 1-17.

Sardegna, V. G. (2011). Pronunciation learning strategies that improve ESL learners' linking. In J. Levis & K. LeVelle (Eds.), Proceedings of the 2nd Pronunciation in Second Language Learning and Teaching Conference (pp. 105-121). Iowa State University.

Sun, C. (2023). The impact of automatic speech recognition technology on second language pronunciation and speaking skills of EFL learners: A mixed methods investigation. Frontiers in Psychology.

Thomson, R. I., & Derwing, T. M. (2015). The effectiveness of L2 pronunciation instruction: A narrative review. Applied Linguistics, 36(3), 326-344.

Ueyama, M. (2000). Prosodic transfer: An acoustic study of L2 English vs. L2 Japanese. University of California, Los Angeles.

Wallace, L. (2016). Using Google Web Speech as a springboard for identifying personal pronunciation problems. Proceedings of the 7th Pronunciation in Second Language Learning and Teaching Conference, 180-186.

Xiao, Y. (2025). The impact of AI-driven speech recognition on EFL listening comprehension, flow experience, and anxiety: A randomized controlled trial. Humanities and Social Sciences Communications, 12(1), 425.

SPEECH-TO-TEXT TECHNOLOGY AS A TOOL FOR PRONUNCIATION ASSESSMENT AND SELF-CORRECTION IN EFL CONTEXTS: A PILOT STUDY

Authors

Keywords:

Abstract

Downloads

References

Downloads

Published

How to Cite

Issue

Section

License

Similar Articles

Make a Submission

Menu

Article Template

SupportingTools

Indexing

Visitors

GS