Hostname: page-component-745bb68f8f-s22k5 Total loading time: 0 Render date: 2025-01-26T22:24:41.084Z Has data issue: false hasContentIssue false

Review of research on applications of speech recognition technology to assist language learning

Published online by Cambridge University Press:  14 July 2022

Rustam Shadiev
Affiliation:
Nanjing Normal University, China (rustamsh@gmail.com)
Jiawen Liu
Affiliation:
Nanjing Normal University, China (liujw9797@163.com)

Abstract

Speech recognition technology (SRT) is now widely used in education because of its potential to aid learning, particularly language learning. Nevertheless, SRT has received only limited attention in earlier review studies. The present research aimed to address this gap in the field. To this end, 26 articles published in SSCI journals between 2014 and 2020 were selected and reviewed with respect to domain and skills, technology and their application, participants and duration, measures, reported results, and advantages and disadvantages of SRT. The results showed that English received much more attention than any other language, and scholars mostly focused on facilitating pronunciation skills. Dragon Naturally Speaking and Google speech recognition were the most popular technologies, and their most frequent application was providing feedback. According to the results, college students were involved in research more than any other group, most studies were carried out for less than one month, and most scholars administered a questionnaire or pre-/posttest to collect the data. Positive results related to gains in proficiency and student perceptions of SRT were identified. The study revealed that improved affective factors and enhanced language skills were advantages, whereas a low accuracy rate and insufficiency (i.e. lack of some useful features to support learning efficiently) of SRT were disadvantages. Based on the results, the study puts forward several implications and suggestions for educators and researchers in the field.

Type
Research Article
Copyright
© The Author(s), 2022. Published by Cambridge University Press on behalf of European Association for Computer Assisted Language Learning

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Ahn, T. Y. & Lee, S.-M. (2016) User experience of a mobile speaking application with automatic speech recognition for EFL learning. British Journal of Educational Technology, 47(4): 778786. https://doi.org/10.1111/bjet.12354 CrossRefGoogle Scholar
Arcon, N., Klein, P. D. & Dombroski, J. D. (2017) Effects of dictation, speech to text, and handwriting on the written composition of elementary school English language learners. Reading & Writing Quarterly, 33(6): 533548. https://doi.org/10.1080/10573569.2016.1253513 CrossRefGoogle Scholar
Baker, E. A. (2017) Apps, iPads, and literacy: Examining the feasibility of speech recognition in a first-grade classroom. Reading Research Quarterly, 52(3): 291310. https://doi.org/10.1002/rrq.170 CrossRefGoogle Scholar
Bodnar, S., Cucchiarini, C., de Vries, B. P., Strik, H. & van Hout, R. (2017) Learner affect in computerised L2 oral grammar practice with corrective feedback. Computer Assisted Language Learning, 30(3–4): 223246. https://doi.org/10.1080/09588221.2017.1302964 CrossRefGoogle Scholar
Caseiro, N. & Santos, D. (Eds.). (2018). Smart specialization strategies and the role of entrepreneurial universities. Hershey, PA: IGI Global. Available from: https://www.igi-global.com/book/smart-specialization-strategies-role-entrepreneurial/197442 Google Scholar
Cavus, N. & Ibrahim, D. (2017) Learning English using children’s stories in mobile devices. British Journal of Educational Technology, 48(2): 625641. https://doi.org/10.1111/bjet.12427 CrossRefGoogle Scholar
Creswell, J. W. (2014). Educational research: Planning, conducting, and evaluating quantitative. Boston, MA: Pearson Education.Google Scholar
Dalim, C. S. C., Sunar, M. S., Dey, A. & Billinghurst, M. (2020) Using augmented reality with speech input for non-native children’s language learning. International Journal of Human-Computer Studies, 134: 4464. https://doi.org/10.1016/j.ijhcs.2019.10.002 CrossRefGoogle Scholar
de Vries, B. P., Cucchiarini, C., Bodnar, S., Strik, H. & van Hout, R. (2015) Spoken grammar practice and feedback in an ASR-based call system. Computer Assisted Language Learning, 28(6): 550576. https://doi.org/10.1080/09588221.2014.889713 CrossRefGoogle Scholar
Duman, G., Orhon, G. & Gedik, N. (2015) Research trends in mobile assisted language learning from 2000 to 2012. ReCALL, 27(2): 197216. https://doi.org/10.1017/S0958344014000287 CrossRefGoogle Scholar
Ehsani, F. & Knodt, E. (1998) Speech technology in computer-aided language learning: Strengths and limitations of a new CALL paradigm. Language Learning & Technology, 2(1): 5473.Google Scholar
Haug, K. N. & Klein, P. D. (2018) The effect of speech-to-text technology on learning a writing strategy. Reading & Writing Quarterly, 34(1): 4762. https://doi.org/10.1080/10573569.2017.1326014 CrossRefGoogle Scholar
Hsu, L. (2016) An empirical examination of EFL learners’ perceptual learning styles and acceptance of ASR-based computer-assisted pronunciation training. Computer Assisted Language Learning, 29(5): 881900. https://doi.org/10.1080/09588221.2015.1069747 CrossRefGoogle Scholar
Liakin, D., Cardoso, W. & Liakina, N. (2017) Mobilizing instruction in a second-language context: Learners’ perceptions of two speech technologies. Languages, 2(3): 121. https://doi.org/10.3390/languages2030011 CrossRefGoogle Scholar
MacArthur, C. A., & Cavalier, A. R. (2004). Dictation and speech recognition technology as test accommodations. Exceptional Children, 71(1), 4358.CrossRefGoogle Scholar
Matthews, J. & O’Toole, J. M. (2015) Investigating an innovative computer application to improve L2 word recognition from speech. Computer Assisted Language Learning, 28(4): 364382. https://doi.org/10.1080/09588221.2013.864315 CrossRefGoogle Scholar
McCrocklin, S. M. (2016) Pronunciation learner autonomy: The potential of automatic speech recognition. System, 57: 2542. https://doi.org/10.1016/j.system.2015.12.013 CrossRefGoogle Scholar
McKechnie, J., Ahmed, B., Gutierrez-Osuna, R., Monroe, P., McCabe, P. & Ballard, K. J. (2018) Automated speech analysis tools for children’s speech production: A systematic literature review. International Journal of Speech-Language Pathology, 20(6): 583598. https://doi.org/10.1080/17549507.2018.1477991 CrossRefGoogle ScholarPubMed
Mirzaei, M. S., Meshgi, K., Akita, Y. & Kawahara, T. (2017) Partial and synchronized captioning: A new tool to assist learners in developing second language listening skill. ReCALL, 29(2): 178199. https://doi.org/10.1017/S0958344017000039 CrossRefGoogle Scholar
Mroz, A. (2018) Seeing how people hear you: French learners experiencing intelligibility through automatic speech recognition. Foreign Language Annals, 51(3): 617637. https://doi.org/10.1111/flan.12348 CrossRefGoogle Scholar
Oh, E. Y. & Song, D. (2021) Developmental research on an interactive application for language speaking practice using speech recognition technology. Educational Technology Research and Development, 69(2): 861884. https://doi.org/10.1007/s11423-020-09910-1 CrossRefGoogle Scholar
Radha, V. & Vimala, C. (2012) A review on speech recognition challenges and approaches. World of Computer Science and Information Technology Journal, 2(1): 17.Google Scholar
Shadiev, R. & Huang, Y.-M. (2020) Investigating student attention, meditation, cognitive load, and satisfaction during lectures in a foreign language supported by speech-enabled language translation. Computer Assisted Language Learning, 33(3): 301326. https://doi.org/10.1080/09588221.2018.1559863 CrossRefGoogle Scholar
Shadiev, R., Huang, Y.-M. & Hwang, J.-P. (2017) Investigating the effectiveness of speech-to-text recognition applications on learning performance, attention, and meditation. Educational Technology Research and Development, 65(5): 12391261. https://doi.org/10.1007/s11423-017-9516-3 CrossRefGoogle Scholar
Shadiev, R., Hwang, W.-Y., Chen, N.-S. & Huang, Y.-M. (2014) Review of speech-to-text recognition technology for enhancing learning. Journal of Educational Technology & Society, 17(4): 6584.Google Scholar
Shadiev, R., Hwang, W.-Y., Huang, Y.-M. & Liu, C.-J. (2016) Investigating applications of speech-to-text recognition technology for a face-to-face seminar to assist learning of non-native English-speaking participants. Technology, Pedagogy and Education, 25(1): 119134. https://doi.org/10.1080/1475939X.2014.988744 CrossRefGoogle Scholar
Shadiev, R., Sun, A. & Huang, Y.-M. (2019) A study of the facilitation of cross-cultural understanding and intercultural sensitivity using speech-enabled language translation technology. British Journal of Educational Technology, 50(3): 14151433. https://doi.org/10.1111/bjet.12648 CrossRefGoogle Scholar
Shadiev, R., Wang, X., Wu, T.-T. & Huang, Y.-M. (2021) Review of research on technology-supported cross-cultural learning. Sustainability, 13(3): 123. https://doi.org/10.3390/su13031402 CrossRefGoogle Scholar
Shadiev, R., Wu, T.-T., Sun, A. & Huang, Y.-M. (2018) Applications of speech-to-text recognition and computer-aided translation for facilitating cross-cultural learning through a learning activity: Issues and their solutions. Educational Technology Research and Development, 66(1): 191214. https://doi.org/10.1007/s11423-017-9556-8 CrossRefGoogle Scholar
Shadiev, R. & Yang, M. (2020) Review of studies on technology-enhanced language learning and teaching. Sustainability, 12(2): 122. https://doi.org/10.3390/su12020524 CrossRefGoogle Scholar
Tsai, P. (2019) Beyond self-directed computer-assisted pronunciation learning: A qualitative investigation of a collaborative approach. Computer Assisted Language Learning, 32(7): 713744. https://doi.org/10.1080/09588221.2019.1614069 CrossRefGoogle Scholar
van Doremalen, J., Boves, L., Colpaert, J., Cucchiarini, C. & Strik, H. (2016) Evaluating automatic speech recognition-based language learning systems: A case study. Computer Assisted Language Learning, 29(4): 833851. https://doi.org/10.1080/09588221.2016.1167090 CrossRefGoogle Scholar
Wang, Y.-H. & Young, S. S.-C. (2014) A study of the design and implementation of the ASR-based iCASL system with corrective feedback to facilitate English learning. Journal of Educational Technology & Society, 17(2): 219233.Google Scholar
Wang, Y.-H. & Young, S. S.-C. (2015) Effectiveness of feedback for enhancing English pronunciation in an ASR-based CALL system. Journal of Computer Assisted Learning, 31(6): 493504. https://doi.org/10.1111/jcal.12079 CrossRefGoogle Scholar
Xiao, W. & Park, M. (2021) Using automatic speech recognition to facilitate English pronunciation assessment and learning in an EFL context: Pronunciation error diagnosis and pedagogical implications. International Journal of Computer-Assisted Language Learning and Teaching, 11(3): 7491. https://doi.org/10.4018/IJCALLT.2021070105 CrossRefGoogle Scholar
Yu, P., Pan, Y., Li, C., Zhang, Z., Shi, Q., Chu, W., Liu, M. & Zhu, Z. (2016) User-centred design for Chinese-oriented spoken English learning system. Computer Assisted Language Learning, 29(5): 9841000. https://doi.org/10.1080/09588221.2015.1121877 CrossRefGoogle Scholar
Yueh, H.-P., Lin, W., Liu, Y.-L., Shoji, T. & Minoh, M. (2014) The development of an interaction support system for international distance education. IEEE Transactions on Learning Technologies, 7(2): 191196. https://doi.org/10.1109/TLT.2014.2308952 CrossRefGoogle Scholar
Supplementary material: File

Shadiev and Liu supplementary material

Shadiev and Liu supplementary material

Download Shadiev and Liu supplementary material(File)
File 41.9 KB