
Developing and evaluating an oral skills training website supported by automatic speech recognition technology

Published online by Cambridge University Press: 05 January 2011

Howard Hao-Jan Chen
Affiliation:
Department of English, National Taiwan Normal University, He-ping East Road, Section 1, Taipei 10610, Taiwan (email: hjchen@ntnu.edu.tw)

Abstract

Oral communication ability has become increasingly important to many EFL students. Several commercial software programs based on automatic speech recognition (ASR) technologies are available but their prices are not affordable for many students. This paper will demonstrate how the Microsoft Speech Application Software Development Kit (SASDK), a free but powerful tool, can be used to develop an oral skills training website for EFL students. This ASR-based website offers six different types of online exercises which allow students to practise their oral skills and obtain immediate feedback on their performance. A group of 25 college students and a group of 35 pre-service English teachers were invited to use the website. Two surveys were conducted to investigate the students’ and the pre-service teachers’ perceptions of this site. The results indicated that most teachers and students enjoyed using this website, which they felt could help improve their English oral skills. They also pointed out that the main strength of the ASR-based learning system is that it offers several different types of exercises which can encourage learners to produce more output in a low-anxiety environment. The major limitations of the website are the insufficient feedback and the challenging standards one must meet in order to achieve a pass mark. These findings can be useful for teachers who are interested in using ASR in teaching and for CALL researchers who aim to develop better ASR-based systems for language learning.

Type
Research Article
Copyright
Copyright © European Association for Computer Assisted Language Learning 2011
