Integrating learner corpora and natural language processing: A crucial step towards reconciling technological sophistication and pedagogical effectiveness1

Sylviane Granger; Olivier Kraif; Claude Ponton; Georges Antoniadis; Virginie Zampa

doi:10.1017/S0958344007000237

Abstract

Learner corpora, electronic collections of spoken or written data from foreign language learners, offer unparalleled access to many hitherto uncovered aspects of learner language, particularly in their error-tagged format. This article aims to demonstrate the role that the learner corpus can play in CALL, particularly when used in conjunction with web-based interfaces which provide flexible access to error-tagged corpora that have been enhanced with simple NLP techniques such as POS-tagging or lemmatization and linked to a wide range of learner and task variables such as mother tongue background or activity type. This new resource is of interest to three main types of users: teachers wishing to prepare pedagogical materials that target learners' attested difficulties; learners themselves for editing or language awareness purposes and NLP researchers, for whom it serves as a benchmark for testing automatic error detection systems.

Information

Type

Research Article

Information

ReCALL , Volume 19 , Issue 3 , September 2007 , pp. 252 - 268

DOI: https://doi.org/10.1017/S0958344007000237

References

¹ The research reported in this article is part of a wider project on Integrated Digital Language Learning (IDILL) carried out within the framework of the EU-funded network of excellence Kaleidoscope dedicated to research in the field of technology-enhanced learning: http://www.noe-kaleidoscope.org/pub/

Crossref Citations

This article has been cited by the following publications. This list is generated based on data provided by Crossref.

Harrer, Andreas Martínez-Monès, Alejandra and Dimitracopoulou, Angelique 2009. Technology-Enhanced Learning. p. 175.

LEVY, MIKE 2009. Technologies in Use for Second Language Learning. The Modern Language Journal, Vol. 93, Issue. s1, p. 769.

Antoniadis, Georges Granger, Sylviane Kraif, Olivier Ponton, Claude Medori, Julia and Zampa, Virginie 2009. Technology-Enhanced Learning. p. 89.

VYATKINA, NINA 2012. The Development of Second Language Writing Complexity in Groups and Individuals: A Longitudinal Learner Corpus Study. The Modern Language Journal, Vol. 96, Issue. 4, p. 576.

Granger, Sylviane 2012. The Encyclopedia of Applied Linguistics.

VYATKINA, NINA 2013. Specific Syntactic Complexity: Developmental Profiling of Individuals Based on an Annotated Learner Corpus. The Modern Language Journal, Vol. 97, Issue. S1, p. 11.

Karlström, Petter and Lundin, Eva 2013. CALL in the zone of proximal development: novelty effects and teacher guidance. Computer Assisted Language Learning, Vol. 26, Issue. 5, p. 412.

Campillos Llanos, Leonardo 2014. A Spanish learner oral corpus for computer-aided error analysis. Corpora, Vol. 9, Issue. 2, p. 207.

Cotos, Elena 2014. Enhancing writing pedagogy with learner corpus data. ReCALL, Vol. 26, Issue. 2, p. 202.

Urzua, Alfredo 2015. Corpus-based Research in Applied Linguistics. Vol. 66, Issue. , p. 99.

Alexopoulou, Theodora Geertzen, Jeroen Korhonen, Anna and Meurers, Detmar 2015. Exploring big educational learner corpora for SLA research. International Journal of Learner Corpus Research, Vol. 1, Issue. 1, p. 96.

Zou, Bin and Peng, Wangheng 2015. Corpus Linguistics in Chinese Contexts. p. 134.

Campillos Llanos, Leonardo 2016. Spanish Learner Corpus Research. Vol. 78, Issue. , p. 89.

Ha, Myung-Jeong 2016. Linking adverbials in first-year Korean university EFL learners' writing: a corpus-informed analysis. Computer Assisted Language Learning, Vol. 29, Issue. 6, p. 1090.

Zou, Bin and Reinders, Hayo 2017. Innovation in Language Learning and Teaching. p. 245.

Alexopoulou, Theodora Michel, Marije Murakami, Akira and Meurers, Detmar 2017. Task Effects on Linguistic Complexity and Accuracy: A Large‐Scale Learner Corpus Analysis Employing Natural Language Processing Techniques. Language Learning, Vol. 67, Issue. S1, p. 180.

Pérez-Paredes, Pascual Ordoñana Guillamón, Carlos and Aguado Jiménez, Pilar 2018. Language teachers’ perceptions on the use of OER language processing technologies in MALL. Computer Assisted Language Learning, Vol. 31, Issue. 5-6, p. 522.

Chen, Xiaobin and Meurers, Detmar 2019. Linking text readability and learner proficiency using linguistic complexity feature vector distance. Computer Assisted Language Learning, Vol. 32, Issue. 4, p. 418.

Preradovic, Nives Mikelic and Posavec, Kristina 2019. Opening Up Education for Inclusivity Across Digital Economies and Societies. p. 73.

Bicking, Sabine Steinhoff-Knopp, Bastian Burkhard, Benjamin and Müller, Felix 2020. Quantification and mapping of the nutrient regulation ecosystem service demand on a local scale. Ecosystems and People, Vol. 16, Issue. 1, p. 114.

Download full list

Article contents

Integrating learner corpora and natural language processing: A crucial step towards reconciling technological sophistication and pedagogical effectiveness1

Abstract

Keywords

Information

Access options

Article purchase

Temporarily unavailable

References

This article has been cited by the following publications. This list is generated based on data provided by Crossref.

Article contents

Integrating learner corpora and natural language processing: A crucial step towards reconciling technological sophistication and pedagogical effectiveness1

Abstract

Keywords

Information

Access options

Article purchase

Temporarily unavailable

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests