Harnessing the Power of Corpus Linguistics in Language Education: A Student-Centered Approach
Main Article Content
Abstract
Introduction: Corpus linguistics (CL) has emerged as a valuable approach in language education, yet skepticism persists among future foreign language teachers regarding its necessity and effectiveness. This study investigates whether hands-on experience with corpora and natural language processing (NLP) tools can shift pre-service teachers’ perceptions and enhance their appreciation for corpus-based methods in language teaching.
Methodology: A structured teacher-training course was implemented, during which participants created their own corpora and utilized NLP tools to develop teaching materials. Pre- and post-course questionnaires were administered to assess changes in attitudes towards CL. The chi-square tests were used to analyze the significance of the collected data.
Results: Findings indicated a significant positive shift in perceptions. Engagement with CL tools led to increased appreciation for data-driven methodologies, with participants expressing a greater likelihood of incorporating these tools into their future teaching practices. The chi-square analysis confirmed the statistical significance of these changes.
Conclusion: Practical engagement with CL and related technologies can effectively address initial skepticism among teacher trainees. These results advocate for the inclusion of CL components in teacher training curricula to promote innovative, data-driven language teaching practices and bridge the gap between skepticism and effective application.
Article Details

This work is licensed under a Creative Commons Attribution 4.0 International License.
References
Anthony, L. (2005). AntConc: Design and development of a freeware corpus analysis toolkit for the technical writing classroom. IPCC 2005. Proceedings. International Professional Communication Conference. Limerick, Ireland, (pp. 729-737). DOI: https://doi.org/10.1109/IPCC.2005.1494244
Aston, G. (2002). The learner as corpus designer. In B. Kettemann, & G. Marko (Eds.), Teaching and learning by doing corpus analysis (pp. 9-25). Amsterdam: Rodopi. http://www.sslmit.unibo.it/~guy/graz.htm
Bennett, S., & Oliver, M. (2011). Talking back to theory: The missed opportunities in learning technology research. Research in learning Technology, 19(3), 179-189. https://doi.org/10.3402/rlt.v19i3.17108
Boulton, A., & Tyne, H. (2014). Corpus-based study of language and teacher education. The routledge handbook of educational linguistics. Routledge.
Boulton, A. (2010). Data-driven learning: Taking the computer out of the equation. Language Learning, 60(3), 534-572. https://doi.org/10.1111/j.1467-9922.2010.00566.x
Boulton, A. (2009). Testing the limits of data-driven learning: Language proficiency and training. ReCALL, 21(1), 37-54.
Boulton, A. (2008). Bringing corpora to the masses: Free and easy tools for language learning. In N. Kübler (Ed.), Corpora, language, teaching, and resorces: From theory to practice. Peter Lang. http://hal.archives-ouvertes.fr/docs/00/32/69/80/PDF/XXXX_boulton_TaLC_ interdisciplinary.pdf
De Cock, S. (2010). Spoken learner corpora and EFL teaching. Corpus-based approaches to English language teaching, (pp. 123-137).
Farr, F. (2008). Evaluating the use of corpus-based instruction in a language teacher education context: Perspectives from the users. Language awareness, 17(1), 25-43. DOI: https://doi.org/10.2167/la414.0
Flowerdew, L. (2005). An integration of corpus-based and genre-based approaches to text analysis in EAP/ESP: Countering criticisms against corpus-based methodologies. English for specific purposes, 24(3), 321-332.
Gavioli, L. (2009). Corpus analysis and the achievement of learner autonomy in interaction. In L. Lombardo (Ed.), Using corpora to learn about language and discourse (pp. 39-71). Peter Lang.
Gilquin, G., & Granger, S. (2010). How can data-driven learning be used in language teaching? The Routledge handbook of corpus linguistics. Routledge.
Karlsen, P. H. (2021). Teaching and Learning English through Corpus-based approaches in Norwegian Secondary Schools: identifying obstacles and a way forward. [Doctoral dissertation]. https://brage.inn.no/innxmlui/handle/11250/2829366
Leech, G. (2014). Teaching and language corpora: A convergence. Teaching and language corpora (pp. 1-24). Routledge.
Reppen, R. (2010). Using corpora in the language classroom. Cambridge: Cambridge University Press.
Römer, U. (2006). Pedagogical applications of corpora: Some reflections on the current scope and a wish list for future developments. Zeitschrift für Anglistik und Amerikanistik, 54(2), 121-134.
Ross, D. (2018). Small corpora and low-frequency phenomena: Try and beyond contemporary, standard English.Corpus, (18). DOI: https://doi.org/10.4000/corpus.3574
Sinclair, S., & Rockwell, G. (2016). Voyant Tools.
Smith, S. (2008). DIY corpora for vocabulary learning. 第二辑, 3-26. S Smith - xuebao.zyufl.edu.cn
Vaughan, E., & Clancy, B. (2013). Small corpora and pragmatics. In Yearbook of corpus linguistics and pragmatics 2013: New domains and methodologies (pp. 53-73). Dordrecht: Springer Netherlands.
Stubbs, M. (2004). Language corpora. In A. Davies, & C. Elder (Eds.), The handbook of applied linguistics, (pp. 106-132). Blackwell Publishing Ltd. DOI: https://doi.org/10.1002/9780470757000
Taherdoost, H. (2019). What is the best response scale for survey and questionnaire design; review of different lengths of rating scale/attitude scale/Likert scale. International Journal of Academic Research in Management, 8(1), 1-10. https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3588604
Walsh, S. (2023). Using corpora for English language teacher education (ELTE). Roczniki Humanistyczne, 71(10S), 193-212. https://doi.org/10.18290/rh237110sp-10
Wulff, S., & Baker, P. (2021). Analyzing concordances. A practical handbook of corpus linguistics. Springer International Publishing.
Yoon, H., & Hirvela, A. (2004). ESL student attitudes toward corpus use in L2. Journal of second language writing, 13(4), 257-283. https://doi.org/10.1016/j.jslw.2004.06.002