CORPUS TOOLS AND SOFTWARE: OVERVIEW AND APPLICATIONS
Keywords:
Corpus linguistics, corpus tools, AntConc, Sketch Engine, WordSmith Tools, LancsBox, concordance, collocation, frequency analysis, keyness, computational linguistics, data-driven learning, lexicography, translation studies, digital humanities, annotation, part-of-speech tagging, text mining, linguistic research, applied linguistics, corpus analysis.
Abstract
This article provides an overview of corpus tools and software used in modern linguistic research and language education. Corpus tools are specialized computer programs for collecting, organizing, and analyzing large language datasets, known as corpora. The article also shows how corpus software supports data-driven learning (DDL), which lets students and teachers explore authentic examples of language use. It further emphasizes the integration of corpus technology into computational linguistics and digital humanities, where automatic annotation, part-of-speech tagging, and keyword extraction contribute to understanding language variation, discourse, and cultural trends. By combining quantitative precision with qualitative interpretation, corpus tools have become indispensable for both theoretical and applied linguistics, promoting evidence-based approaches to language study and pedagogy.
References
1. Anthony, L. AntConc (Version 4.0). Waseda University, 2020.
2. Scott, M. WordSmith Tools Version 4. Oxford University Press, 1999.
3. Kilgarriff, A., Rychlý, P., Smrž, P., & Tugwell, D. The Sketch Engine. Lexical Computing, 2014.
4. McEnery, T., & Hardie, A. Corpus Linguistics: Method, Theory and Practice. Cambridge University Press, 2012.
5. Baker, P. Using Corpora in Discourse Analysis. Continuum, 2006.
6. Brezina, V., McEnery, T., & Wattam, S. LancsBox: Developing a New Corpus Analysis Tool. CL2015 Conference Proceedings, Lancaster University, 2015.
7. Johns, T. From Printout to Handout: Grammar and Vocabulary Teaching in the Context of Data-Driven Learning. ELR Journal, 1991.
8. Hunston, S. Corpora in Applied Linguistics. Cambridge University Press, 2002.
9. Leech, G. Language Variation and Change. Routledge, 2000.
10. Stubbs, M. Text and Corpus Analysis. Blackwell, 1996.