CORPUS TOOLS AND SOFTWARE: OVERVIEW AND APPLICATIONS

Authors

  • Saidova Dilobar
  • Abdullajonova Hakima

Keywords:

Corpus linguistics, corpus tools, AntConc, Sketch Engine, WordSmith Tools, LancsBox, concordance, collocation, frequency analysis, keyness, computational linguistics, data-driven learning, lexicography, translation studies, digital humanities, annotation, part-of-speech tagging, text mining, linguistic research, applied linguistics, corpus analysis.

Abstract

This article provides an overview of corpus tools and software used in modern linguistic research and language education. Corpus tools are specialized computer programs that enable the collection, organization, and analysis of large language datasets, known as corpora. The study highlights how corpus software supports data-driven learning (DDL), allowing students and teachers to explore authentic examples of language use. It also emphasizes the integration of corpus technology in computational linguistics and digital humanities, where automatic annotation, part-of-speech tagging, and keyword extraction contribute to understanding language variation, discourse, and cultural trends. By combining quantitative precision with qualitative interpretation, corpus tools have become indispensable instruments for both theoretical and applied linguistics, promoting evidence-based approaches to language study and pedagogy.

Published

2025-11-08

How to Cite

2025. CORPUS TOOLS AND SOFTWARE: OVERVIEW AND APPLICATIONS. Ustozlar uchun. 83, 2 (Nov. 2025), 85–89.