This paper studies strategies for improving college students’ English writing literacy using online corpora. Learner-related factors in the English writing corpus include corpus source, years of English learning, educational level, and major. Task-related factors include text type, writing time limit, and the use of reference books. The corpus consists of three subcorpora (General English, Business English, and Academic English) and totals 3.256 million words. Its main uses include setting writing standards for science and engineering college students, constructing autonomous learning platforms, studying the characteristics of interlanguage phrases, conducting diachronic studies of interlanguage, and supporting translation studies.
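The learner and task factors listed above can be pictured as metadata attached to each text in the corpus. The following is a minimal sketch under stated assumptions, not the authors' actual storage format: the class name `TextRecord`, its field names, and the helper `words_per_subcorpus` are hypothetical, and splitting on whitespace is only a rough stand-in for however the paper counts words.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Dict, Iterable, Optional

# The three subcorpora named in the abstract.
SUBCORPORA = ("General English", "Business English", "Academic English")


@dataclass(frozen=True)
class TextRecord:
    """One learner text with its metadata (hypothetical layout)."""

    subcorpus: str
    # Learner-related factors
    corpus_source: str
    years_of_english: int
    educational_level: str
    major: str
    # Task-related factors
    text_type: str
    time_limit_minutes: Optional[int]  # None means the task was untimed
    reference_books_used: bool
    # The learner's writing itself
    text: str

    def __post_init__(self) -> None:
        # Reject records that do not belong to one of the three subcorpora.
        if self.subcorpus not in SUBCORPORA:
            raise ValueError(f"unknown subcorpus: {self.subcorpus!r}")


def words_per_subcorpus(records: Iterable[TextRecord]) -> Dict[str, int]:
    """Count whitespace-separated words per subcorpus (a simplification)."""
    counts: Counter = Counter()
    for record in records:
        counts[record.subcorpus] += len(record.text.split())
    return dict(counts)
```

Keeping the factors as explicit fields would let a study filter texts by, for example, major or time limit before comparing subcorpora. The values used in any such record are illustrative and do not come from the paper.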