LOCOLE (Longitudinal Corpus of Learner English)

PID

Information about LOCOLE This corpus comprises essays written by university students of English Philology over the course of one academic year. The essays were collected four times during the 2024-2025 academic year. They were all written by hand in the classroom, without access to any reference tools. The essays were manually keyboarded, preserving the authentic learner writing, including non-standard spelling, language use, and punctuation. Essay topics are provided in Table 1 below. Table 1. Essays in the corpus Cohort Essay topic Number of essays Date of data collection 1. Is education the key to success? 28 September 2. University pressures 26 October 3. Can AI replace human teachers? 29 January 4. Linguistic theories have no place outside academia 26 May 109 in total

Text ID Each text in the corpus has a unique ID code, for example, 1_F_01. The first number in the code represents the cohorts by the time of data collection and topic prompt. The last number codes each participant's sequential number in the list (see the CSV file). The letter indicates the gender of the participant: - F stands for ‘Female’, - M stands for “Male’, - O stands for ‘Other’, - N shows that the participant preferred not to indicate their gender. More detailed information about the participants is available in the attached CSV file.

Funding sources The digitization of the corpus was supported by the Research Council of Lithuania as a Student Summer Internship project.

Identifier
PID http://hdl.handle.net/20.500.11821/74
Related Identifier https://web.vu.lt/flf/r.jukneviciene/?page_id=938
Metadata Access https://clarin.vdu.lt/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:clarin.vdu.lt:20.500.11821/74
Provenance
Creator Juknevičienė, Rita; Vilkaitė-Lozdienė, Laura; Kasteckienė, Jurga; Salei, Palina
Publisher Vilnius University
Publication Year 2025
Rights ACA_CLARIN-LT_End-User-Licence-Agreement_EN-LT; ACA; https://clarin.vdu.lt/licenses/eula/ACA_CLARIN-LT_End-User-Licence-Agreement_EN-LT.htm
OpenAccess true
Contact info(at)clarin.vdu.lt
Representation
Language English
Resource Type corpus
Format application/zip; application/octet-stream; text/plain; charset=utf-8; downloadable_files_count: 2
Discipline Linguistics