33 datasets found

Keywords: learner corpus

Filter Results
  • Business English learner speech corpus SAPS

    SAPS is a specialized speech corpus which contains business meeting simulations in English between undergraduate students of Languages for Business and Economics at the School...
  • KAMOKO-Digitalizer

    This editor was developed especially for the needs of the KAMOKO project (https://lindat.mff.cuni.cz/repository/xmlui/handle/11372/LRT-3261). The editor allows the quick entry...
  • AKCES 5 (CzeSL-SGT) Release 2

    Essays written by non-native learners of Czech, a part of AKCES/CLAC – Czech Language Acquisition Corpora. CzeSL-SGT stands for Czech as a Second Language with Spelling, Grammar...
  • Czesl - Universal Dependencies Release 0.5

    Syntactic annotation of 1600 sentences from the Czesl-MAN corpus using the framework of Universal Dependencies 2.3
  • KAMOKO: KAsseler MOrgenstern KOrpus

    KAMOKO is a structured and commented french learner-corpus. It addresses the central structures of the French language from a linguistic perspective (18 different courses). The...
  • KAMOKO: KAsseler MOrgenstern KOrpus (2021-02-09)

    KAMOKO is a structured and commented french learner-corpus. It addresses the central structures of the French language from a linguistic perspective (18 different courses). The...
  • AKCES 5 (CzeSL-SGT)

    Essays written by non-native learners of Czech, a part of AKCES/CLAC – Czech Language Acquisition Corpora. CzeSL-SGT stands for Czech as a Second Language with Spelling, Grammar...
  • euroWiss - Linguistic Profiling of European Academic Education (Subcorpus 1) ...

    Subcorpus 1 presents part of the euroWiss-Corpus covering communication in teaching/learning discourses in instruction at German and Italian universities, in the humanities as...
  • Hamburg Modern Times Corpus (HaMoTiC)

    Audio recordings of a film retelling task with adult L2 users of German. The speakers' L1 and their L2 proficiencies vary. 24 communications + 1 German reference...
  • ZISA

    Audio recordings of five adult learners of German as an L2 with L1s Spanish, Italian and Portuguese. Recording sessions (interview/conversation) in German once or twice a month...
  • The Hamburg MapTask Corpus (HAMATAC)

    Audio recordings of map tasks with adult L2 users of German. The speakers´ L1 and their L2 proficiencies vary. The maps used for the tasks are available. Audioaufnahmen...
  • The Hamburg MapTask Corpus (HAMATAC)

    Audio and two video recordings of map tasks with adult L2 users of German and one L1 speaker. The speakers' L1 and their L2 proficiencies vary. The maps used for the tasks...
  • ZISA_BR_ZI

    Sub-corpus of the ZISA project with one Italian and one Portuguese learner. The ZISA project contains audio recordings of five adult learners of German as an L2 with L1s...
  • VESPA

    The aim of the VESPA learner corpus project is to build a large collection of disciplinary writing by L2 English university students across registers, disciplines and degrees of...
  • Replication Data for: (Re-)Constructing Questions

    This is the replication data for a paper submitted to an academic journal. The abstract of the paper follows. This paper discusses the use of second language structures,...
  • Core Metadata Schema for Learner Corpora (version 1)

    The Core Metadata Schema for Learner Corpora is an extensive revision of Granger & Paquot's (2017) Core Metadata [Schema] for Learner Corpora Draft 1.0 in the field of...
  • Core Metadata [Schema] for Learner Corpora Draft 1.0

    First proposal towards a "Core Metadata [Schema] for Learner Corpora", presented at the "CLARIN workshop on Interoperability of Second Language Resources and Tools", Gothenburg,...
  • Beldeko Summary Corpus v1.0.0

    Beldeko Summary Corpus v1.0.0 The Beldeko (Belgisches Deutschkorpus) Summary Corpus is a learner corpus that consists of summaries written by advanced L2 German learners (CEF...
  • KoKo German L1 Learner Corpus 4

    The KoKo Corpus is an error-annotated learner corpus of L1 German speakers. It has been created with the aim to investigate and describe the writing skills of German-speaking...
  • Core Metadata Schema for Learner Corpora (version 2)

    This document contains a list of metadata fields that can be used to describe learner corpus data. The core metadata scheme is structured around 8 metadata types: -...
You can also access this registry using the API (see API Docs).