145 datasets found

Keywords: linguistics

  • B4 Tatian Corpus of Deviating Examples 2.1

    The present corpus, the Tatian Corpus of Deviating Examples T-CODEX 2.1, provides morpho-syntactic and information structural annotation of parts of the Old High German...
  • Catalan in a bilingual context (PhonCAT)

    Audio recordings of prompted, read and spontaneous speech data from L1 Catalan speakers from Barcelona. The data is stratified according to three different city districts and...
  • A5 Hausa Umarnin Uwa

    This corpus of Umarnin Uwa film transcripts contains 47 transcripts with a total of 10194 tokens. It provides information including automatic POS tagging, speaker and...
  • Perceptual Dialectology; Wahrnehmungsdialektologie

    Audio recordings of interactions where mental maps of various dialects of (spoken) German were elicited from adult subjects belonging to three age groups and various...
  • The Spoken Wikipedia Corpora

    The Spoken Wikipedia project unites volunteer readers of Wikipedia articles. Hundreds of spoken articles in multiple languages are available to users who are – for one...
  • WebGIS-Anwendung: Der deutsche Sprachraum aus der Sicht linguistischer Laien

    The Web-GIS for Wahrnehmungsdialektologie shows results of the collection of mental maps of the project "Der deutsche Sprachraum aus der Sicht linguistischer Laien"....
  • TraCES

    Corpus of the Classical Ethiopic Language (Ge'ez), produced by the TraCES project (https://www.traces.uni-hamburg.de/en/about.html) in 2014-2019. The corpus is...
  • Der deutsche Sprachraum aus der Sicht linguistischer Laien - wahrnehmungsdial...

    The research project "Der deutsche Sprachraum aus der Sicht linguistischer Laien" investigated lay-linguistic conceptualizations of the German language....
  • ZISA

    Audio recordings of five adult learners of German as an L2 with L1s Spanish, Italian and Portuguese. Recording sessions (interview/conversation) in German once or twice a month...
  • Parameterfixierung im Deutschen und Spanischen (PAIDUS)

    Audio recordings of five German- and five Spanish-speaking monolingual children. For the German children there are 175 recordings (interviewer/child interaction), on average...
  • DUFDE_AN_PA

    Subcorpus (two children) from the DUFDE project. Video recordings of two German/French simultaneous bilingual children, starting at approx. 1 year and 6 months. One or two...
  • Covert translation: Business Communication (old)

    Translation corpora of original texts with translations and comparable texts from the genre external business communication. CLARIN Metadata summary for Covert translation:...
  • Reference Corpus Middle Low German/Low Rhenish (1200-1650); Referenzkorpus Mi...

    The project "Reference Corpus Middle Low German/Low Rhenish (1200–1650)", abbreviated as "ReN", is part of the "Corpus of Historical German...
  • Phonologie-Erwerb Deutsch-Spanisch als Erste Sprachen (PEDSES)

    Audio recordings of three German/Spanish simultaneous bilingual children starting at approx. 1 year and ending between the ages 2;4 and 3 years. There are 125 recording sessions...
  • Reference Corpus Middle Low German/Low Rhenish (1200-1650); Referenzkorpus Mi...

    The project "Reference Corpus Middle Low German/Low Rhenish (1200–1650)", abbreviated as "ReN", is part of the "Corpus of Historical German...
  • DUFDE

    Video recordings of seven German/French simultaneous bilingual children, starting at approx. 1 year and 6 months. One or two recordings each month until approx. 5 years and 6...
  • INEL Bibliographie

    The bibliography comprises 2056 entries, including references to all relevant linguistic and ethnological publications for the Selkup and Kamas languages, furthermore...
  • B4 Historisches Predigtenkorpus zum Nachfeld

    HIPKON is the first corpus based on only one text type (sermons) and on one dialect area, Upper German (Bavarian-Alemannic). The sermons cover the time from Middle High German...
  • B2 Marghi

    Full set: all focus-related experiments. Status: work in progress; large parts elicited, most of the data transcribed, partly annotated. CLARIN Metadata summary for B2 Marghi...
  • B1 Fon

    The data sets for each language consist of a small number of mini-dialogues, chosen out of the 189 entries within the Focus Translation Task (cf. Skopeteas et al. 2006: 209ff.)...
You can also access this registry using the API (see API Docs).