Dataset - B2FIND

Datasets and R scripts for modelling Czech translation counterparts of Romanc...

This repository contains the datasets and code used in the study “Predicting translation counterparts in causative constructions.” The datasets consist of annotated examples of...

Gender variation in written Norwegian (Genusvariasjon i norsk skriftspråk)

This dataset contains the material from a study of grammatical gender variation in written Norwegian. The study originated in two commissions I received from Språkrådet to...

Background data for: An analysis of bar chart usage in corpus data visualization

Dataset description: This dataset contains information about the use of bar charts for corpus data presentation. It is based on a systematic review covering all papers (n =...

Whatsapp corpus Berntzen

WhatsApp conversations collected by master's students in Communication & Information Studies (2013-2014; 2014-2015). All participants in the conversations are over 18 and have signed...

Posture-verb constructions in Dutch and German

The dataset is part of the research published in Okabe (forthcoming). The data comprise two .csv files: database_nl.csv and database_de.csv.

Whatsapp corpus Verheijen

WhatsApp data collected for the PhD research of Lieke Verheijen (Radboud University). Informed consent was obtained only from the contributor, not from the conversational partner. Consequently,...

DigiLing e-Learning Hub: e-Courses for Digital Linguistics

The files represent exported e-learning resources created within the DigiLing project, www.digiling.eu. We have identified seven core subjects in Digital Linguistics and built...

Replication data for: Constructions and language change: From genitive to acc...

This article reports on a corpus study of ongoing language change in Russian, whereby genitive-governing verbs like bojat’sja “fear” combine with objects in the accusative in...

Background data for: “I regret lying” VS. “I regret that I lied”: Variation i...

This dataset contains tabular files recording occurrences of the verb REGRET complemented by a that- or (S)-ing-complement clause (CC) in the GloWbE corpus. Tokens were...

Replication Data for: The decade construction rivalry in Russian: Using a cor...

This dataset contains 3 data files, 5 files with R code, and a short read-me file with documentation. The data files contain information about the development of two competing...

Background data for: Regional syntactic variability in the complementation sy...

This dataset contains tabular files recording occurrences of the verb REGRET complemented by a that- or -ing-complement clause (CC) in the GloWbE corpus. Tokens were retrieved...

Background data for: Regression and random forests: Synergies for variationis...

This dataset contains tabular files recording occurrences of the verb REGRET complemented by a that- or -ing-complement clause (CC) in the GloWbE corpus. Tokens were retrieved...

Background data for: Probabilistic variability in clausal verb complementatio...

This dataset contains tabular files recording occurrences of the verb REGRET in the GloWbE corpus. Tokens were retrieved using the online interface...

Replication Data for: Metaphors in high-stakes language exams

The dataset and R code here provide documentation for the chapter "Metaphors in high-stakes language exams". Chapter abstract: Lakoff and Johnson’s Conceptual Metaphor Theory...

Background data for: Advancing our understanding of dispersion measures in co...

Dataset description: This dataset contains background data and supplementary material for Sönning (forthcoming), a study that looks at the behavior of dispersion measures when...

Background data for: Negation as a predictor of clausal complement choice in ...

This dataset contains tabular files recording occurrences of the verb REGRET complemented by a finite or non-finite complement clause (CC) in the GloWbE corpus. Tokens were...

Biber et al.'s (2016) set of 150 BNC items for the analysis of dispersion mea...

This dataset contains frequencies for a set of 150 word forms in the BNC. The set of items was compiled by Biber et al. (2016) for the purpose of analyzing the behavior of...

Replication Data for: Figurative production in a computer-mediated discussion...

The datasets, R code, and supplementary data file provide documentation for the chapter "Figurative production in a computer-mediated discussion forum: Metaphors about...

54 datasets found