-
Source code and data for the PhD Thesis "Linguistically-Inspired Neural Coher...
This dataset contains source code and data used in the PhD thesis "Linguistically-Inspired Neural Coherence Modeling". The dataset is split into five repositories: StruSim:... -
Phonologischer Erwerb des Galicischen als Zweitsprache: Eine qualitative Anal...
Dieses Datenpaket enthält Audio- und Begleitdaten aus dem Masterarbeitsprojekt „Der phonologische Erwerb des Galicischen als Zweitsprache: Eine qualitative Analyse... -
Data for the PhD thesis "Modeling Lexical Fields for Translation: a Corpus-B...
This dataset contains in high resolution all graphical visualizations of data analysis provided in my doctoral dissertation. The graphs are organized according to chapters and... -
CARDIO:DE [V1.1.1]
Version information CARDIO:DE 1.1.1 Minor formatting updates (Previous version: CARDIO:DE 1.1) Abstract: We present CARDIO:DE, the first freely available and distributable... -
Heidelberg Bibliography of Translations of Nonfictional Texts [data]
This project, funded by the German Research Foundation, compiles an online bibliography of German translations of nonfictional texts published between 1450 and 1850. It includes... -
Turkology Annual Online – Full bibliographic records
The "Turkologischer Anzeiger/Turkology Annual" (TA), founded by Andreas Tietze (†) and György Hazai (†), is an indispensable systematic bibliography for Turkology and Ottoman... -
ChiSCor: Children's Story Corpus
ChiSCor is a new corpus containing 619 fantasy stories, told freely by 442 Dutch children aged 4-12. ChiSCor was compiled for studying how children render character... -
Impact of manipulating word boundaries on the information distributed in morp...
These plots are part of the study "Impact of manipulating word boundaries on the information distributed in morphology and syntax". Each plot represents the word-structure... -
Learning from climate change news: Is the world on the same page?
Climate change challenges countries around the world, and news media are key to the public’s awareness and perception of it. But how are news media approaching climate change... -
CorpusExplorer
Software for corpus linguists and text/data mining enthusiasts. The CorpusExplorer combines over 45 interactive visualizations under a user-friendly interface. Routine tasks... -
Corpora of patient information sheets and consent forms for UK cancer trials ...
Obtaining informed consent is an ethical imperative when conducting research involving human participants. However, participants’ actual level of understanding is often... -
Heidelberg Bibliography of Translations of Nonfictional Texts [data]
This project, funded by the German Research Foundation, compiles an online bibliography of German translations of nonfictional texts published between 1450 and 1850. It includes... -
Salience of color terms in real texts in a wide cross-linguistic study
This dataset collects the different labels used in different languages of the world for basic word colours, according to Berlin and Kay, based on PanLex. It also provides the...