-
UKP Convincing Arguments v2
The UKPConvArg2 Corpus is introduced in the following paper: Habernal, I., & Gurevych, I. (2016). What makes a convincing argument? Empirical analysis and detecting... -
3D Point Cloud - Altes Hauptgebäude, TU Darmstadt
3D Point Cloud of the old main building at TU Darmstadt (S1|03, Hochschulstr. 1, 64289 Darmstadt, Germany). The raw data was captured with the Hector Tracker robot... -
Sense-annotated English Puns
A pun is a form of wordplay in which a word suggests two or more meanings by exploiting polysemy, homonymy, or phonological similarity to another word, for an intended humorous... -
Wikipedia Edit-Turn-Pairs
Corresponding and Non-Corresponding Edit-Turn-Pairs from the English Wikipedia. The ETP-gold corpus is based on article edits and discussion page turns from the English... -
German Word Choice Problems
The dataset on this page was obtained from the 2001 to 2005 editions of the Reader's Digest Magazine. The dataset has been used to evaluate the performance of semantic... -
Knowledge-based Semantic Role Labeling
Automatically frame- and role-labeled WaSR-corpora for English (WaSR_L_v1 and WaSR_XL_v1): WaSR-en-part 1, WaSR-en_part 2, WaSR_en_part 3. Automatically... -
Object of Fixation Dataset
This dataset was created in order to evaluate different models for detecting the driver's current object of fixation, i.e. finding the object the driver is looking at, when... -
Simple–complex Sentence Pairs
The simple–complex sentence pair dataset created in the paper. -
Yeast Cells in Microstructures Dataset
Yeast cell instance-segmentation dataset of the paper "An Instance Segmentation Dataset of Yeast Cells in Microstructures" [EMBC, 2023]. https://arxiv.org/abs/2304.07597 We... -
YAGS "Yahoo! Annotated Gold Standard"
This folder contains the data files and scripts to compile the YAGS "Yahoo! Annotated Gold Standard" annotated with FrameNet frames and roles as published. To compile the... -
Email Disentanglement
Enron Threads Corpus and Enron Crowdsourced Dataset -
From Zero to Hero: Human-In-The-Loop Entity Linking in Low Resource Domains
This dataset has no description
-
Multilingual UKP Sentential Argument Mining Corpus
This dataset is an extension of the original UKP Sentential Argument Mining Corpus which includes 25,492 sentences over eight controversial topics. Each sentence was annotated... -
Darmstadt Service Review Corpus
The Darmstadt Service Review Corpus consists of consumer reviews annotated with opinion-related information at the sentence and expression levels. -
Relation Classification
Semantic relatedness data -
Availability Test
How do you set up e-mail request access? This dataset is intended to answer that question. -
CLEVR-Hans3
A compositionally complex data set for investigating confounders and explainability. -
Supplementary materials: Mining Legal Arguments in Court Decisions
Pre-trained transformer models; accompanying materials to the paper and its GitHub repository -
Insufficiently Supported Arguments in Argumentative Essays
This corpus includes 1029 arguments taken from argumentative essays. Each argument is annotated as “insufficient” if its premises do not provide enough evidence for accepting or... -
Context-Aware Representations for Knowledge Base Relation Extraction
We provide a subcorpus of Wikipedia that was annotated with Wikidata relations using a distant supervision procedure. The corpus contains two types of annotations: entities and...