-
COREC – A neural multi-label COmmonsense RElation Classification system
We examine the learnability of Commonsense knowledge relations as represented in CONCEPTNET. We develop a neural open world multi-label classification system that focuses on the... -
elAPI
elAPI is a powerful, extensible API client for eLabFTW developed at the University Computing Centre (URZ, FIRE division) of University of Heidelberg. It supports serving all... -
Neural Dependency Parser with Biaffine Attention
This resource contains the code of the dependency parser used in the paper: Fankhauser, et al. (2020). "Evaluating a Dependency Parser on DeReKo". The parser is a... -
Neural PP Attachment Disambiguation Systems
This resource contains code for different types of neural PP attachment disambiguation systems: A disambiguation system inspired by de Kok et al. (2017) but with the ranking... -
Accompanying Code for Chapter 4 of the PhD Thesis "Global Inference and Local...
This release contains the source code used for Chapter 4 of the PhD thesis "Global Inference and Local Syntax Representations for Event Extraction". The code served as a testbed... -
KGE Algorithms
An updated method for link prediction that uses a regularization factor that models relation argument types Abstract (Kotnis and Nastase, 2017): Learning relations based on... -
LIDO-Handbuch für die Erfassung und Publikation von Metadaten zu kulturellen ...
LIDO (Lightweight Information Describing Objects) ist ein XML-Schema für die standardkonforme Bereitstellung von Metadaten über kulturelle Objekte in einer Vielzahl von... -
Source Code, Data and Additional Material for the Thesis: "Social Commonsense...
Understanding a social situation requires the ability to reason about the underlying emotions and behaviour of others. For example, when we read a personal story, we use our... -
MACE-AL
A method for detecting noise in automatically annotated sequence-labelled data, combining MACE (Hovy et al. 2013) with Active Learning. -
Head Selection Parsers and LSTM Labelers
This resource contains code, data and pre-trained models for various types of neural dependency parsers and LSTM labelers used in the papers: Do et al. (2017). "What Do We Need... -
GER_SET: Situation Entity Type labelled corpus for German
Semantic clause types, also called Situation Entity (SE) types (Smith, 2003) are linguistic characterizations of aspectual properties shown to be useful for tasks like... -
Abstract graphs, abstract paths, grounded paths for Freebase and NELL
We describe a method for representing knowledge graphs that capture an intensional representation of the original extensional information. This representation is very compact,... -
German Twitter Titling Corpus
The German Titling Twitter Corpus consists of 1904 stance-annotated tweets collected in June/July 2018 mentioning 24 German politicians with a doctoral degree. The Addendum... -
Ground Truth transcriptions for training OCR of historical Bengali printed te...
This dataset comprises 81 digitised images (TIFF files) drawn from a selection of early printed Bengali books (1713-1914) digitised through the Two Centuries of Indian Print... -
Source Code, Data and Additional Material for the Thesis: "Aspects of Coheren...
This dataset contains source code and system output used in the PhD thesis "Aspects of Coherence for Entity Analysis". This dataset is split into three parts corresponding to... -
FeReRe: Feedback Requirements Relation using Large Language Models [data]
This dataset consists of 3 parts: The "related_work.bib" contains citations for the Related Work section of the paper. The "ChatGPTPrompts.xlsx" contains a list of all prompt... -
Normalization of HE-Stained Histological Images using Cycle Consistent Genera...
Here we provide the data sets supporting the experiments in our publication Normalization of HE-Stained Histological Images using Cycle Consistent Generative Adversarial... -
Synthesis of CT images from digital body phantoms using CycleGAN [dataset]
The potential of medical image analysis with neural networks is limited by the restricted availability of extensive data sets. The incorporation of synthetic training data is... -
Source code and data for the PhD Thesis "Measuring the Contributions of Visio...
This dataset contains source code and data used in the PhD thesis "Measuring the Contributions of Vision and Text Modalities in Multimodal Transformers". The dataset is split... -
LLMs4Implicit-Knowledge-Generation Public
Code for equipping pretrained language models (BART, GPT-2, XLNet) with commonsense knowledge for generating implicit knowledge statements between two sentences, by (i)...