-
DaMuEL 1.0: A Large Multilingual Dataset for Entity Linking
We present DaMuEL, a large Multilingual Dataset for Entity Linking containing data in 53 languages. DaMuEL consists of two components: a knowledge base that contains... -
Extensions to the Slovene translation of SuperGLUE
SuperGLUE is a benchmark styled after GLUE with a new set of more difficult language understanding tasks, improved resources, and a public leaderboard. It is comprised of 8... -
Slovenian datasets for contextual synonym and antonym detection
Slovenian datasets for contextual synonym and antonym detection can be used for training machine learning classifiers as described in the MSc thesis of Jasmina Pegan "Semantic... -
Slovenian Word in Context dataset SloWiC 1.0
The SloWIC dataset is a Slovenian dataset for the Word in Context task. Each example in the dataset contains a target word with multiple meanings and two sentences that both... -
Slovene translation of the SQuAD2.0 dataset
Stanford Question Answering Dataset (SQuAD) is a reading comprehension dataset, consisting of questions posed by crowdworkers on a set of Wikipedia articles, where the answer to... -
Slovene Translation of the Atomic 2020 data set SloATOMIC 2020
The SloATOMIC 2020 corpus contains the Slovene translations of the ATOMIC 2020 data set, a commonsense knowledge graph with 1.33M everyday inferential knowledge tuples about... -
MultiEmo: Multilingual, Multilevel, Multidomain Sentiment Analysis Corpus of ...
MultiEmo, a new benchmark data set for the multilingual sentiment analysis task including 11 languages. The collection contains consumer reviews from four domains: medicine,... -
Interview Guideline, Transcriptions, and Coding for "A Consolidated Framework...
Dataset for the article: Herm, Lukas-Valentin ; Janiesch, Christian ; Helm, Alexander ; Imgrund, Florian ; Fuchs, Kevin ; Hofmann, Adrian ; Winkelmann, Axel: A Consolidated... -
A Source Collection on Urban annuities, 14th–18th centuries
The data published here comprises interest rates computed from annuities sold by urban authorities across the Holy Roman Empire and Italy from the 14th to 18th centuries.... -
SynthCity Dataset - Trajectory
With deep learning becoming a more prominent approach for automatic classification of three-dimensional point cloud data, a key bottleneck is the amount of high quality training... -
SynthCity Dataset - Area 3 (Test)
With deep learning becoming a more prominent approach for automatic classification of three-dimensional point cloud data, a key bottleneck is the amount of high quality training... -
EPISURG: a dataset of postoperative magnetic resonance images (MRI) for quant...
EPISURG is a clinical dataset of T1-weighted magnetic resonance images (MRI) from 430 epileptic patients who underwent resective brain surgery at the National Hospital of... -
Dataset for manuscript: Women’s preferences for receiving uncertain results f...
We conducted a survey containing a discrete choice experiment to understand the test features that drive women's preferences for prenatal genomic testing, and explore variation... -
SynthCity Dataset - Complete
With deep learning becoming a more prominent approach for automatic classification of three-dimensional point cloud data, a key bottleneck is the amount of high quality training... -
Replication package for "Cost Measures Matter for Mutation Testing Study Vali...
This is a replication package for the experiments reported in 2020 FSE paper "Cost Measures Matter for Mutation Testing Study Validity".For more information, please read... -
Randomly-displaced methane configurations
Most of the datasets to benchmark machine-learning models contain minimum-energy structures, or small fluctuations around stable geometries, and focus on the diversity of... -
Randomly-displaced methane configurations
Most of the datasets to benchmark machine-learning models contain minimum-energy structures, or small fluctuations around stable geometries, and focus on the diversity of... -
2020_GLOBECOM_LTE-DL-static-rural-and-urban-outdoor_dataset
This repository contains measurement data that are presented in the paper "On the Stability of RSRP and Variability of Other KPIs in LTE Downlink – An Open Dataset" Submitted... -
2020_GLOBECOM_LTE-DL-static-urban-indoor_dataset
This repository contains measurement data that are presented in the paper "Real World Performance of LTE Downlink in a Static Dense Urban Scenario – An Open Dataset" Submitted... -
Dataset: A geodata derived European neighborhood graph of all NUTS-3 regions ...
The graph presented in this dataset contains all European NUTS-3 regions and their metadata keyed by NUTS Level 3 ID, provided by the Eurostat as of 04/2020. 215 out of...