-
Geometric landscapes for material discovery within energy-structure-function ...
Porous molecular crystals are an emerging class of porous materials formed by crystallisation of molecules with weak intermolecular interactions, which distinguishes them from... -
QMrxn20: Thousands of reactants and transition states for competing E2 and SN...
For competing E2 and SN2 reactions, we report 4'400 validated transition state geometries and 143'200 reactant complex geometries including conformers obtained at MP2/6-311G(d)... -
Coral fragments health and growth in the Reefscapers propagation project in t...
The Reefscapers program is a coral restoration initiative to help reef recovery in the Maldives. It started in 2001, and is most active since the 2015-2016 mass bleaching event.... -
Deforestation maps using time series of Sentinel-2A images in Amazonia, betwe...
This data set includes deforestation maps, located in the border between the west of Brazil and the north of Bolivia (corresponding to Sentinel-2's tile 20LKP). The source... -
High resolution native forest map of Eastern Arc Mountains
This study aims to develop more accurate method for mapping closed canopy evergreen natural forest (CCEF) of the Eastern Arc Mountains (EAM) ecoregion in Tanzania and Kenya, to... -
Daily summary of weather, snow, and preferential-flow conditions at the Snow ...
This dataset includes daily summaries of weather, snow, and preferential-flow conditions at the Snow and Ice Research Center, Nagaoka (Japan) -- snow seasons 2006 through... -
AntAir: satellite-derived 1km daily Antarctic air temperatures since 2003-201...
AntAir is a dataset of gridded air temperatures in 1km spatial and daily temporal resolution currently available for the years 2003-2016.AntAir was created by modelling daily... -
AntAir: satellite-derived 1km daily Antarctic air temperatures 2003-2016, lin...
AntAir is a dataset of gridded air temperatures in 1km spatial and daily temporal resolution currently available for the years 2003-2016. AntAir was created by modelling daily... -
Estimating nitrogen and phosphorus concentrations in streams and rivers acros...
Nitrogen (N) and Phosphorus (P) are essential nutritional elements for life processes in water bodies. However, in excessive quantities, they may represent a significant source... -
WMT18 Quality Estimation Shared Task Test Data
Test data for the WMT18 QE task. Train data can be downloaded from http://hdl.handle.net/11372/LRT-2619. This shared task will build on its previous six editions to further... -
WMT18 Quality Estimation Shared Task Training and Development Data
Training and development data for the WMT18 QE task. Test data will be published as a separate item. This shared task will build on its previous six editions to further examine... -
WMT16 APE Shared Task Data - Reference sentences
Training, development and test data consist in German sentences belonging to the IT domain and already tokenized. These sentences are the references of the data released for the... -
WMT17 Quality Estimation Shared Test Data
Test data for the WMT17 QE task. Train data can be downloaded from http://hdl.handle.net/11372/LRT-1974 This shared task will build on its previous five editions to further... -
WMT17 Quality Estimation Shared Task Training and Development Data
Training and development data for the WMT17 QE task. Test data will be published as a separate item. This shared task will build on its previous five editions to further examine... -
WMT16 Quality Estimation Shared Task Training and Development Data
Training and development data for the WMT16 QE task. Test data will be published as a separate item. This shared task will build on its previous four editions to further examine... -
WMT16 APE Shared Task Data
Training, development and text data (the same used for the Sentence-level Quality Estimation task) consist in English-German triplets (source, target and post-edit) belonging to... -
Corpus of contemporary blogs
In NLP Centre, dividing text into sentences is currently done with a tool which uses rule-based system. In order to make enough training data for machine learning, annotators... -
AQ-Bench
The AQ-Bench Benchmark dataset as described in Betancourt et al. (manuscript): "AQ-Bench: A Benchmark Dataset for Machine Learning on Global Air Quality Metrics" . See... -
Fast earthquake assessment and earthquake early warning dataset for Italy
The data publication contains a dataset for fast assessment of earthquakes and early warning based on seismic waveforms. The dataset encompasses Italy and surrounding refions.... -
UVP5 data sorted with EcoTaxa and MorphoCluster
Here, we provide plankton image data that was sorted with the web applications EcoTaxa and MorphoCluster. The data set was used for image classification tasks as described in...