-
WMT16 Quality Estimation Shared Task Training and Development Data
Training and development data for the WMT16 QE task. Test data will be published as a separate item. This shared task will build on its previous four editions to further examine... -
WMT17 Quality Estimation Shared Task Training and Development Data
Training and development data for the WMT17 QE task. Test data will be published as a separate item. This shared task will build on its previous five editions to further examine... -
WMT16 APE Shared Task Data
Training, development and text data (the same used for the Sentence-level Quality Estimation task) consist in English-German triplets (source, target and post-edit) belonging to... -
WMT16 APE Shared Task Data - Reference sentences
Training, development and test data consist in German sentences belonging to the IT domain and already tokenized. These sentences are the references of the data released for the... -
Corpus of contemporary blogs
In NLP Centre, dividing text into sentences is currently done with a tool which uses rule-based system. In order to make enough training data for machine learning, annotators... -
SnakeCLEF 2021
The dataset with 409,679 images belonging to 772 snake species from 188 countries and all continents (386,006 images with labels targeted for development and 23,673 images... -
WMT17 Quality Estimation Shared Test Data
Test data for the WMT17 QE task. Train data can be downloaded from http://hdl.handle.net/11372/LRT-1974 This shared task will build on its previous five editions to further... -
WMT18 Quality Estimation Shared Task Test Data
Test data for the WMT18 QE task. Train data can be downloaded from http://hdl.handle.net/11372/LRT-2619. This shared task will build on its previous six editions to further... -
Model files for the Neural network-based model of Electron density in the Top...
Here, we present model files and example scripts for the Neural network-based model of Electron density in the Topside ionosphere (NET). The model is based on radio occultation... -
Fast earthquake assessment dataset for Chile
The data publication contains a dataset for fast assessment of earthquakes based on seismic waveforms. The dataset encompasses Northern Chile. Due to the large scale of the... -
Fast earthquake assessment and earthquake early warning dataset for Italy
The data publication contains a dataset for fast assessment of earthquakes and early warning based on seismic waveforms. The dataset encompasses Italy and surrounding refions.... -
TEAM – The Transformer Earthquake Alerting Model
TEAM, the Transformer Earthquake Alerting Model is a deep learning model for real time estimation of peak ground acceleration (TEAM), earthquake magnitude and earthquake... -
Data publication: Bubble size distribution and electrode coverage at porous n...
Porous materials are frequently used as e.g. electrodes or porous transport layers in various types of electrolyzers. A better understanding of the bubble dynamics on porous... -
Accelerating Finite-temperature Kohn-Sham Density Functional Theory with Deep...
Output from electronic structure code (Quantum Espresso) that serves as training data for the machine-learning workflow of the related scientific publication... -
Teaching ML in Compact Courses
This talk summarizes the experiences made with teaching Machine Learning within compact events that stretch over several days to a week maximum. Both speakers explain pitfalls... -
UVP5 data sorted with EcoTaxa and MorphoCluster
Here, we provide plankton image data that was sorted with the web applications EcoTaxa and MorphoCluster. The data set was used for image classification tasks as described in... -
Coral fragments health and growth in the Reefscapers propagation project in t...
The Reefscapers program is a coral restoration initiative to help reef recovery in the Maldives. It started in 2001, and is most active since the 2015-2016 mass bleaching event.... -
Daily sea level anomalies from satellite altimetry with Random Forest Regression
The sea level observations from satellite altimetry are characterised by a sparse spatial and temporal coverage. For this reason, along-track data are routinely interpolated... -
The high-resolution topsoil plant-available phosphorus map of Estonia
The high-resolution (1:10,000) hybrid topsoil plant-available phosphorus map was produced by combining: (a) arable-land polygons with the median topsoil P value, (b) the machine... -
Towards physics-based deep learning in OpenFOAM: Combining OpenFOAM with the ...
Source Code and Data snapshot accompanying the Training " Towards physics-based deep learning in OpenFOAM: Combining OpenFOAM with the PyTorch C++ API" given at the 17th...
