-
WMT16 APE Shared Task Data
Training, development and test data (the same used for the Sentence-level Quality Estimation task) consist of English-German triplets (source, target and post-edit) belonging to... -
WMT17 Quality Estimation Shared Task Training and Development Data
Training and development data for the WMT17 QE task. Test data will be published as a separate item. This shared task will build on its previous five editions to further examine... -
WMT18 Quality Estimation Shared Task Test Data
Test data for the WMT18 QE task. Train data can be downloaded from http://hdl.handle.net/11372/LRT-2619. This shared task will build on its previous six editions to further... -
WMT17 Quality Estimation Shared Test Data
Test data for the WMT17 QE task. Train data can be downloaded from http://hdl.handle.net/11372/LRT-1974. This shared task will build on its previous five editions to further... -
Datasets for "EmbryoNet: Using deep learning to link embryonic phenotypes to ...
This is the data repository of the training and test data sets for EmbryoNet. The data is structured in multiple packages. EmbryoNet_Models (DOI 10.48606/31) contains the... -
Digital soil mapping predicted on mid-infrared (MIR) spectroscopy measurement...
Soil information is valuable for many disciplines (e.g. agriculture, geomorphology, geology, archaeology) and can be used to produce maps or statistics on soil productivity. As... -
Soil bulk density and soil depth from on-site observations in the North-Weste...
Soil information is valuable for many disciplines (e.g. agriculture, geomorphology, geology, archaeology) and can be used to produce maps or statistics on soil productivity. As... -
Soil properties in the North-Western Kurdistan region, Iraq, derived from lab...
Soil information is valuable for many disciplines (e.g. agriculture, geomorphology, geology, archaeology) and can be used to produce maps or statistics on soil productivity. As... -
Soil properties predicted on mid-infrared (MIR) spectroscopy measurements in ...
Soil information is valuable for many disciplines (e.g. agriculture, geomorphology, geology, archaeology) and can be used to produce maps or statistics on soil productivity. As... -
Victoria Land Cover Map: Random Forest Classification Using Sentinel-2 Imager...
The land cover mapping of Victoria, Australia, for 2021/22 was conducted using Sentinel-2 satellite imagery and the random forest machine learning algorithm. This map represents... -
Seasonal hydroclimate recorded in high resolution δ18O profiles across Pinus ...
The trees sampled in this study are growing at the Persimmon Gully Nature Preserve (30° 19' N, 93° 32' W, 15 masl) in southwestern Louisiana. Four cores (2A, 3B, 15A, 15B; all... -
Datasets for "Uncovering developmental time and tempo using deep learning"
This is the data repository for training and testing the Twin Network. The imaging data repositories are divided into several packages based on independent experiments. The data... -
Predicting electronic screening for fast Koopmans spectral functional calcula...
Koopmans spectral functionals are a powerful extension of Kohn-Sham density-functional theory (DFT) that enable the prediction of spectral properties with state-of-the-art... -
Crystallization kinetics in Ge-rich GexTe alloys from large scale ...
A machine-learned interatomic potential for Ge-rich GexTe alloys has been developed aiming at uncovering the kinetics of phase separation and crystallization in these materials.... -
Replication Data for: Quantifying Uncertainty in Foraminifera Classification:...
This dataset contains PNG images of individual foraminifera and sediment grains. The dataset contains a set of training images, and a set of test images. Each of the sets... -
Data for "High-resolution soil moisture mapping in northern boreal forests u...
Datasets associated with the Jääskeläinen et al. manuscript titled "High-resolution soil moisture mapping in northern boreal forests using SMAP data and downscaling techniques".... -
Teaching oxidation states to neural networks
The accurate description of redox reactions remains a challenge for first-principles calculations, but it has been shown that extended Hubbard functionals (DFT+U+V) can provide... -
Training data for "Harnessing Machine Learning for Single-Shot Measurement of...
This repository contains data for the NeurIPS conference paper titled "Harnessing Machine Learning for Single-Shot Measurement of Free Electron Laser Pulse Power". Raw data is... -
Data for "Flow Annealed Importance Sampling Bootstrap meets Differentiable Pa...
Training data for the workshop paper "Flow Annealed Importance Sampling Bootstrap meets Differentiable Particle Physics". -
MeerKAT: Meerkat Kalahari Audio Transcripts
A large-scale reference dataset for bioacoustics. Please find the accompanying code at our official repository: github.com/livingingroups/animal2vec [Optional] You can find the...