Corpus for the epidemiomonitoring of plant

The corpus is the collection of 165 documents on plant health to which the manual annotations of the 'Training and development dataset for information extraction in plant epidemiomonitoring' apply. The documents are public web documents about quarantine pests in Europe that have been pre-processed and translated into English. The annotations in the Training and development dataset refer to character positions within the documents of the corpus. Both datasets are intended for training and validating information extraction methods.
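
Because the annotations point to character positions, using them means slicing the document text by offset. The following is a minimal sketch of that idea only. The annotation file format is not described in this record, so the `Span` type, the label names, the half-open offset convention (start inclusive, end exclusive) and the sample sentence are all hypothetical and should be checked against the dataset's own documentation.

```python
from typing import NamedTuple


class Span(NamedTuple):
    """Hypothetical stand-off annotation: offsets into the document text."""
    start: int  # first character of the span (assumed inclusive)
    end: int    # position just past the last character (assumed exclusive)
    label: str  # annotation type; the label names used here are invented


def extract(text: str, spans: list[Span]) -> list[tuple[str, str]]:
    """Return (label, covered text) pairs, rejecting out-of-range offsets."""
    results = []
    for span in spans:
        if not (0 <= span.start < span.end <= len(text)):
            raise ValueError(f"offsets out of range: {span}")
        results.append((span.label, text[span.start:span.end]))
    return results


# Invented example sentence, not taken from the corpus.
doc = "Xylella fastidiosa was detected on olive trees in Apulia."
spans = [
    Span(0, 18, "Pest"),
    Span(35, 46, "Plant"),
    Span(50, 56, "Location"),
]

for label, covered in extract(doc, spans):
    print(f"{label}: {covered}")
```

Offsets of this kind are only valid against the exact text they were computed on, so the corpus files should be read without any re-encoding or whitespace normalization before the annotations are applied.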

Identifier
DOI https://doi.org/10.57745/YKSEPY
Metadata Access https://entrepot.recherche.data.gouv.fr/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.57745/YKSEPY
Provenance
Creator MaIAGE; Plateforme ESV
Publisher Recherche Data Gouv
Contributor Claire Nédellec; Marie Grosdidier; Robert Bossy; Sandy Duperier; Isabelle Pieretti; MaIAGE; Plateforme ESV; Louise Deléger; Entrepôt-Catalogue Recherche Data Gouv
Publication Year 2025
Funding Reference Agence nationale de la recherche ANR-20-PCPA-0002; INRAE; PIA DATAIA
Rights info:eu-repo/semantics/openAccess
OpenAccess true
Contact Claire Nédellec (INRAE); Marie Grosdidier (INRAE)
Representation
Resource Type Dataset
Format application/zip
Size 258464
Version 1.0
Discipline Agriculture, Forestry, Horticulture; Computer Science; Life Sciences; Agricultural Sciences; Agriculture, Forestry, Horticulture, Aquaculture; Agriculture, Forestry, Horticulture, Aquaculture and Veterinary Medicine; Medicine