The EPOP corpus is the collection of 247 documents on plant health. The documents are public web documents about quarantine pest in Europe that have been pre-processed and translated to English. The documents are split into a training (110), a development (55) and a test (82) sets. The gold-standard annotation for the training and development sets are available on "Training and development dataset for information extraction in plant epidemiomonitoring" dataset. Both datasets are intended for the training and evaluation of information extraction methods. The EPOP dataset is the basis for the PestCLEF task of the LifeCLEF 2026 challenge.