Taxonomic or trait assignation of environmental sequences in regions with a poor taxonomic knowledge: case of river diatom metabarcoding with a new version of the annotated reference library Diat.barcode - supplementary data

DOI

Supplementary data of the manuscript --- Abstract ------

We present a new version of a barcoding reference library dedicated to diatoms, Diat.barcode v12, with newly published sequences, annotated with ecological, and biological traits and curated by a college of experts. We used this library in two different areas, one where the taxonomic coverage of the library was good (mainland France) and another where it was poor (French Guyana) with about 320 diatom samples collected for river monitoring. We show that a direct bioinformatic assignment of environmental sequences to traits has a strong interest in French Guyana where species knowledge is poor and therefore the proportion of assigned environmental sequences is much lower (12.8%) than trait assignation (30%). Using co-correspondence analyses, we show that species assignation dataset and trait assignation datasets were significantly correlated in 7 out of 13 cases in French Guyana, whereas they were always significantly correlated in Mainland France. This can be interpreted as an important loss of ecological information with species assignation in French Guyana, which is not observed in mainland France. This shows the value for ecological studies to use direct assignation of environmental sequences to traits in regions where taxonomic knowledge is poor.

Identifier
DOI https://doi.org/10.57745/IRRMXH
Related Identifier IsCitedBy https://doi.org/10.1051/limn/2025009
Metadata Access https://entrepot.recherche.data.gouv.fr/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.57745/IRRMXH
Provenance
Creator Nicolosi Gelis, Maria Mercedes ORCID logo; Cochero, Joaquín ORCID logo; Viollaz, Laurine ORCID logo; Briand, Jean-François ORCID logo; Barry-Martinet, Raphaëlle ORCID logo; Chonova, Teofana ORCID logo; Gassiole, Gilles (ORCID: 0000-0002-4072-755X); Kahlert, Maria ORCID logo; Keck, François ORCID logo; Kelly, Martyn ORCID logo; Kochoska, Hristina (ORCID: 0000-0001-5245-036X); Mann, David ORCID logo; Pfannkuchen, Martin ORCID logo; Trobajo, Rosa ORCID logo; Vasselon, Valentin ORCID logo; Vidakovic, Danijela ORCID logo; Wetzel, Carlos ORCID logo; Zimmermann, Jonas ORCID logo; Rimet, Frédéric (ORCID: 0000-0002-5514-869X)
Publisher Recherche Data Gouv
Contributor Rimet, Frédéric; Institut national de recherche pour l’agriculture, l’alimentation et l’environnement; Entrepôt Recherche Data Gouv
Publication Year 2025
Rights etalab 2.0; info:eu-repo/semantics/openAccess; https://spdx.org/licenses/etalab-2.0.html
OpenAccess true
Contact Rimet, Frédéric (CARRTEL ; INRAE, Université Savoie-Mont Blanc ; France)
Representation
Resource Type Dataset
Format text/comma-separated-values
Size 3528202; 1593356
Version 1.1
Discipline Geosciences; Earth and Environmental Science; Environmental Research; Natural Sciences