Oomycete diversity determined via metabarcoding in two embayments in the Orkney Islands, Scotland, UK in 2016

DOI

Water sampling was carried out weekly between April and October 2016. Water samples were taken from two shallow sites (less than 5 km apart) in the Orkneys. Sampling was done using a bucket at both sampling locations. Water for filtration was transported to the Orkney Lobster Hatchery where 500–1000 mL of water was filtered over 0.4 µm polycarbonate (PC) filter (Whatman, 47mm). Filters were stored at -20 °C until DNA isolation in the laboratory. The samples were processed at the AWI to produce a metabarcoding data set with the goal of analysing the general diversity at the two sites but also to investigate the dynamics of harmful algal bloom species (HABs). As the data also contained a wealth of information on protistan parasites they were used to produce an additional manuscript on oomycete infections in the planktonic foodweb at the two sites.The bioinformatic processing of the raw sequence files was conducted as follows. The low-quality 3'-ends of the reads were trimmed by Trimmomatic (version 0.38) and the paired-ends were merged by VSEARCH (version 2.3.0). Cutadapt (version 1.19, was used to adjust the sequence orientation and to remove the forward and reverse primer matching sequence segments. Sequences were only kept in the sequence pool if both primer matching segments could be detected. To ensure a high sequence quality the remaining sequences were filtered by VSEARCH and only kept if the expected base error (sum of all base error probabilities) of a sequence was above 0.25. Chimeric sequences were predicted sample-wise by VSEARCH in de novo mode with default settings and removed from the sequence pool as well. Only samples which consisted of at least 10000 sequences after filtering were considered for further analyses. The remaining sequences were clustered into OTUs by the tool swarm (version 2.2.2) with default settings. For each OTU the most abundant amplicon was selected as representative and taxonomically annotated with the default classifier implemented in mothur (version 1.38.1). As reference the Protist Ribosomal Reference database (PR2), version 4.10 was chosen and the minimum confidence cut-off for annotation was set to a value of 80. Further details of the pipeline are provided in (Sprong, et al. 2020; doi:10.1016/j.seares.2020.101914).

Identifier
DOI https://doi.org/10.1594/PANGAEA.942305
Metadata Access https://ws.pangaea.de/oai/provider?verb=GetRecord&metadataPrefix=datacite4&identifier=oai:pangaea.de:doi:10.1594/PANGAEA.942305
Provenance
Creator Sprong, Pim; Kraberg, Alexandra Claudia ORCID logo; Metfies, Katja
Publisher PANGAEA
Publication Year 2023
Rights Creative Commons Attribution 4.0 International; https://creativecommons.org/licenses/by/4.0/
OpenAccess true
Representation
Resource Type Dataset
Format text/tab-separated-values
Size 8 data points
Discipline Earth System Research
Spatial Coverage (-2.901W, 58.885S, -2.751E, 58.922N); Orkneys
Temporal Coverage Begin 2016-04-10T00:00:00Z
Temporal Coverage End 2016-10-19T00:00:00Z