Simulated dataset for assessing shotgun metagenomics short-read methodology

DWGSIM (v0.1.13) was used to simulate shotgun metagenomics short reads from Illumina sequencing platforms. 52.8 million paired-end reads were generated from 32 genomes retrieved from the NCBI database to simulate gut microbial communities. The per-base error rate is set to 0.0001 for both the first and the second read. The mutation rate is set to 0.001, and the fraction of mutations that are indels is 0.1. The probability that an indel is extended is zero. Genome coverage ranges from 1X to 200X. Two mocks of 52.8 million paired-end reads are available; the only difference between them is the coverage of each genome. The genomes (taxonomy, genome length, number of reads, and coverage) are fully described in the metadata files.
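
The parameters above can be sketched as a DWGSIM command line. The following Python helper is only an illustration under stated assumptions: the option letters (`-e`, `-E`, `-r`, `-R`, `-X`, `-C`) are taken from the DWGSIM usage text as commonly documented and should be checked against the help output of v0.1.13; the function name and the file names are hypothetical; and the read length and insert size are not given in this description, so they are left at the tool's defaults. It is not the authors' actual script.

```python
from typing import List


def build_dwgsim_command(reference_fasta: str, output_prefix: str,
                         coverage: float) -> List[str]:
    """Build (but do not run) a DWGSIM command for one genome.

    Hypothetical helper: the rates are the values quoted in the dataset
    description; the option letters are assumed from the DWGSIM usage text.
    """
    # The description states that coverage ranges from 1X to 200X.
    if not 1 <= coverage <= 200:
        raise ValueError("coverage is expected to be between 1 and 200")
    return [
        "dwgsim",
        "-e", "0.0001",          # per-base error rate, first read
        "-E", "0.0001",          # per-base error rate, second read
        "-r", "0.001",           # mutation rate
        "-R", "0.1",             # fraction of mutations that are indels
        "-X", "0",               # probability that an indel is extended
        "-C", f"{coverage:g}",   # mean coverage for this genome
        reference_fasta,         # input genome (hypothetical path)
        output_prefix,           # prefix of the output files (hypothetical)
    ]


if __name__ == "__main__":
    # Example: one genome simulated at 50X coverage.
    print(" ".join(build_dwgsim_command("genome.fasta", "mock1_genome", 50)))
```

Calling the helper once per genome, with the coverage listed for that genome in the metadata files, would yield one command per genome; how the per-genome outputs were then pooled into each mock is not described here.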

Identifier
DOI https://doi.org/10.15454/80BIQK
Metadata Access https://entrepot.recherche.data.gouv.fr/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.15454/80BIQK
Provenance
Creator Joanna, FOURQUET
Publisher Recherche Data Gouv
Contributor Géraldine, PASCAL
Publication Year 2022
Rights etalab 2.0; info:eu-repo/semantics/openAccess; https://spdx.org/licenses/etalab-2.0.html
OpenAccess true
Contact Géraldine, PASCAL (INRAE)
Representation
Resource Type Dataset
Format application/vnd.ms-excel.sheet.macroEnabled.12; application/x-gzip
Size 14922; 14731; 3567950980; 3568004128; 3727425641; 3727489878
Version 1.0
Discipline Life Sciences; Medicine