n-Alkane Distributions of Modern Plants from Kenya and Niger

DOI

This dataset comprises plant wax n-alkane concentrations (C₂₅ - C₃₅) and corresponding relative abundances for 148 modern African angiosperm samples, including trees, shrubs, grasses, herbs, vines, and palms. Samples were collected from three regions: Samburu National Reserve (SNR) in Kenya (n = 99), Lothagam in Kenya (n = 1), and multiple locations across Niger (n = 48). The SNR samples were collected between October 2001 and August 2002 and include repeated sampling of the same individual plants, allowing assessment of temporal variability in n-alkane distributions. The Lothagam and Niger samples were collected in April and November 2019, respectively. All specimens were identified to at least the family level, with most resolved to genus or species, and are accompanied by metadata including plant functional type and photosynthetic pathway. Sample preparation and analysis were conducted at Lamont-Doherty Earth Observatory between 2018 and 2021. n-Alkanes were quantified using gas chromatography with a mass selective detector (GC-MSD) and flame ionization detector (FID). Response factor corrections were applied to peak areas to obtain concentration values. Odd-carbon-number n-alkanes (C₂₅ - C₃₅) were normalized to unit sum to generate relative abundance distributions. Trees, shrubs, and grasses from this dataset formed part of the training and validation data for Tweedy et al. (2026), where machine learning models differentiated African woody and grassy samples with up to 89% validation accuracy. The 19 herb, palm, and vine samples included here are unique to this dataset and not published elsewhere. This dataset provides a reference for evaluating n-alkane chemotaxonomic signals and supports applications in vegetation reconstruction.

Identifier
DOI https://doi.org/10.1594/PANGAEA.995255
Metadata Access https://ws.pangaea.de/oai/provider?verb=GetRecord&metadataPrefix=datacite4&identifier=oai:pangaea.de:doi:10.1594/PANGAEA.995255
Provenance
Creator Tweedy, Ruth ORCID logo; Shi, Sarah; Uno, Kevin
Publisher PANGAEA
Publication Year 2026
Funding Reference National Science Foundation https://doi.org/10.13039/100000001 Crossref Funder ID EAR 19-45446 https://www.nsf.gov/funding/opportunities/career-faculty-early-career-development-program CAREER: Developing novel biomarker proxies to constrain Neogene changes in African woody cover and paleoecological contexts of hominin evolution
Rights Creative Commons Attribution 4.0 International; Data access is restricted (moratorium, sensitive data, license constraints); https://creativecommons.org/licenses/by/4.0/
OpenAccess false
Representation
Resource Type Dataset
Format text/tab-separated-values
Size 7128 data points
Discipline Earth System Research
Spatial Coverage (2.109W, 0.568S, 37.528E, 16.990N)
Temporal Coverage Begin 2001-10-06T00:00:00Z
Temporal Coverage End 2019-11-09T00:00:00Z