<b>Myelin Basic Protein (MBP) Degradome Foundation Atlas</b>

The MBP Degradome Foundation Atlas (Version 1) is an open-access, reproducible dataset that provides the first comprehensive representation of the complete proteolytic degradome of Myelin Basic Protein (MBP). MBP is a key structural component of central nervous system myelin and a central antigen in demyelinating diseases, including multiple sclerosis (MS) and other inflammatory neurological disorders.

This atlas systematically enumerates and characterizes all possible proteolytic MBP peptide fragments defined by a curated set of cleavage sites supported by experiment and the literature. The degradome concept captures the dynamic pool of MBP fragments that arise through proteolysis of full-length MBP in vivo. The dataset therefore aims to support biomarker discovery, mechanistic proteomics, and translational research in neuroimmunology and neurodegeneration.

<b>Scientific Rationale</b>

Instead of treating MBP as a single, static protein species, the degradome model reflects a biologically realistic landscape of coexisting proteolytic fragments. This is particularly relevant because:

- MBP undergoes extensive physiologic and pathologic proteolysis.
- Specific MBP fragments act as antigens and immunomodulatory signals.
- Fragment composition may vary with disease activity, genotype, or therapeutic intervention.

<b>Dataset Contents</b>

The compressed archive includes:

- MBP_WT.csv: full degradome of wild-type MBP
- MBP_R159K.csv: full degradome of the R159K MBP variant
- MBP_Degradome_All.csv: merged and unified dataset combining all included MBP variants
- Python source code used to generate all peptide fragments and compute peptide features
- README.txt: structured technical documentation
- requirements.txt: software dependency list for reproducibility

<b>Data Format</b>

All files are provided in CSV (comma-separated values) format and include the following annotated fields:

- id: structured peptide identifier (e.g., MBP_WT_10_42)
- peptide: amino acid sequence
- start, stop: cleavage positions
- mz: mass-to-charge ratio
- Da: molecular weight
- Boman: Boman index
- charge: net charge
- pI: isoelectric point
- hydrophobicity
- instability_index
- aliphatic_index

These properties enable integration into R, Python, SAS, Matlab, and machine learning workflows.

<b>Software and Reproducibility</b>

The dataset is built using open-source tools:

- Python 3
- pandas for data handling
- peptides library for physicochemical property computation
- sqlite3 for intermediate in-memory processing
- psutil for optional resource monitoring

All scripts are documented in the repository to support reproducibility.

<b>Applications</b>

This dataset is intended for use in:

- Biomarker discovery and validation in demyelinating diseases
- Computational proteomics, including cleavage pattern modeling
- Characterisation of MBP variants and SNP-associated degradome changes
- Machine-learning feature engineering for immunoproteomics
- Structural, immunological, and functional analysis of MBP fragments

<b>Keywords (for discoverability)</b>

Myelin basic protein, MBP, degradome, proteolysis, neuroimmunology, neurodegeneration, multiple sclerosis, demyelinating diseases, proteomics, peptide atlas, biomarker discovery, R159K, CNS autoimmunity, bioinformatics, mass spectrometry, immunoproteomics.
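The enumeration of fragments from a set of cleavage sites, as described above, can be illustrated with a minimal sketch. This is not the code shipped in the archive; it only shows one plausible reading of the approach. It assumes that a cleavage site k cuts the bond after residue k, that start and stop are 1-based and inclusive, that the full-length sequence is kept as one of the entries, and that identifiers follow the label_start_stop pattern seen in the example id MBP_WT_10_42. The function name, the toy sequence, and the toy site list are hypothetical placeholders, not MBP data.

```python
from itertools import combinations


def enumerate_fragments(sequence, cleavage_sites, label="MBP_WT"):
    """List every contiguous fragment bounded by two cleavage sites or termini.

    Assumption: a site k means the bond after residue k (1-based) is cut.
    """
    n = len(sequence)
    # Boundaries are the N-terminus (0), each valid internal site, and the C-terminus (n).
    boundaries = sorted({0, n} | {k for k in cleavage_sites if 0 < k < n})
    fragments = []
    for left, right in combinations(boundaries, 2):
        start, stop = left + 1, right  # assumed 1-based, inclusive coordinates
        fragments.append(
            {
                "id": f"{label}_{start}_{stop}",
                "peptide": sequence[left:right],
                "start": start,
                "stop": stop,
            }
        )
    return fragments


# Hypothetical toy input: a placeholder sequence and sites, NOT the MBP sequence.
toy_sequence = "ACDEFGHIKL"
toy_sites = [3, 7]

fragments = enumerate_fragments(toy_sequence, toy_sites, label="TOY")
for fragment in fragments:
    print(fragment["id"], fragment["peptide"])
```

With s internal cleavage sites, this scheme yields (s + 2)(s + 1) / 2 entries, so the table grows quadratically with the number of curated sites. Physicochemical columns such as mz or pI would be computed per peptide in a separate step.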

Identifier
DOI https://doi.org/10.5522/04/30579419.v1
Related Identifier HasPart https://ndownloader.figshare.com/files/59432651
Related Identifier HasPart https://ndownloader.figshare.com/files/59434091
Related Identifier HasPart https://ndownloader.figshare.com/files/59434367
Related Identifier HasPart https://ndownloader.figshare.com/files/59434370
Related Identifier HasPart https://ndownloader.figshare.com/files/59436029
Metadata Access https://api.figshare.com/v2/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:figshare.com:article/30579419
Provenance
Creator Petzold, Axel
Publisher University College London UCL
Contributor Figshare
Publication Year 2025
Rights https://creativecommons.org/publicdomain/zero/1.0/
OpenAccess true
Contact researchdatarepository(at)ucl.ac.uk
Representation
Language English
Resource Type Dataset
Discipline Basic Biological and Medical Research; Biochemistry; Biology; Chemistry; Life Sciences; Natural Sciences