Dawnn training code and resulting model

DOI

This project is a collection of files to allow users to reproduce the model development and benchmarking in "Dawnn: single-cell differential abundance with neural networks" (Hall and Castellano, under review). Dawnn is a tool for detecting differential abundance in single-cell RNAseq datasets. It is available as an R package here. Please contact us if you are unable to reproduce any of the analysis in our paper. The files in this collection correspond to the code used to train Dawnn's neural network model (following the model selection and hyperparameter optimisation in 10.5522/04/22634416/nn_model_choice.py) and the resulting trained model.

FILES: Training code

train_nn_regen_seed_123.py Python script to train final selected Dawnn model (Loads Keras and Tensorflow; Loads training data in labels_df_autogen4_rerun.csv; Creates neural network architecture; Trains model using early stopping, learning rate 0.00001, and batch size 32; Saves trained model as final_model_dawnn_rerun.h5). train_final_model_regen_seed_123_job_sub.sh Bash script to submit train_nn_regen_seed_123.py to SGE cluster.

Trained model

final_model_dawnn_rerun.h5 Final trained Dawnn model output by train_nn_regen_seed_123.py.

Identifier
DOI https://doi.org/10.5522/04/22633606.v1
Related Identifier https://ndownloader.figshare.com/files/40162366
Related Identifier https://ndownloader.figshare.com/files/40163188
Related Identifier https://ndownloader.figshare.com/files/40163191
Metadata Access https://api.figshare.com/v2/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:figshare.com:article/22633606
Provenance
Creator Hall, George; Castellano Hereza, Sergi
Publisher University College London UCL
Contributor Figshare
Publication Year 2023
Rights https://creativecommons.org/publicdomain/zero/1.0/
OpenAccess true
Contact researchdatarepository(at)ucl.ac.uk
Representation
Language English
Resource Type Software
Discipline Other