Repository with the scripts used for the calculations in the manuscript "Disease coverage, overlap and divergence of human genome-wide association studies and pharmaceutical research and development". It could be used to revisit the estimates in the future.Overview of the scripts:- gwas_cat_mapping_code: includes the python code (.ipynb
) used to map the traits in the GWAS Catalog to UMLS terms as described in Materials and Methods.- gwas_clinicaldev_code: includes the python code (.ipynb
) used to identify the human diseases evaluated in drug development and in GWAS as s described in Materials and Methods. It also includes the code to generate Figure 1, Figure 2 and Figure 4 of the manuscript.