This repository contains all code, data preprocessing steps, and visualisation workflows used to analyse scientific literature related to CHO and HEK293 cell lines, with a focus on comparing publications that mention bioprocessing terms versus those that do not.Contents:Jupyter Notebook:The main analysis notebook includes:Designed for reproducibility and exploratory insight.requirements.txt:A list of all Python packages and their versions required to run the notebook. This ensures the analysis is fully reproducible in any compatible environment.README.md:Contains instructions for environment setup, data expectations, and a brief overview of each major section of the notebook.