This RNA-seq dataset contains gene expression data from leaves and roots of tomato plants genetically transformed to express the four enzymes required for the biosynthesis of psoralen—a furanocoumarin not naturally produced in tomato—from p-coumaroyl-CoA. These transgenic plants are referred to as the "Psoralen Pathway" (PP) line. For comparison, RNA-seq data from non-transformed (wild-type, WT) tomato plants is also included.
Total RNA was extracted from leaves and root of 28 day-old plants, considering triplicates for PP and WT. After checking of the RNA quality (RIN>7), transcript data were obtained by RNA-Seq Illumina pair-end sequencing (2 x 150 pb) using the NovaSeq 6000 method for 30 M read pairs. The dataset contains 42 files corresponding to raw data.
In addition, raw data were cleaned using Trimmomatic (Bolger et al., 2014). The quality of the sequences was assessed using FastQC. The transcripts were mapped on the tomato genome (ITAG2.4) using the BWA-MEM software (Li & Durbin, 2009). Count tables were generated using the FeatureCounts software.
The data set contains also 12 files corresponfding to the count tables generated through this workflow.
Galaxy, 25.0
Trimmomatic, 0.39
BWA-MEM, 0.7.19
FeatureCounts, 1.6.4