The dataset contains the supplementary material that was created to identify and investigate long non-coding RNA (lncRNA) in the whole brain tissues of crucian carp (Carassius carassius), i.e. the annotation of lncRNAs with the four algorithms (FEELnc, CNCI, CPAT and CPC2), the result of differential expression analysis of the identified lncRNAs, and the characterization of predicted interaction partners and results from analysis of their differential gene expression. All raw RNA sequencing data are deposited in the NCBI Sequence Read Archive (SRA) under BioProject ID PRJNA386629 (http://www.ncbi.nlm.nih.gov/bioproject/386629). The genome sequence and annotation data were obtained from DataverseNO (https://doi.org/10.18710/GXMSUH). The scripts are available in the GitHub repository LncRNA (https://github.com/MagdalenaWinklhofer/LncRNA.git).
FASTQC, 0.11.8
TrimGalore, 0.3.3
HISAT2, 2.2.1
SAMtools, 1.17
StringTie, 2.2.1
GFFread, 0.9.0
CPC2, 1.0.1
Biopython, 1.81
CPAT, 3.0.4
CNCI, 2
FEELnc, 0.2.1
Subread, 2.0.3
DESeq2, 1.40
goseq, 1.52