Software

 

Single-cell RNA-seq

 

Bulk RNA-seq isoform discovery and quantification

  • SLIDE (Sparse Linear modeling of RNA-Seq data for Isoform Discovery and abundance Estimation)
    • Citation: Li, J.J., Jiang, C.-R., Brown, B.J., Huang, H., and Bickel, P.J. (2011). Sparse linear modeling of RNA-seq data for isoform discovery and abundance estimation. Proc Natl Acad Sci. USA 108(50):19867-19872.
    • Important note: SLIDE is compatible with RNA-seq .bam files mapped by TopHat and TopHat2.
    • (Updates on Jan 30th, 2018 -- Several bugs have been fixed for RNA-seq .bam files with more than one read lengths.)
    • (Updates on May 7th, 2012 -- A feature was added for estimating the annotated isoform abundance without doing isoform discovery.)
    • (Updates on Apr 18th, 2012 -- A feature was added for removing erroneously mapped reads; a bug of multiprocessing was fixed.)
    • (Updates on Apr 5th, 2012 -- A feature was added for handling single-end RNA-Seq reads or a mixture of single-end and paired-end reads.)

 

Comparative genomics

 

Microbiomics

 

Classification

 

Association measure

 

High-dimensional model inference

 

False discovery rate control

 

Data

 

  • Estimates of D. melanogaster and C. elegans gene expression in different developmental stages, tissues and cells (in FPKM units)
    • D. melanogaster gene expression estimates in 30 fly developmental stages (download)
    • D. melanogaster gene expression estimates in 29 fly tissues and 19 fly cell lines (download)
    • C. elegans gene expression estimates in 35 worm developmental stages (download)
    • C. elegans gene expression estimates in 4 worm tissues and 14 worm dissected cells (download)
    • Please cite the following paper in any research that uses the above data
      • Li, J.J., Huang, H., Bickel, P.B., and Brenner, S.E. (2014). Comparison of D. melanogaster and C. elegans developmental stages, tissues, and cells by modENCODE RNA-seq data. Genome Research 24(7):1086-1101.
    • For more details about the data, please refer to the section "Estimating gene expression in developmental stages and tissues/cells" in the Methods of the above paper ([html] [pdf]).

 

  • Associated promoter and enhancer regions identified based on signals of three histone modification marks (H3K4me1, H3K4me3 and H3K27ac) in 16 human tissue and cell types (download)

 

  • Estimates of gene expression (FPKM) in various cell and tissue types from human, chimpanzee, bonobo and mouse
    • Expression estimates of protein-coding genes in human (download)
    • Expression estimates of protein-coding genes in chimpanzee (download)
    • Expression estimates of protein-coding genes in bonobo (download)
    • Expression estimates of protein-coding genes in mouse (download)
    • Expression estimates of protein-coding genes in pig (download)
    • Expression estimates of long non-coding RNAs in human (download)
    • Expression estimates of long non-coding RNAs in chimpanzee (download)
    • Expression estimates of long non-coding RNAs in bonobo (download)
    • Expression estimates of long non-coding RNAs in mouse (download)
    • Please cite the following paper in any research that uses the above data
      • Yang et al. Large-scale mapping of mammalian transcriptomes identifies conserved genes associated with different cell states. Nucleic Acids Research 45(4):1657–1672.
    • For more details about the data, please refer to the section "RNA-seq data collection and processing" in the Methods of the above paper ([html] [pdf]).

 

  • Data for the R package Clipper (download)