Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

A reanalysis of mouse ENCODE comparative gene expression data

F1000Res. 2015 May 19:4:121. doi: 10.12688/f1000research.6536.1. eCollection 2015.

Abstract

Recently, the Mouse ENCODE Consortium reported that comparative gene expression data from human and mouse tend to cluster more by species rather than by tissue. This observation was surprising, as it contradicted much of the comparative gene regulatory data collected previously, as well as the common notion that major developmental pathways are highly conserved across a wide range of species, in particular across mammals. Here we show that the Mouse ENCODE gene expression data were collected using a flawed study design, which confounded sequencing batch (namely, the assignment of samples to sequencing flowcells and lanes) with species. When we account for the batch effect, the corrected comparative gene expression data from human and mouse tend to cluster by tissue, not by species.

Keywords: ENCODE; RNA-seq; developmental pathways; flowcells; sequencing.