A Database of Drosophila Genes & Genomes

FB2013_03, released May 7th, 2013
 

Reference Report

Reference
Citation Raychaudhuri, S., Chang, J.T., Imam, F., Altman, R.B. (2003). The computational analysis of scientific literature to define and recognize gene expression clusters. Nucleic Acids Res. 31(15): 4553--4560.
FlyBase ID FBrf0160870
Publication Type Research paper
PubMed ID 12888516
PubMed Abstract A limitation of many gene expression analytic approaches is that they do not incorporate comprehensive background knowledge about the genes into the analysis. We present a computational method that leverages the peer-reviewed literature in the automatic analysis of gene expression data sets. Including the literature in the analysis of gene expression data offers an opportunity to incorporate functional information about the genes when defining expression clusters. We have created a method that associates gene expression profiles with known biological functions. Our method has two steps. First, we apply hierarchical clustering to the given gene expression data set. Secondly, we use text from abstracts about genes to (i) resolve hierarchical cluster boundaries to optimize the functional coherence of the clusters and (ii) recognize those clusters that are most functionally coherent. In the case where a gene has not been investigated and therefore lacks primary literature, articles about well-studied homologous genes are added as references. We apply our method to two large gene expression data sets with different properties. The first contains measurements for a subset of well-studied Saccharomyces cerevisiae genes with multiple literature references, and the second contains newly discovered genes in Drosophila melanogaster; many have no literature references at all. In both cases, we are able to rapidly define and identify the biologically relevant gene expression profiles without manual intervention. In both cases, we identified novel clusters that were not noted by the original investigators.
Other Information
Language of Publication English
Parent Publication
Publication Type Journal
Abbreviation Nucleic Acids Res.
Title Nucleic Acids Research
Publication Year 1974-
ISBN/ISSN 0305-1048
Data from Reference
Genes (32)